Monarch geneset OGS2.0

DPOGS208667
TranscriptDPOGS208667-TA1752 bp
ProteinDPOGS208667-PA583 aa
Genomic positionDPSCF300281 + 347403-369446
RNAseq coverage378x (Rank: top 32%)
Annotation
HeliconiusHMEL0117332e-3580.72% 
BombyxBGIBMGA007764-TA2e-8179.65% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020648D51e-3631.17%UPI00020648D5 related cluster n=4 Tax=unknown RepID=UPI00020648D5
NCBI RefSeqXP_971339.26e-5735.43%PREDICTED: similar to interleukin 1 receptor accessory protein-like 2 [Tribolium castaneum]
NCBI nr blastpgi|1892373771e-5535.43%PREDICTED: similar to interleukin 1 receptor accessory protein-like 2 [Tribolium castaneum]
NCBI nr blastxgi|1892373771e-5435.43%PREDICTED: similar to interleukin 1 receptor accessory protein-like 2 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[37-129] IPR0137831e-10Immunoglobulin-like fold
[53-125] IPR0130989.1e-08Immunoglobulin I-set
[139-239] IPR0035991.5e-06Immunoglobulin subtype
Orthology groupMCL15081 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208667-TA
ATGTGGATCCTAGTGGCGCTGGGTGTGGTGACCGGCGCTTCGGTGGTACGTTCAGCCGGCTACTGCTCCACAAATACCTTCCCCACGAACACATCCTCGATGCACTTCAGCAAAGAACCAGTGCCTTATGAATACGGCTTCCAGGACAAGTTTAAGAGCATACACTGCTGCGTCAAAGGATACAGGAGCATCGAATGGTTCAAAGACGGTGTGGCGTACCCGTGGTCAGCGGGAGTGTCCAACCTGATACTGTATCCCGAGGCAGCCAATCAGACGCTGTACACCAGACGGGCAGCGAGATCCGACTCGGGGAACTACACGTGCAAACTGAGCAACGAAACCCACAGCGAAACACATACAGTCCGATTGAATATTTTAGAAAAGCCTACGGACGCGCCTAAAACGATGTTCATATCGAAAGACCAATGGGTGGAAGAAGACAGCGAGCTGCGTTTGTTCTGCGAGGCTCTCATCGGTCGTCCATATTTAGCGGATGCTGTGCGCGATCTTCGTTGGAGGAAGGTTTGGCCGAATGGTACCGAGGGAGATTTATCTCCAACTCAGACCGAAATCAAAACAACGAGGGAGGACGTGGAGGATATCATCGGCTCCTACCTCACCATCTCCCGGGTGTCGGCGCATGATTACGGAACATACGTGTGTGTGGTTCACAGTAACGACGTGGTCGCCAGGAACTACGTTACAGTACACTATAAATTTCTGGAGAAGGAGTTCGACGTGCTCGTCTGTTGGACCGCCGTGGACGGGGAGCTGGTCCGCGGCGCGCTGCTGCCGACCCTCGCGCTCAAATACAAGTACAGAGTACACACGGCGCCGCTCTCCACCACGCCCGACAACTGGTACAGTTCCCTGGTGTGTGAGGTGTCCCGCTGTCGCTCGCTGGTGGCGGTGCTGTCTCCGTGTCAGTACTCCCCTCAACAACTGCTGACGGCTCTCAGACAGCTGCGAGCGCTGCCCGTGCCGCCTGTCGTGGTCCTTCTACAGAGTGAAACGTATGATAATGATATGTCCAGGGAGGACGTGGAGGATATCATCGGCTCCTACCTCACCATCTCCCGGGTGTCGGCGCATGATTACGGAACATACGTGTGTGTGGTTCACAGTAACGACGTGGTCGCCAGGAACTACGTTACAGTACACTATAAATGTGGTAAGTTTGATATTGCCTTGTGCAGACGCTGGGGGGTCGGGGTGGTGCGGCGGGCCCGGGGTGCCGTGGCGCGCGGTGGCGGCGGGCGGTGCTGCCGGCGCGCTGCCGCCGTGGACGGGGAGCTGGTCCGCGGCGCGCTGCTGCCGACCCTCGCGCTCAAATACAAGTACAGAGTACACACGGCGCCGCTCTCCACCACGCCCGACAACTGGTACAGTTCCCTGGTGTGTGAGGTGTCCCGCTGTCGCTCGCTGGTGGCGGTGCTGTCTCCGTGTCAGTACTCCCCTCAACAACTGCTGACGGCTCTCAGACAGCTGCGAGCGCTGCCCGTGCCGCCTGTCGTGGTCCTTCTACAGGACCTGCCCAAGCTGAAGCGAGAGGCGAAAGAGAGCGGAGAGAGTCTGGTGGAAGTGCTGCGAAGGACGCGCCTCGTCGCCTGGAGACACGTGCACGAGCGAGCCTTCTGGACGCAGCTGCGACTGGCCCTGCCTCTCCCGCCGCCGAGGACGCAGGAAAAGCACGTTGTCGAGACCGAGGAGAGCAAAAACTCCCGTTCGGGGAGTCTGACCGCGCTCGTGTGA

Protein sequence:

>DPOGS208667-PA
MWILVALGVVTGASVVRSAGYCSTNTFPTNTSSMHFSKEPVPYEYGFQDKFKSIHCCVKGYRSIEWFKDGVAYPWSAGVSNLILYPEAANQTLYTRRAARSDSGNYTCKLSNETHSETHTVRLNILEKPTDAPKTMFISKDQWVEEDSELRLFCEALIGRPYLADAVRDLRWRKVWPNGTEGDLSPTQTEIKTTREDVEDIIGSYLTISRVSAHDYGTYVCVVHSNDVVARNYVTVHYKFLEKEFDVLVCWTAVDGELVRGALLPTLALKYKYRVHTAPLSTTPDNWYSSLVCEVSRCRSLVAVLSPCQYSPQQLLTALRQLRALPVPPVVVLLQSETYDNDMSREDVEDIIGSYLTISRVSAHDYGTYVCVVHSNDVVARNYVTVHYKCGKFDIALCRRWGVGVVRRARGAVARGGGGRCCRRAAAVDGELVRGALLPTLALKYKYRVHTAPLSTTPDNWYSSLVCEVSRCRSLVAVLSPCQYSPQQLLTALRQLRALPVPPVVVLLQDLPKLKREAKESGESLVEVLRRTRLVAWRHVHERAFWTQLRLALPLPPPRTQEKHVVETEESKNSRSGSLTALV-