Monarch geneset OGS2.0

DPOGS215723
Transcript: DPOGS215723-TA (3453 bp)
Protein: DPOGS215723-PA (1150 aa)
Genomic position: DPSCF300041 + 286108-290820
RNAseq coverage: 1391x (Rank: top 9%)
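The genomic position field can be split into scaffold, strand, and coordinates. The following is a minimal sketch, not part of the original record: `parse_position` is a hypothetical helper, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(value):
    """Split a 'SCAFFOLD STRAND START-END' string into its parts.

    Hypothetical helper for illustration; the field layout is inferred
    from this record only.
    """
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", value.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {value!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("DPSCF300041 + 286108-290820")

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(scaffold, strand, span)
```

Under that assumption the locus spans 4713 bp on the scaffold, which is longer than the 3453 bp transcript.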
Annotation
Heliconius        HMEL009648         0.0     57.14%
Bombyx            BGIBMGA005815-TA   0.0     52.66%
Drosophila        CG9715-PA          1e-34   39.31%
EBI UniRef50      UniRef50_Q7PP02    2e-37   37.44%   AGAP006128-PA n=2 Tax=cellular organisms RepID=Q7PP02_ANOGA
NCBI RefSeq       XP_316189.4        3e-38   37.44%   AGAP006128-PA [Anopheles gambiae str. PEST]
NCBI nr blastp    gi|158295392       6e-37   37.44%   AGAP006128-PA [Anopheles gambiae str. PEST]
NCBI nr blastx    gi|345493249       9e-41   23.68%   PREDICTED: hypothetical protein LOC100678029 [Nasonia vitripennis]
Group
KEGG pathway: aga:AgaP_AGAP006128 (8e-38)
    K12597 (AIR1_2) maps -> RNA degradation
Orthology group: MCL25175, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215723-TA
ATGGAAGAGGAGCTTAGCGAAAATGAGCTGGAGGAACAAATGTATGCTATGATACACTATGTTGACGATACGCAATCAAATGTCAACTCAAACCAAAACGATAACAACATTGTTGAGAATGTTCCTCAGAGTACCGTACGTCGCTACTGGCGTACTAATGTAGACCAAAACACACCTTATCAGAAAATAAACACACCTAAAGATTCTACTAACAACAAAGAAACAGGTGAAAAGAAAGATGACAAAAGTAAATCCTCAGACCAAAACACATCAGATTTGTCTCTTTTTCAACAACCGGTACCCTCGAATGTCAAGAAAACTGTAGAAATATTGGAAAATGATGATGACAAAAATATAGTGGAACTCGAAACAAGCGACGAAGATGAAGTTATTGAAGTGGCACTTCCACCCAAACCCACCATCACCATTGAGAGTTCAGACGAAGATGATGTCTGTCCAGTTGATCCAGAGCCTGACATTAAACACAAGCCAACAAAGCCCACACAGGAAATTAAAAACTCAGTTGACAGAGAAGTCACTACCAGTCCAGTACCATCCGTGGTGTCATCTATCTCAGATGACTTCATAAGAGGAGACTGCATCGCACTCAATATATCATCAAAACATCCAAATAACCAAAGCTTTGATTTCAGTCTTCATGGCTCCGATCTTCTCGACCAAACACCATCGAAGAAAAAAAAGAAAAAGAAAAGCAAAGAAAAAAATACACCATCATCAGTAACAACGCCTCTTTCAGTATCAACTCCCGTGAGCTCAAAGCAAACGGCTGGTGCAGTTGATGAATGTTTTGCTACTCCCAAGAGCAAGGCCAAGAATAAACGCCAAAGGACAAAATCATATCGAGTTTCAGAGAAAAGTTTACCAAATGCTGACGTGTATGACTCAGACAGCAACCAATCACTGAATGAGAGTAACAAAAACCAGATGACATATGAGGTTACCGACAAAAGTGTACCAAGCACTGATGTCTACGAGTCCGATTCCAATCCATCAGAAAATGCTACAGAATCTGTATCTAAAGAGGCTATTAATGACGCTGAAAGCTCAGAAAGCACAACGGAGAGCCCCGTTGTTGAAATAGTCAAAACATCAAAAAATACTGACATTTCCAATAAGTCCGTAGTAGACCTCACGGAACCAGCCATGAACACAAGTATAGACGAAAACATAGTGATGGGTAACGTTACGGGATTCACAAACATGGAAGAATTCAGTGATCATGATATTTCAGTCAAAGACATATCCAAATGCGGCTCAACTAAAATACCAGCCATCCTTAATGAGGATCTCGATTTTGACAATCTCAAAGGCAGCAACAAAGTGTGCAAACGACGACGATATTCACTAACTACATTGCGAGCCGAAATGGAAAAGTTCTACAACGAAAGCTGGGGAGGAGAGGAATTCAACCATCGGGAGATACAGAAGAATATGTCACGTGACAAAAGTTTGTGGGTAATTGATGCAAAGGACCGTATGCCATCGCTGACTAGACGAAAGACCACATGCAACTACTGTAACCGCGCCGGCCACCGCGACGACGCGTGTCACTTCAAGCCGCCCGTGTGTTTCATGTGCGGCGACGCGGGACACTACGAACCGCGCTGTCCCAGGAAGATATGCGTCAATTGCGGGTCACCCAACTACGTGTACTCCACGATGTGTCGGAACTGCTCCACGTGGAAGTGCATCAAGTGTGCGGAGTGCGACCAGAGCGGTCACCCGGCCAGCCACTGCCCGGACGTGTGGCGCAGATACCATGATACCTTGTCGTTGGAGACTCCGTTGGAAGAGAATCGTCAAACGAAGAAGAATCACCAGATGTTCTGCAGTGGTTGCACGCGCCGTGGTCATTTAGTCCACACCTGCCGTCTCTCTCTACCGTTCTCAGGCCTGCCGATGAACTCCCCATACGTCTCGGTCTACCGACCCGTCTACCAGATGCTGGACACTAACAACCAAAGTAACGATATCGGTAA
CAAGAAATTTAAGAACAGGAATAATTCTGAAAATTCCTCAACGATCAGACAGGACAGAATGAAACGACAGTCCAAGTCGCCGACCACCCACGATTCACATCTCAACAAGAAACGTAATATGGGTACCATTGAAGTTGAAATAAGCTCAGGAAACAAGTTTCCTACGGGGAATCAAAGGAAAGTTATAATCTCTGAAGAAAATCCCAATAACAGCAGCAAAATTATCACAGAAACAAACATCAATAAAAAATCCACCGAAGTTCAAAGTACAGAGAGGGCTCCAGACTTTATACCGATAACATCATCAGACAATCGAGACAAGAGGGGACAAATAATACAAGACAATGAAGTGTCGGACACGAGCGAGGTCATCACATCCGCGAGGGTCTACATCACCAAGGAGATAGCGGATCTCTTAATGACAGATGAAGGAAGCCTGTGGCTCAACACGACCATCAAAAACAACGATCTGATATTGGAGAATGACACCATAACATTCTACCTGAGCATCAACGGAACAGTCGGCAACCAGGAGGCCTTCCAAGCTGAACTGGGAGAGTGGATCAAGAAGAAACAAGCCGGCAGAGAGAAAGAACGATTTGTGTCCGAGAGTGAAACCGACGTTACCCAGGAAGGTACAAACGATCAGCAATCGTTGACGAATAACATACCCAAGAACAGAAACAACGCTTTGCGCAAACTGAACAAAGCCTTCGATTCATTAAAGAAAGATCTGGGAGATCCGAAGACCATTTATAAGGAGCTGACGTATTTGCAAAATAAACATCAGCAACTTATAAACCAGAAAGTCATAAGCCCCAAAAAACTGTCCAACAACAGGGACAATATTAATCTGATGCTGAGAAAACTTAATATGGTACTTCTCGGGCAAGCTGGTCTAGCGGACGGCTCCACACATTTAAAGGAACTGTACTCCTTACAAGAGAAACTAAGCAATTTCAGGCAGAAAAATATACCGACGTCGCTGCGCGAGGAAATCGGTGAGCACTTTCATTGCATCTTCGCTGCGATACCCAGGGATGATTACATAGAACTCTTAAGTAAATTTTACAATAAACCGGTCATAACGTTCAAGAAGAAAAATGATAGGTCCTTCAAAGTCAGTCCGAAGCCAAACCAGAAGACGTTGAATCCGATCCAGAACATACAACGCAACGTGAGCGGCGTGAAGGATGACACGAAGGAAAACAACGTGGCCAACGACACGTCCCAACTGACCGCGGCGACCAAGAACAAGCTGGTGTTCTATCACAGGCGGTTGCTGCGCTCGCGACCCATGGACGCGGTTCTCAAGAAGACAAAAAGCGAACTGCTAAGGAAGCTCCACTTCAATCTCGCCCTATTAGGCGACAAGGCTCATATATCTTCGAAGGCTCTGAAGAAAATGAGAAAGATTCAAGAGCAGGCCCAGCTGTTCTTAAATAACTTTTAG

Protein sequence:

>DPOGS215723-PA
MEEELSENELEEQMYAMIHYVDDTQSNVNSNQNDNNIVENVPQSTVRRYWRTNVDQNTPYQKINTPKDSTNNKETGEKKDDKSKSSDQNTSDLSLFQQPVPSNVKKTVEILENDDDKNIVELETSDEDEVIEVALPPKPTITIESSDEDDVCPVDPEPDIKHKPTKPTQEIKNSVDREVTTSPVPSVVSSISDDFIRGDCIALNISSKHPNNQSFDFSLHGSDLLDQTPSKKKKKKKSKEKNTPSSVTTPLSVSTPVSSKQTAGAVDECFATPKSKAKNKRQRTKSYRVSEKSLPNADVYDSDSNQSLNESNKNQMTYEVTDKSVPSTDVYESDSNPSENATESVSKEAINDAESSESTTESPVVEIVKTSKNTDISNKSVVDLTEPAMNTSIDENIVMGNVTGFTNMEEFSDHDISVKDISKCGSTKIPAILNEDLDFDNLKGSNKVCKRRRYSLTTLRAEMEKFYNESWGGEEFNHREIQKNMSRDKSLWVIDAKDRMPSLTRRKTTCNYCNRAGHRDDACHFKPPVCFMCGDAGHYEPRCPRKICVNCGSPNYVYSTMCRNCSTWKCIKCAECDQSGHPASHCPDVWRRYHDTLSLETPLEENRQTKKNHQMFCSGCTRRGHLVHTCRLSLPFSGLPMNSPYVSVYRPVYQMLDTNNQSNDIGNKKFKNRNNSENSSTIRQDRMKRQSKSPTTHDSHLNKKRNMGTIEVEISSGNKFPTGNQRKVIISEENPNNSSKIITETNINKKSTEVQSTERAPDFIPITSSDNRDKRGQIIQDNEVSDTSEVITSARVYITKEIADLLMTDEGSLWLNTTIKNNDLILENDTITFYLSINGTVGNQEAFQAELGEWIKKKQAGREKERFVSESETDVTQEGTNDQQSLTNNIPKNRNNALRKLNKAFDSLKKDLGDPKTIYKELTYLQNKHQQLINQKVISPKKLSNNRDNINLMLRKLNMVLLGQAGLADGSTHLKELYSLQEKLSNFRQKNIPTSLREEIGEHFHCIFAAIPRDDYIELLSKFYNKPVITFKKKNDRSFKVSPKPNQKTLNPIQNIQRNVSGVKDDTKENNVANDTSQLTAATKNKLVFYHRRLLRSRPMDAVLKKTKSELLRKLHFNLALLGDKAHISSKALKKMRKIQEQAQLFLNNF-
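The transcript and protein records above can be checked against each other. The sketch below assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon; `translate` is a hypothetical helper written for illustration, not a tool used to build this geneset. It translates only the first and last few codons of DPOGS215723-TA and confirms that the stated lengths agree.

```python
# Standard genetic code, built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    seq = nucleotides.upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if residue == "*" else residue)
    return "".join(residues)

# First 27 nt and last 12 nt of the DPOGS215723-TA record.
print(translate("ATGGAAGAGGAGCTTAGCGAAAATGAG"))
print(translate("AATAACTTTTAG"))

# Length consistency: 1150 residues plus one stop codon, three bases each.
print(3 * (1150 + 1) == 3453)
```

The two fragments translate to the start (MEEELSENE) and the end (NNF-) of the DPOGS215723-PA record, and 1150 codons plus a stop codon account for all 3453 bases of the transcript.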