Monarch geneset OGS2.0

DPOGS212653
TranscriptDPOGS212653-TA5304 bp
ProteinDPOGS212653-PA1767 aa
Genomic positionDPSCF300319 + 137821-146284
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0136380.056.93% 
BombyxBGIBMGA013949-TA0.047.03% 
DrosophilaCG15080-PA2e-2646.76% 
EBI UniRef50UniRef50_D6WUE11e-3240.18%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WUE1_TRICA
NCBI RefSeqXP_973151.12e-3340.18%PREDICTED: similar to CG15080 CG15080-PA [Tribolium castaneum]
NCBI nr blastpgi|910835293e-3240.18%PREDICTED: similar to CG15080 CG15080-PA [Tribolium castaneum]
NCBI nr blastxgi|1700454677e-6424.27%conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
Orthology groupMCL26151 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212653-TA
ATGTCACGATATCTCAATAAGTGTAGTGTCCCTGTCGTTGTATCCCCAGAGCCGGAAATGGCACGCGTCGTGCTAGCTCTTCTATTTATCATCACTGTCGTCTCGGCAAAAAAGGATCAAACGAGGTGGTCTCGACAGATCAGCACTTACACATCAGATATCAGTGATTGGGTTCCTCTCCAAGGGCCTGAATTATCCCAATTCAAAAGACAAGCAGTCGCCGAGCCCAGAGTATTGAGTGAACCGTTCGCAGGATTTACAAGACCGACTGGATTTAGTGCTGAAGACTTCCAATCGAGAGCTTTTTATACCGCTCCCGTAAACCGTCATCTTTATGTTCAATCTATACCTTCGGCATCCCAAAACTATCTTTCAGAACAAGGGTTTAACCAAGGAATGAGATTAGGACTAACTCAACCTAATTACCCGCTCAGCCAGGGTTTTATAGCGCCCCAGTTTAGTTTCGACGGAATTCCACAATTTAAAGCGAGACCTCAAATTGTTTCTAATCCGATCAAATTTGATAATTCTCCCAAAGTCAAGGTACCTGCGCCGAATCCTGTGAAACAATTTGCGATTCAGTCACCCAATTTAAAATCAGAAGCACCAAAATTCGTAGACGGATACCGCTTAGAAAACAACAGTTCTAATTATCTATGGAAAAAGCCACAATCATTACAGAAAATTAAATTCGAAACAGATAAATCCAAAGAAGTTCTTGGCTCTGCTAAAAACAATTTAGAAAGAGAAGAGGTTCAATTACTTTACGTTCCACTCGAATCCTTAAACAGGGGTCAATTCAACTTTAGGAGTCCTCTAACGTCACCACAGATTTTGAATACCGAACTTTACAATCAACATACTAGAACAAATACTGCATCACAGAATATTGGATCAGAACTACAGAGTTCAATAATGAAACAGTTGGAAAATTTTAATTCTCAAGAGTACATGAACAATTACAACAACCCGATTCAGGAAATAGATCCTATTGCCAGATTTTCGACAACCACGACACCGTCCCCTAGCACGTCGCCTACAACTCCTAAACCGAAAAAATTAAAACCTCATCAACCGCCTCTTGCAATCTTTTTGACTCAAAATGGAAAACAAGATGACAAAATAAAAGTTGGGGACGTTTTGTATTCTTTGAAAAACGCTGATACTGTTGCGGTGTTAGATTCAGTGAATCCTTTAAATGCGCCCTCGGTTTTCATTGGACCAGCTTCGTTGACACCCCCAGAACATTTTGTCAAGTTTGAATTACCTTATTTGTCTAACATAGAAAATAGCGATAAAAAACTAAGACAACTACCATTCTTCGTGGCACCTTTAAGTTACAATACGCCTCAAGGATTTGCTAAAATTCCATTCCCAGCCCCACACGTGGGATCTGTTGTTATAAATTCTCAAATGAAGGAAACTTCAAGCAAGGCCATACCAACACCCGAAGTTTATACAAATTCCTTTTCGAGGCCAACAATTTACAGACAAGAGCAAAAACCAGCAACTCAAAAGCCTGTTATAAGCTACTACTCTACATCAACACCAAGTGTAAATTCTCCAAACTATGAACAAAATTATTATTCAATTGAACCACAATCAGTTAATTCATTACCAACGCCCAAGGAACCAGAACCTAAATTCGTCACTCAGCAACCGGCCCCAATAAAAACTGGATCATATTTCTTAAACAACGTTGGAAATCAACAGCCCTATAATCAATACAACCAGCAACAATATGTCAACTTCCCTCAAGAAAGTAATTTTAAAACGCCGGAGCCTCCGAGGACAGTTCGAGTGTATAAAACTGAATCAGTTACCAGCCCACGAACAACAGTGTCGACGACTACACAAACGCCAACCTACTCCAGTCAACTCTTAGAGACACATAATCCATATTCGATCAACCAGGCCTTCAGCTTAAGCACGCCGCTAGATTACCATAACTTCTTTGATGAATACAAAGAGACTTATGCAACTACTTCGCCTGACCAATCACCTCCAGCTTCTCAAATACCAGAAAATGTTGAACCAGCTAGTGGTAGTCCCAAAACACCTGAACAGCAAATTCTTAACGAAGAATCACATCAATCGTCACCGTCACCGGTACAAGGACACCAGCAAATGAGTTTCTATGCTCCTGAAATTCATAATTTGCCTGAAAATCATAATCTACGGTACCCTGTATTTAATACGAATTATTATTCTACTAAAACTGAGACACCTCCTGAAACCAGTTATACCAATGCCGCGGAACAATCAAATGGAAATACCCAAGAGATTAATAACTTCCAGACAAATAAAGAATATAGTAACCAATTTGAGTCCGAATCGCCTCCGTCTGGTAATTATTCTTACACATCTTCAACCAACGAAGAAAGTTTACCAGAAGAGATACAGACTCAGAAAGTTGCTCCTTCATATAATCAATATGATATTAATAATGACTCCCATGAAACAATCAGCTTAGATAGTAGCAGAACCAGCACCACTTCCAGTACGACTACAACAAGAAGGACAACTCTCAGATCCAGAAACCGTCCACGATACTCTTCACCTAAACCAGACTACAACGATTATTCAACTCGATCGACGATCACCCGAAGACCACTAAGAGAAAGAAAACCGCTACCCTCGAGACCTCGATACGAACCAAATAAAATAACGACAGAAAGACAAACTAGAAAACCAATTGATCCCATTGAAAGTACAACAAAATCATCGAGATCTAGAACCAGGGGAAGAATACAATTCAAACCTTCTGATTCGGATGAAATATTTATAAAACGAAATAAACAAGGCAGTACAGAACAAGATTTGGCTTATCAAAGAGATGTATTGCATCAGAATTACCCCGTAACATTAATGGAACGTACGAGCACGACAGACATTGAAGCCATTACTGAACCCACATATACAGCCAACATTCCCAACAACCAACAAAATCATAAGATAGAGGATACAGCAGATGCTTACAGTAGTGACAAAATCTCTATAACAGATAATGTCAACGAGGAATTAAAAGACCCTGTTCCTCAATACACTACTGCCGAAGATTCACAAACTGGAGCTTCGTATGTACCCAAAACACCCAACCGCGAAGAAGACTTTTCCTATCAACATAAAACCAGTGAATTTCCACAAGAAGCTGCTTCACCATTGGAAGAAGAAAACCAAGAAGAAGCATCCCCACAAAAATACACAGAACATAATCCGCAACAAGACGATAACTTTGGAGCGACAAACCCAATTACATCAGTTACAGAACAAAACGTTCAAAATGAGAATCAGGAATCAGAGAACCAGACTGAAAAATCAGAAGAATTTATAACCCAAAATGAAAATGAAGAAAATACAAAAAGCCATAATCATGAATTAAACCCAGAACAAGAGCAGGAAGAAAATATTATTGAGACAACACCATCATATAACAGAGTAAGAATACGTCCAGGAGTAGTACGAAAATATCACCAAGGACTATCAGAATCTTCAAAAAATAACGTCGACAGGAGAAAACCGCAACAAGCTATAACATACCGACCAGCATTTGATAGACGCCGAACTACAATGAGAATTGAAGAAATCGAAGCTGATCTAAAAACAAAACAAATTCATTCCCGACCCGGCTTCCAAGATTACAGACAACCTGTGTATAAGCCTGAACCATCAACTGAACTAAGCTCTACATCTACTACTGAGGCTACAAAACGAGAAGACCTTATGAAGCGAAAAATCGGTTTAGAGGTAGGAGACCGACTGAGAAACCCACAGAAAAACCAGACACGCAAAGTGATCTACCAACGACTACTGTATAGCAGCAGGCCAAGGTTCTCAGAAAGATACAACAAAAACACGGAGGAACCAACGGCTGAAGACCAAGACTCCAATTATTCGATAACTATACCGAGATATGTCGAAGCTAATACTAACGAAGATTCAAATCGTTGGTCACCGAAGATCTCGCAGGATTCTTTCAAACCGTTCAATCCAAATAACATAGCAGATGAAACAAAAATTGCTACAGAAAAGTCCAAAGATGAAGAACTAGATATAATAACGGCAAAAAATGAATATGAAGATATACTCATTTCGGTGACTCCAGCGACAAATAACAGACTTAATAAAAAGTTACCCGACATTCCTCCAACACTCGAAGCTTTCGTCGAACAAAGCAAAATATCTAAAAGCGACTCTAGTGAAGCAGCATCCACCTTTGAAACAATGTTAGAAGAGGTGATGAAGAGTTTAGAAGAACAAGACGAAAATGAGTACACGAATAAAGTAATGAAACACAAGGGCGGAGAAATAGGGGAAATACCTCCTGAAATAATAATATCATCAGGAGAAAACTATTCCATTAAAACAACGACACCCTCACCAGAAGAAACAACCAGTTCACTACAAGACACTTCCGCTAGTAATAATGAGAACCTAGATGGACGGAAAAGTCGTCGTCGTGGATTCTGGAAGAAGGTCAAAGTTCGTCCTGTGTCAGAGAGCATTGATGTAGCCGAATCACAATACTATTCAAAGATTGTGAACCATTTAGGCCAAACAGTCTCCAAAGAACCTTTGGAAAAGAATGGAAAAGGAAATACTAAAGTGGTGGTGACGACTTACAAACCTAATTATGAATTCTTAAAAGACTTCTTTGAATCGGAGGAGGATCACGACAATATACCTGACATTGAGATAACCAAAATAATTGAAAATAATACCGAGAAAATTAAGAACGTTACTACCAAGACTGATTTTATTGAAAGAACGACAGAAAGAATGAATCCTGGTGAAATAGATTTAGGCACCGGCGCCCCAGATCGTACTTTTGCCGACAGCTCAGTTTACACAGAGGCCACAGAACCCACTACAAAATCGATAAACCGCTCAGACGGATTCAACTTTATGAACTATCTCTTCGGTACAACATCTTCGGACGAGGAATCCAATAACAATTCAAAGATAAAAAATGAAGATGTCAAGACCACTATCCAAATGCAAAGTGACACGGAAACGGAAGTTGCTAAAATAAAAACTACAACTGAGAATGTTTATATGCCGGATGAATTCACAGCCGACTCAACACGCAACGGTGACGCGGTGTTTGTGACGGAAAAATTACCTTATGAAATAGATCTCAAAGTTACGGAGAGAGATTTAGAACAACCAGAAAGTTCATCGCAATCTAGTTTCATGAACCCCGCTAACGTCCTCAGTACCTCAATGTCTACAGAAATATCCCACGAGACGGAGATATGTTTTAGAGGCAAATGTATAAAAACCAATAAAAATGTGCTATTATAA

Protein sequence:

>DPOGS212653-PA
MSRYLNKCSVPVVVSPEPEMARVVLALLFIITVVSAKKDQTRWSRQISTYTSDISDWVPLQGPELSQFKRQAVAEPRVLSEPFAGFTRPTGFSAEDFQSRAFYTAPVNRHLYVQSIPSASQNYLSEQGFNQGMRLGLTQPNYPLSQGFIAPQFSFDGIPQFKARPQIVSNPIKFDNSPKVKVPAPNPVKQFAIQSPNLKSEAPKFVDGYRLENNSSNYLWKKPQSLQKIKFETDKSKEVLGSAKNNLEREEVQLLYVPLESLNRGQFNFRSPLTSPQILNTELYNQHTRTNTASQNIGSELQSSIMKQLENFNSQEYMNNYNNPIQEIDPIARFSTTTTPSPSTSPTTPKPKKLKPHQPPLAIFLTQNGKQDDKIKVGDVLYSLKNADTVAVLDSVNPLNAPSVFIGPASLTPPEHFVKFELPYLSNIENSDKKLRQLPFFVAPLSYNTPQGFAKIPFPAPHVGSVVINSQMKETSSKAIPTPEVYTNSFSRPTIYRQEQKPATQKPVISYYSTSTPSVNSPNYEQNYYSIEPQSVNSLPTPKEPEPKFVTQQPAPIKTGSYFLNNVGNQQPYNQYNQQQYVNFPQESNFKTPEPPRTVRVYKTESVTSPRTTVSTTTQTPTYSSQLLETHNPYSINQAFSLSTPLDYHNFFDEYKETYATTSPDQSPPASQIPENVEPASGSPKTPEQQILNEESHQSSPSPVQGHQQMSFYAPEIHNLPENHNLRYPVFNTNYYSTKTETPPETSYTNAAEQSNGNTQEINNFQTNKEYSNQFESESPPSGNYSYTSSTNEESLPEEIQTQKVAPSYNQYDINNDSHETISLDSSRTSTTSSTTTTRRTTLRSRNRPRYSSPKPDYNDYSTRSTITRRPLRERKPLPSRPRYEPNKITTERQTRKPIDPIESTTKSSRSRTRGRIQFKPSDSDEIFIKRNKQGSTEQDLAYQRDVLHQNYPVTLMERTSTTDIEAITEPTYTANIPNNQQNHKIEDTADAYSSDKISITDNVNEELKDPVPQYTTAEDSQTGASYVPKTPNREEDFSYQHKTSEFPQEAASPLEEENQEEASPQKYTEHNPQQDDNFGATNPITSVTEQNVQNENQESENQTEKSEEFITQNENEENTKSHNHELNPEQEQEENIIETTPSYNRVRIRPGVVRKYHQGLSESSKNNVDRRKPQQAITYRPAFDRRRTTMRIEEIEADLKTKQIHSRPGFQDYRQPVYKPEPSTELSSTSTTEATKREDLMKRKIGLEVGDRLRNPQKNQTRKVIYQRLLYSSRPRFSERYNKNTEEPTAEDQDSNYSITIPRYVEANTNEDSNRWSPKISQDSFKPFNPNNIADETKIATEKSKDEELDIITAKNEYEDILISVTPATNNRLNKKLPDIPPTLEAFVEQSKISKSDSSEAASTFETMLEEVMKSLEEQDENEYTNKVMKHKGGEIGEIPPEIIISSGENYSIKTTTPSPEETTSSLQDTSASNNENLDGRKSRRRGFWKKVKVRPVSESIDVAESQYYSKIVNHLGQTVSKEPLEKNGKGNTKVVVTTYKPNYEFLKDFFESEEDHDNIPDIEITKIIENNTEKIKNVTTKTDFIERTTERMNPGEIDLGTGAPDRTFADSSVYTEATEPTTKSINRSDGFNFMNYLFGTTSSDEESNNNSKIKNEDVKTTIQMQSDTETEVAKIKTTTENVYMPDEFTADSTRNGDAVFVTEKLPYEIDLKVTERDLEQPESSSQSSFMNPANVLSTSMSTEISHETEICFRGKCIKTNKNVLL-