Monarch geneset OGS2.0

DPOGS211197
TranscriptDPOGS211197-TA3483 bp
ProteinDPOGS211197-PA1160 aa
Genomic positionDPSCF300007 + 816483-824618
RNAseq coverage235x (Rank: top 43%)
Annotation
HeliconiusHMEL0124510.087.94% 
BombyxBGIBMGA003185-TA0.077.00% 
DrosophilaCG2225-PE5e-3535.98% 
EBI UniRef50UniRef50_D6WKA81e-10345.88%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WKA8_TRICA
NCBI RefSeqXP_970727.16e-10445.79%PREDICTED: similar to AGAP009860-PA [Tribolium castaneum]
NCBI nr blastpgi|2700071614e-10345.88%hypothetical protein TcasGA2_TC013697 [Tribolium castaneum]
NCBI nr blastxgi|2700071610.040.98%hypothetical protein TcasGA2_TC013697 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL17519 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211197-TA
ATGTTGCTTTATGATCGCGGGTGCGATGACGTTCAGCGTTCTCTGGAGCTCCTGGACCAGGTGTTAAGTGAATACGACGAGGGTGAACCGCGATCGTTGGAGGCGCGTGCGACGCCCGCCACACCAGCGACACCCGCCACACCAGCCACGCCCGCCACTCCAGCCACAGACGACGACTCGCCGCTCGCCGGTCACACCTCCGAGGACGATGGATACATGAGCATGAACGGCAGACGGGCGAAATTCGCGCTGGGCTTCCGACCAACTGAGGAGCGTGAGGAGGGATCGCCTCCGGACCCTCCATCACCACACTCCCCACCACCGCCTGAAGAAGCTCAAAGAATTATATCAACACTTTTACCTAAGGTGTCTCCCACAAATTCTGCAAAACATACAAACTTATTCAGCCATTCCGAAGCTGAAGAGAAATTGGACTCTATTATTAGCGTCAACGGAATTACAACTGCAACACAGACGACTTTTCCAAAGACAAGGCACCAGCGTCCTTATGGATGGGACAAAGAAATAGAAGCGAATGGGTTTCAGCTGCAGCAGGCTCCGGTTCCTCCTTACAAATTTTCATCGTTACAGCATGGATCCAGGCCGCCGCAAACTGAGATTGCTCCCACGTGGCTACCTCCGCGCCACTCAGCCCCACCGAGCAATCAAAAAAAGTCACCTAGTGTTGAGAACCCTCCTGTATTTCCTTTTATGGGGTTCTCGAGTGATTTCACTGACAAACCGTATAACGAGCATCCCGTGACCATTTATCCTGGACCAAATATCCAATACAGTCGTCAGTCACACTCACCAACTTACAAAATATCTCCTTCAAATTCCAAGAGTATTGAGAAAAAGAACAGTATGAAACGAGACAGTAGGTCTGATGGGGAAATTCTTTCACCAAAATGTAAAATACCTCCCCGGACTCTTGCCAGCTCTATGGAAAGACATAGAGAGGTGTGTAGAGATTCGACGGAGGATATAATGGAGAGTTGCAAAATGCGAGGAGCAAATAGAAGCAATGACGAAGATCATTTTTCCGACGATTCACTCGAAGAGTCCTTTCCGCCGCCTCCCCCCGCAGTTAGCACGCCTTCGAAACGCAATTCCATAGCTTGGGAAGTGTCTCTCGATGGTGACGATCCTCTTTTGACTCCCGGTAGCACTAAGGTTATAGGAAGAAGACGAAAGAAATCTGGCGATCAATCGCATTCCAGTACGAGTTCTATTCCTCAAAGGATTGATGATGACTGGGGTGAGGAATATTGGCCCCCGCCCCCTCCGTTGTCACAGTGTGAACCGATAGTTTCACCAATATCAGACGCGGAACCAGAACTTCGAAGGCCCCAGGACTTATCAACAGGGACATATGTCATTCGAAAGGGAAAAAATAGAAAACAATTACCAACCTTCAATAAAAATACAAGCAACAAACCACAATCGAATACAAATAATCTATCACAGAGTCATAGTTTGAATAGAAGCATTGAAAAATCAATAGACGGTTCTAGTATTTCAATAAGCGGGTCAGGCTCTGTAGGTTCTTATAACTCTAGGCCCAGTTCAGATCTGAGTCTACCTCAATCCCGATACAGCGCGGACCTTAACTCAAGACTAAGCCGTGAACTCAACACGCCCAACTCTAGATATAGCATTGATCTTTGTACACCGAATTCTAGATTAAGTAAGGAGATTACTTCACCGAAGTCAAGATTAAGTTTAGATCTCAATAATGGTCAAGATAGATATATAAATCCAGAATATTTTGGTCATAGTCCTAAAAGTAGGCAAGTAAATAGTGATAAAAATGTCGTCGGCTCTAAAACTAACAGTGGCCAAAGCAAATTATCACCTCAAAATAGATTTACTGATTTTAAAAAATATTCATCTACTTTCGATAACATTCAATCCCTAATTAAAGAAGGTAAAGTGGAAGAGGCACCACAAAATGATTGCAATGAAACTGTCACTGAGCTTTCTGTTGTCCCTCCGACTATGGTACGTGTCATATCACTGCCGAGTCTAGGAGCTGAGGCAGACAGCAATAGTGCTAGTCGTCAAGCCCTTATCACGACAGTAGAGGAAGAGGATGATCAAGAAAGTGGTGAAATCGATTCTAATGAAGATACCTCACCTTTAAGAAAAATTGAAAATAACATAAGTGCTATATTGAACCAAGGACGGGACCAAAAACATTTTACCCCATATACACCTAAAGACTGGAATATACATAAGGATGAGTACTGTGATGAAATATCTAAGGATATTTTAGAGAACAATTACCGTTATAGCTCTAGGGAAAGACAAAAAAGACACGATATCCAGAAATCATCGAGTCATAACGAAATCCAAAATCAGCGATCAATTGATCGACGCGATCGTTCTTCTGGGAGGAGATCAAATAGTATGCATAAGTCGACTAGTGCTAAGGATGTGCCCGTAAGCTTGATGCGCCAGCCTTCCTCCTCAGATTCTGCTGTATCAAGTGGAGGTGATTTTCCTCTAAATATTCAAATAGTGGAACATCCCTATAGGCACAATCAATTACCTCCATCGCCTAGTACAGGTCATGAGATGGGACCGCTACCTCAGACACCGGAATCCCCAAAATTCCCACCCTTACCACCTTCACCCGTTCAAGAAGTTGAAGATGAATACACAGAGATAATGCAGCCTACGGGAAGACGTCACATTAAAAAAGCCGATACGTTGCCAACACCAACTATGGAATCAAGGAGACGGCCATCAGAACCTCCCGCGGTGCCGCCTCACCGCGATACAACAAACAGCCTTAAAACGAGATCAATGGAAAACAACTTCAACAAGAATAGAAGAAATAGTAATTTCAAAAGTGGCTCAACAGACAGACGTACTTTGCCAACAGACACTGGAACCAGCGGCGCTCGCCGTCGTACGCTGCAGCGTCAGAATCGCGAATCCGGTTACAATGTTCGTGGACAGCTTCAGACATCCGCCAGCCTGCCCGAGACACCTGTGTTCGCTCGTGGCTGTGATGTCCCAAGAACTCCGCCAAGAAATACTGGCCCGCCCCGACATAACACCATCAACTCGATGCAAACTATAGGAAGCAGCGCTACCCTAGGCGGTTACGGGCGTGGATCTGTAATGGGGGCTTCGGGAGTGTGCACCGGCGCTGACCTTCTCCGTCTGGGAGGTCCCCCGCGTGGCTGGTATCCAAGACAACGGAACCGGCCTGCATCTATCGAGCACCTGGATAGAATCTCGACATCAGCGAAGGTAGCCGCCGACCATCCGGTGGCATGGGAGGCGTCTGGTGCTCGCAAGCCTCTCACTCTCCCCCCGAACTTGACGCCTAAGTTCTTCCAAAAGTCTCCGAGGGAAGCGCTGCGGCGGGTCACAAGTCTATTAATACGGAAAGGTAAGGGCGTTAGTAATTTTAAAATGATATTCGATGATGTGCAGTATCCTATAGATTCTAGAACACAGTTAAAATATTAG

Protein sequence:

>DPOGS211197-PA
MLLYDRGCDDVQRSLELLDQVLSEYDEGEPRSLEARATPATPATPATPATPATPATDDDSPLAGHTSEDDGYMSMNGRRAKFALGFRPTEEREEGSPPDPPSPHSPPPPEEAQRIISTLLPKVSPTNSAKHTNLFSHSEAEEKLDSIISVNGITTATQTTFPKTRHQRPYGWDKEIEANGFQLQQAPVPPYKFSSLQHGSRPPQTEIAPTWLPPRHSAPPSNQKKSPSVENPPVFPFMGFSSDFTDKPYNEHPVTIYPGPNIQYSRQSHSPTYKISPSNSKSIEKKNSMKRDSRSDGEILSPKCKIPPRTLASSMERHREVCRDSTEDIMESCKMRGANRSNDEDHFSDDSLEESFPPPPPAVSTPSKRNSIAWEVSLDGDDPLLTPGSTKVIGRRRKKSGDQSHSSTSSIPQRIDDDWGEEYWPPPPPLSQCEPIVSPISDAEPELRRPQDLSTGTYVIRKGKNRKQLPTFNKNTSNKPQSNTNNLSQSHSLNRSIEKSIDGSSISISGSGSVGSYNSRPSSDLSLPQSRYSADLNSRLSRELNTPNSRYSIDLCTPNSRLSKEITSPKSRLSLDLNNGQDRYINPEYFGHSPKSRQVNSDKNVVGSKTNSGQSKLSPQNRFTDFKKYSSTFDNIQSLIKEGKVEEAPQNDCNETVTELSVVPPTMVRVISLPSLGAEADSNSASRQALITTVEEEDDQESGEIDSNEDTSPLRKIENNISAILNQGRDQKHFTPYTPKDWNIHKDEYCDEISKDILENNYRYSSRERQKRHDIQKSSSHNEIQNQRSIDRRDRSSGRRSNSMHKSTSAKDVPVSLMRQPSSSDSAVSSGGDFPLNIQIVEHPYRHNQLPPSPSTGHEMGPLPQTPESPKFPPLPPSPVQEVEDEYTEIMQPTGRRHIKKADTLPTPTMESRRRPSEPPAVPPHRDTTNSLKTRSMENNFNKNRRNSNFKSGSTDRRTLPTDTGTSGARRRTLQRQNRESGYNVRGQLQTSASLPETPVFARGCDVPRTPPRNTGPPRHNTINSMQTIGSSATLGGYGRGSVMGASGVCTGADLLRLGGPPRGWYPRQRNRPASIEHLDRISTSAKVAADHPVAWEASGARKPLTLPPNLTPKFFQKSPREALRRVTSLLIRKGKGVSNFKMIFDDVQYPIDSRTQLKY-