Monarch geneset OGS2.0

DPOGS210438
TranscriptDPOGS210438-TA2148 bp
ProteinDPOGS210438-PA715 aa
Genomic positionDPSCF300062 - 233865-238249
RNAseq coverage422x (Rank: top 29%)
Annotation
HeliconiusHMEL0151120.071.85% 
BombyxBGIBMGA001957-TA0.065.06% 
Drosophilatamo-PA2e-4440.30% 
EBI UniRef50UniRef50_D6WTX93e-6133.19%Tamo n=1 Tax=Tribolium castaneum RepID=D6WTX9_TRICA
NCBI RefSeqXP_001809413.15e-6233.19%PREDICTED: similar to GA17922-PA [Tribolium castaneum]
NCBI nr blastpgi|3838657774e-6236.12%PREDICTED: uncharacterized protein LOC100877493 [Megachile rotundata]
NCBI nr blastxgi|3072022441e-7932.03%Protein tamozhennic [Harpegnathos saltator]
Group
Gene OntologyGO:00082700.0001zinc ion binding
GO:00056220.0001intracellular
KEGG pathway 
Orthology groupMCL17451 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210438-TA
ATGATCATGTCGAGCATGATGGGTGATGCTATGTTAAGAGAACGTCTGCCTGCCTTATGGCGGCGGATAGAGGACGCGCATTACAGCTATTTGGAGGTAGACGACAGCCCCGAGAAGTTACTGCAGAAGAAAAAGTTAGAAGGCTATATCCTTGAATACCTTACGCTGGTGCCTCATGAGTGTAAGTTTGGACTAGCAGAGGCTGCTAAGATCTTCCAACGTACTATAGACGAGTTGCCCGATTACAGCGCGTATCGTGCTAGTATAGGCTGGGCGGCCATCGCCAGATACGCCGGCAACCTGCTGGCACAACCATGGAGGAAGGAGTATAAGGAAATACGGCTGTACTCCGGTTTCTACAAGCACGAGGTCGAGTCTAACCTGGTGTGTGCCGAGAGCCTTCTTCATATCCTGGGCTATAGACCATCTGGCCCGGGAAAACTCGCGCTCGACGGGCCCGTGTGCCCGGACATGGCCGCTGCCATATCGAGGGACGCCATCATCGCTTACTGCGAGTGTCAGATAATGTCTGAGATCTGGGAGAGCGTGTGGTCTAACGGCGGCAGAGTGTCGTGGGCGGCGGTGTCTCGGGAGCGAATTGCACGAGCTTCATCACCTTCCTCCGCGGCAGCGAGACTAGCCGGGTGCGGAGAATCTGAAATCTACTCAAATTTACCAGCGACATCAGACCAAAGGAGGATAGAAGCTGATCACACGTCATATCAGATGCCCCCGCCGTGTTACTGTGCGGACCAGGTCGAAGAACCGGTGCAACCCATAATAAACCCCTACCTTATACCTAAAATGCCGCCTATGATGTACGAACTGCCGCCCATGCAGTGCAATCCAGTGCCCATTATGACCCCGTACGGCGTTCCGTATTATTACCCCGTGCAAACGCCGTACATGATACCCACGCCGGTTTACGCGCCAATAAAACACGCGACAAATATACCGGTCAATGGTTATCCCCCGATATATCTGGATATGCCAGCTGTGCCCACTGCGCAGCTGATTGAGCTGGACGGAGCTTCGGTTTACGAGAACGGTAAATTTGGAAGGCAGGATGATAGGGGCCATAAGAGGAATAGACACGCCGAGTCGAAACGTACATCGAAATCCGGGTTCAGTGACGTCTCCCTGCCCAGTCTGCCGCGATCTGACACTCAGCCGGCGTTGAGTAAGGCGAAGGAAGACGGCATGGGCACTTACGAGAGTTGGGACTATGTTTTTCGGAATCTATCTAGCAAGGAACGCCATGGAGATAGCAGAAGTGGGTTTTCTCAATCGCTGGACAGAGATTCAAGGACGCTCGATAGGTTGGACAGAGAAGAGAGGAGATCTAAATACCAACCTACCACCCTAGACTTAGAAGACGGGCTCCAAGCTTTAAACTTGGATAGATCGTATGACGAAGACGCGTATCGGACGGCTAAGGTGAACGAGAATTTAATGAGGCTGAAGCAGGAGCAGGAGTTAAAAAAATCAAAGCAGATGAAGAAGCAAACAGAAGATAAACGACCGAAAAAAACCCTAGAACCCGTTGGCAACCCTAAGGCTGATGGTCTCATCACACCGAAGGTTGCACCGGATAAGGTTAAACTGCTGACCAAAAAGGAGATTAAGGATAGGAAGGACGTCATCAAGCAGCAAAACTCAAACGTCGAGTCATTATCAAGCGTGGCCGATGTCAAGAGAGTCAAGAAAGTATCCAAGCTCATACCGTCTGATGCTGATAAAAAAATTAAACCTTTAGAAAATGGCCTCCATAAGGCACAGAGCTCTAAAACTAATACAGTCGCAAATCAATCAAATCACGATTTGAAAGCACAACTGGTCGTCTCGCTCGACGAACCCGATTACAAGCGAACACCGAAACAGAACGGCGAACGCGAACGAACACCGAACCGAACGGACGAGGGGACTGACACCAGGAGAGACAAATGGGAGTGTAATACATGCACATACCTCAATAAGAACGCGGCTGTAGCGTGCGAGATGTGCGGGAAGTCGAAAAAGGGACCGGAAATCGAACCACTGACTTCCGGTGGCAGGGAATGTCCAGCCTGTACTCTGGTCAACAAACGGGAAGCCAGAATATGCGACGCTTGCGGCACGAGTCTAGACCATTGTCCTACATACATATAA

Protein sequence:

>DPOGS210438-PA
MIMSSMMGDAMLRERLPALWRRIEDAHYSYLEVDDSPEKLLQKKKLEGYILEYLTLVPHECKFGLAEAAKIFQRTIDELPDYSAYRASIGWAAIARYAGNLLAQPWRKEYKEIRLYSGFYKHEVESNLVCAESLLHILGYRPSGPGKLALDGPVCPDMAAAISRDAIIAYCECQIMSEIWESVWSNGGRVSWAAVSRERIARASSPSSAAARLAGCGESEIYSNLPATSDQRRIEADHTSYQMPPPCYCADQVEEPVQPIINPYLIPKMPPMMYELPPMQCNPVPIMTPYGVPYYYPVQTPYMIPTPVYAPIKHATNIPVNGYPPIYLDMPAVPTAQLIELDGASVYENGKFGRQDDRGHKRNRHAESKRTSKSGFSDVSLPSLPRSDTQPALSKAKEDGMGTYESWDYVFRNLSSKERHGDSRSGFSQSLDRDSRTLDRLDREERRSKYQPTTLDLEDGLQALNLDRSYDEDAYRTAKVNENLMRLKQEQELKKSKQMKKQTEDKRPKKTLEPVGNPKADGLITPKVAPDKVKLLTKKEIKDRKDVIKQQNSNVESLSSVADVKRVKKVSKLIPSDADKKIKPLENGLHKAQSSKTNTVANQSNHDLKAQLVVSLDEPDYKRTPKQNGERERTPNRTDEGTDTRRDKWECNTCTYLNKNAAVACEMCGKSKKGPEIEPLTSGGRECPACTLVNKREARICDACGTSLDHCPTYI-