Monarch geneset OGS2.0

DPOGS207892
Transcript: DPOGS207892-TA (3333 bp)
Protein: DPOGS207892-PA (1110 aa)
Genomic position: DPSCF300101 + 238562-243831
RNAseq coverage: 420x (Rank: top 29%)
Annotation
Heliconius: HMEL010244; E-value 0.0; 56.75%
Bombyx: BGIBMGA008479-TA; E-value 1e-172; 67.35%
Drosophila: CG32685-PC; E-value 1e-81; 43.45%
EBI UniRef50: UniRef50_D2A1C7; E-value 4e-99; 57.06%; Putative uncharacterized protein GLEAN_07111 n=2 Tax=Tribolium castaneum RepID=D2A1C7_TRICA
NCBI RefSeq: XP_975380.2; E-value 6e-100; 57.06%; PREDICTED: similar to YLP motif containing 1 [Tribolium castaneum]
NCBI nr blastp: gi|189236763; E-value 1e-98; 57.06%; PREDICTED: similar to YLP motif containing 1 [Tribolium castaneum]
NCBI nr blastx: gi|345491547; E-value 3e-104; 38.37%; PREDICTED: hypothetical protein LOC100122795 [Nasonia vitripennis]
Group
KEGG pathway:
Orthology group: MCL12947 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS207892-TA
ATGGCTTGGTCTATGCCTACGGGTCAATGGAATTCCGGCATTTCCATGACGCCAGATATTAATATAGCTAGTATGGGATCATACACACCAGAACAGTGGGCGCTAATGCAACAACAGAACTGGCAGCAGTGGGCACAATGGCAACAACAGTATGCTCAATGGCAAAGCCAGTATGGTGACAAGTATACTCAGCATATGCAAGCCCTTCATGCTATGAGCGGGATACCACCTCCAGCTCCTAATGCGGTTCCACCCCCAGCTCCACCTCCACCGGAAAAGCCTCCTCCACCACCACATGAAAACAACCAACCTCTGTACGGAAATACACCATCGCAAACACAGTCTGTGTCACACACGCCCCATCTTCCATACTCTAAAGTTGGTTATAATGTGGTTCCTAAAACAGGTAACAATTTTAATCAAACTTCCACAATAGATTCCATGTCCGATTTACCGACCTCTCAGGTTGTAAACACGGATGCGCTCATGAAGCTAGCTGAGGAGGAGCGTTTGTTTGATATACAATTTCAAAAATGGGAAGAGGAAATAAAAAAGTGGAAAAATGAGAATGTTAACCACCCCGATAAAAGAGCTTATATGGAGTATGAGCAAAAGTTTGCCAGCTGTCGTGCACAATTGCTGGAGCGGCGTCAACAGATGAAGCTGAAAAGAGATAGTCTAATGGGTGTTAAAGCCACACAGACAGCAAACACTACAATTAACAGCACAGGTAATATATCAACAAGTATTCCTCCACCTACACAAAATATTAACAAAACCAACTATAATACAAATGTGCAAAACTCTCAAACAAAAAATAATGTTACCCAGAATTACATAAACCAAAATCAATCTCAGTATGAGCCAATTGGTCATTTACACCAGACAAGTTTTAATAGGAGTAATAAAAAGCCAGAACATCAAGACAGGTATGAATATTATGGGGATATGAGCAATGATTACACTACTACAAGTGACACTTCTAATTTCTTACCCACAAATGATTCTTTTAACGGCATACCGGGATTAGACTTGGTACCAGATGGTGATAAATCTGTACAAAAACAATTAGATGTAATTGATATAACAGAAGACAGACAGAATCAGCAACGGCAACAAAATATTCAAGCACCTGACTATTCAAAAATATCTAAAGGGATTAACAACATTCTTGGGGATGAAAAAATTATGAACATCCTTTCTATGATGAGCAGTCAGAATACTAGGAACGAAAGCAAAGTAGGTTCAGTTGGTTCTCACAGGCAAGAGCTGAATACTCAGTCTGGAAGCTTGCAATATAGTGGAAATAATAGCAGCTACCACGGAAATAATTACAATAATATGCAACCTCGGAACACGTCCTATCAACAGTCAAGTGAGAATTATACTAATCAAGCTTCAAGTTCTAGTTATACCGATCCTGATTACCAGAGATATGGGGGATCCTCTAACGAACAAAATGAAAATGTAAGAAGTAATGATATGGACAAAAACTATGAATACAACAGACTTGGGAATCAAAATATACAACAAACTAGGTCTAACATGCCAAGAGTTATGCAAAATTTACATTCAAAGCAAAGTGATTATCCGCAAGGGGATTTCGTAAGGAGGATGGATCTAGATGTAAAACAGATTCAGCCTTTAAAACCTAAATGGGTCGATGAACCTCTGTTCACACCCTCAATAATAGTTGAATATGAGCACAAACCATTAAGATTGAAAGCTCGAGATTTTATTGAGCCCGTGCACATGTTTGAATACAATCATCAATCTAAAGATGGAGAAAGTTCGAATAAGAAAAACTTCGATAAAGAATTAGATGATTTATTTTCAAGGAAAAGAAGAGCAGACGATGACTGGAGTAGTTCAGACAAATTTTATTCCAGAGATTATGATCGGAGAGGTTTAAAGGATGATGCGAGAGATAGAAATCGGCTGCGAGATGATCGTGATATGTATGATAGAAGAATTGATGACAGAAGACGTGATGACCGTGA
TAGATTTAGAAGAGAAGAATATGATAGGAGAGATAGAATAGAACAAGAACGATCCAGAGATATGGGGAGGGGGCGTGATGAAAGAGACAGAGATAGAGATATGGCTAGAGACATGGGTCGAGAGAGAGAGTTGGGAAGAGATAGAGATTACACTAGAGATAGAGATAAAGATTTCAATAGAGATAGGGATACAAGAGATAGAAGTAAAGAATATAGAAAAGATGAAAGAGATATAAGAAATCGTAGTCGAAGTCGTGATAAAGAAAATCGTAAAAGAGGACATAGCAGAGAAACGGAATGTTTTGATAATTATGGATTGAAAAAGAATAGAGATATAAAAGATGAAACGGTTCCAACGAATAAGCCGAAACATGTGGTGATGATAGATGACCTTCTAGAGCCTCCGGGGCGCACCATGAGACCGGACAAGATGGTTATAATACTCAGAGGTCCACCGGGAAGTGGTAAATCTTATTTAGCTAAACTGATAAGAGATAAAGAAGCCGAGCACGGGGGCACAGCAAGAATAATGTCCATAGACGATTATTTCATGCAGGAAGGTGAAATTGAAGAAAAAGATCCCATTACGGGAAAAATTGTGAAGAAGCCGTCACTGAAATACGAATACGACGAGAGCTCCGAGGAATCATATATGACATCGCTAAAGCGGGCGTTCAAGAGGAGTATCACGGATGGCTACTTTACATTCTTAATATATGACGCCGTGAACGATCAGTTGAAGTCCTATGCTGATATTTGGAATTTCTCAAGGCAGAATGGCTTCCAGGTGTACATATGTACGATGGAAATGGATCCCCAAGCTTGCTTCAAGAGGAACATACACAATAGATCGTTGCAAGACATAGAAGCTATAGTTTCTAGTTTTTTCCCAACCCCAGCACATCACATACAGTTGGATCCGACGACCTTACTCCAGAGTGCGTCCATTCGGGAAGTACAAATGGAAGACGCCGATGACGTCACTATGGAGGAGGTGGAAAACCCTGAGGTCGATAATAGTTTTACGTCGAAATGGGAAAAAATGGAAGACGCCGCCCAACTAGCTCGTCTCGACGGCACTAGTCGGCCGCTGCGCTCGTCCCAGCTCTCCATGGAAGACTACCTACAGTTAGACGACTGGAAACCGAATACGGCTAAACCGGGAAAGAAAACTGTACGTTGGGCTGATATTGAAGAGAGAAGACAGCAAGAGAAAATGCGAGCCATCGGTTTTGTCGTAGGTCAAACTGATTGGAATAGAATGACTGACCCCACTATGGGGTCTAGTGCGCTCACGCAAACTAAATATATCGAGCGAGTCAGGCGGCATTGA

Protein sequence:

>DPOGS207892-PA
MAWSMPTGQWNSGISMTPDINIASMGSYTPEQWALMQQQNWQQWAQWQQQYAQWQSQYGDKYTQHMQALHAMSGIPPPAPNAVPPPAPPPPEKPPPPPHENNQPLYGNTPSQTQSVSHTPHLPYSKVGYNVVPKTGNNFNQTSTIDSMSDLPTSQVVNTDALMKLAEEERLFDIQFQKWEEEIKKWKNENVNHPDKRAYMEYEQKFASCRAQLLERRQQMKLKRDSLMGVKATQTANTTINSTGNISTSIPPPTQNINKTNYNTNVQNSQTKNNVTQNYINQNQSQYEPIGHLHQTSFNRSNKKPEHQDRYEYYGDMSNDYTTTSDTSNFLPTNDSFNGIPGLDLVPDGDKSVQKQLDVIDITEDRQNQQRQQNIQAPDYSKISKGINNILGDEKIMNILSMMSSQNTRNESKVGSVGSHRQELNTQSGSLQYSGNNSSYHGNNYNNMQPRNTSYQQSSENYTNQASSSSYTDPDYQRYGGSSNEQNENVRSNDMDKNYEYNRLGNQNIQQTRSNMPRVMQNLHSKQSDYPQGDFVRRMDLDVKQIQPLKPKWVDEPLFTPSIIVEYEHKPLRLKARDFIEPVHMFEYNHQSKDGESSNKKNFDKELDDLFSRKRRADDDWSSSDKFYSRDYDRRGLKDDARDRNRLRDDRDMYDRRIDDRRRDDRDRFRREEYDRRDRIEQERSRDMGRGRDERDRDRDMARDMGRERELGRDRDYTRDRDKDFNRDRDTRDRSKEYRKDERDIRNRSRSRDKENRKRGHSRETECFDNYGLKKNRDIKDETVPTNKPKHVVMIDDLLEPPGRTMRPDKMVIILRGPPGSGKSYLAKLIRDKEAEHGGTARIMSIDDYFMQEGEIEEKDPITGKIVKKPSLKYEYDESSEESYMTSLKRAFKRSITDGYFTFLIYDAVNDQLKSYADIWNFSRQNGFQVYICTMEMDPQACFKRNIHNRSLQDIEAIVSSFFPTPAHHIQLDPTTLLQSASIREVQMEDADDVTMEEVENPEVDNSFTSKWEKMEDAAQLARLDGTSRPLRSSQLSMEDYLQLDDWKPNTAKPGKKTVRWADIEERRQQEKMRAIGFVVGQTDWNRMTDPTMGSSALTQTKYIERVRRH-