Monarch geneset OGS2.0

DPOGS214728
TranscriptDPOGS214728-TA2037 bp
ProteinDPOGS214728-PA678 aa
Genomic positionDPSCF300022 + 71419-81813
RNAseq coverage314x (Rank: top 36%)
Annotation
HeliconiusHMEL0085909e-6253.23% 
BombyxBGIBMGA005135-TA3e-11060.21% 
DrosophilaCG42675-PE1e-4259.87% 
EBI UniRef50UniRef50_D7EL763e-4836.82%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EL76_TRICA
NCBI RefSeqXP_973174.17e-4936.32%PREDICTED: similar to CG10999 CG10999-PA [Tribolium castaneum]
NCBI nr blastpgi|2700154811e-4736.82%hypothetical protein TcasGA2_TC004275 [Tribolium castaneum]
NCBI nr blastxgi|3800114671e-6932.51%PREDICTED: uncharacterized protein LOC100868905 [Apis florea]
Group
KEGG pathway 
Orthology groupMCL16387 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214728-TA
ATGGAAGGCAAAATTTCACGTTCTCTAGACCACAGTAACTGGTGGGAGCCAGACCATTGGAAGATGCGTTCTGGCCAGATATCAACGTCCCTGGGAGAAGCTGATCCAGCTGAAATAAGGAATGGTTGGCTCACGTTACCTGACCCTAAGGCACGCTTCCAACAGAAGCAATTACAGGAGCGCGAACAGAAGATCGCGTCTCTATACGAATCACAGCAGGCCAGGGCTCTGGACCGAGTGAGACACTCGCCAAGCAACGGACTCACCAGCACCACGCCTACACTACCATCACCACAGACCCTGCCTCATCAGCCGGGGAAGGTCCGTCAGATGTTTGAAGAGAGGCGAACCAAGGCCGGGATAGACAAGAGCTACCCTCTCCAACCGATACAGAACACGGAGCGGACACAGCCGCGCAAGACCATGACCAACGGATTGAATAAACTGACTACCAAGACTGCTGACAGGAAAACGCTGAAAACGCTCAACACGGGCACACACACTGTCAAAAAACAGAACACTACTAAGGTCCATAAAGTTGGTGCGGCTGTGAACGGCTCCGGAGACGTCAACCACAACCACGAGCTGGACCTCAACACCAACACGTCTGTGCAGCCTCGCAGCAACGGTGAAGTACAAAAAGATGAGAGAGACGCCAACGACAACACGGAGCTCATAGAGAACGAAACCTTCCCTGAACTAGCGCTGGAGCAGTTGACGCCGCGGACGACGCCAGCGACGCCAGACACGAACGGGAACAATAATAATAAGAAGAGCAGTCCGGCAGCGACGAGGCCCAGGAGTCCGGTGAAGAGTAAGCCGGCAGCTGTCACAACACCCAGGAAACTGATCAGTGACAATAAGGCGTCCAGGGAGAACAGTAAGCCCCGGGGGGCGATGCAGCGGACGGTCAACTCGGTCCGTCAGATGTTTGAAGAGAGGCGAACCAAGGCCGGGATAGACAAGAGCTACCCTCTCCAACCGATACAGAACACGGAGCGGACACAGCCGCGCAAGACCATGACCAACGGCTTAAATAAACTGACAACCAAGACTGCTGACAGGAAAACGCTGAAAACGCTCACCACGAGCACACACACTGTCAAAAAACAGAACACTACGAAGGTCCATAAAGTCGGTGCGGCTGTGAACGGCTCCGGAGATGTCAACCACAACCACGAGCTGGACCTCAACACCAACACGTCTGTGCAGCCTCGCAGCAACGGTGAAGTACAAAAAGATGAGAGAGACGCCAACGACAACACGGAGCTCATAGAGAACGAAACCTTCCCTGAACTAGCGCTGGAGCAGTTGACGCCGCGGACGACGCCAGACACGAACGGGAACAATAATAATAAGAAGAGCAGTCCGGCAGTGACGAGGCCCAGGAGTCCAGTGAAGAGTAAGCCGGCAGCTGTCACAACACCCAGGAAACTGACCAGTGACAATAAGGCGTCCAGGGAGAACAGTAAGCCCCGGGGGGCGATGCAGCGGACGGTCAACTCGGTACTTAAAAAACCTCTCACTGCGAGTCCACAGACCTCCGCCTCGAAGGCGCGCTCGACCACCGAGAGGTCTAGTGAAGGGGCGTGCGGCGTCTGCGGGCGGCACTTCGCCCCCGACAGACTCGCCAGGCACCAGGACATCTGCCGCAAGACACACGCCAAGAAGAGGAAGCCCTTCGACGTGCTCAAACACAGACTAGCGGGCACGGAAGCCGAGCCGTTCATCAACCGACTCCGCAAGGGACCCGCCAACACCTCCTCCACCAAGCTGCAGAAGCCTCTGAACAGTACGTGGAGACAGAAACACGAGGAGTTCATCCAGGCGATACGAGCCGCCAAGCAAGTTCAGGCGCATCTCAACGCTGGAGGTAAGTTGAGCGACTTGCCGCCGCCTCCTCCGTCCGAGAACCCGGACTACGTCCAGTGTCCTCACTGCAAGAGACGCTTCAACCAAGCCGCCGCCGACAGACACATACCCAAGTGCGCCAACTTTCAATTCAACAAAAGTAAACAACCGCCCAAAAAACGATAA

Protein sequence:

>DPOGS214728-PA
MEGKISRSLDHSNWWEPDHWKMRSGQISTSLGEADPAEIRNGWLTLPDPKARFQQKQLQEREQKIASLYESQQARALDRVRHSPSNGLTSTTPTLPSPQTLPHQPGKVRQMFEERRTKAGIDKSYPLQPIQNTERTQPRKTMTNGLNKLTTKTADRKTLKTLNTGTHTVKKQNTTKVHKVGAAVNGSGDVNHNHELDLNTNTSVQPRSNGEVQKDERDANDNTELIENETFPELALEQLTPRTTPATPDTNGNNNNKKSSPAATRPRSPVKSKPAAVTTPRKLISDNKASRENSKPRGAMQRTVNSVRQMFEERRTKAGIDKSYPLQPIQNTERTQPRKTMTNGLNKLTTKTADRKTLKTLTTSTHTVKKQNTTKVHKVGAAVNGSGDVNHNHELDLNTNTSVQPRSNGEVQKDERDANDNTELIENETFPELALEQLTPRTTPDTNGNNNNKKSSPAVTRPRSPVKSKPAAVTTPRKLTSDNKASRENSKPRGAMQRTVNSVLKKPLTASPQTSASKARSTTERSSEGACGVCGRHFAPDRLARHQDICRKTHAKKRKPFDVLKHRLAGTEAEPFINRLRKGPANTSSTKLQKPLNSTWRQKHEEFIQAIRAAKQVQAHLNAGGKLSDLPPPPPSENPDYVQCPHCKRRFNQAAADRHIPKCANFQFNKSKQPPKKR-