Monarch geneset OGS2.0

DPOGS216076
TranscriptDPOGS216076-TA1947 bp
ProteinDPOGS216076-PA648 aa
Genomic positionDPSCF300067 + 490648-494331
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0089411e-15948.75% 
BombyxBGIBMGA008873-TA2e-4531.63% 
Drosophila% 
EBI UniRef50%
NCBI RefSeqXP_001865230.15e-0628.57%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp%
NCBI nr blastxgi|2700085031e-0820.39%hypothetical protein TcasGA2_TC015019 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL35088 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216076-TA
ATGGATAAAGAAACTTATGAAACATATGAATACCTCGAAGCTGATAGCGGCGACAATGATACAAGCGAGTTTTTTGTTGTCCAAGGTGATGGAACATTTCTTAGTGTAGATAGGCCAATTCAATATGTAAGACAAGTTGAAAGTGCTAAGGATAATTTGGATGAGAGCCAGTCCTGCGCTGTCTATGATGTCACACCCCAACAGTTTTATGTTGATGTTGATGAATCCTCCGAACTTATATCAGTTTCAGACCAATTTATTCTGCCTGGTGGAGACAATAATCCGTACTCAAATAGTTTTTTATTACAATCAGCTTCAAATAGCAAGGAGGATCATGTTGAACAGATGGATGAAGACACAATAGAGGACACAAAAGAGGATGTAGCAAGTACCACACTATCTGAAGGAGGTAGCAATAAAGCTAATGGTGATTGTACTGAGATAACATTAAGCGATGAACAATATCAATTATTGAAGAGAAAAGGATGGATATTATTGGAAATAAATGACAAAGTATTCTTACTAGATACATTAGGATTACACGACATAACAGCAGACGAAAAATTAATCCAGAAGCTAAAGAATGAAATTGAAGATGATAACAGTGAAGATATCAACCAGAATCCTGTTAAAATTGAAGAAACTAGTTATATTGAACCTAAGTATGAAATGACGGAAGGCGATGAAAATAATGAAGAAGAGACACTTCACTTTATTGTTGAAGGCGACCAAATTATAGCTGATGGTAATTCACAGGAGGTCTCGGAATACTCTGAGAATGATCACGAAATATTACAAGTTGAAACAATAGAGGAGGAACAAGATGAGGCGATCAGAGTTGAACATGACTATGTGCAACTGAATTCCGTGAAAGATACAAAATGCAAACGGGAGAATGATGTTTTGAGATTAAAAACCAAATTCTCATTCAGAGACATTCCATCAGAAATAGTTTTGGGTAAAACAACAACGGGTAAAAAATTGGTGGCCAGAGTGGTCAAGACTGAGGCACCGGTGGATCCCAAACAAGCAAATCTAAAACTAGCCCAACAAAATGGACAACAGCAATGTACAACAACTTTAGAAGAATTCAAATTCGAAAATTTAATTCAACAAGCTCTAAGAGGCTATGACACTTGTAGTGTGAAGGACATATCAGCAGCCGAGACAGTTGTTGAACAACTCCTACGGGTACCCGAATTCAAACCGGCTATTATTGAACGGCGATTGATGATTACTAAGGTCCACCAATATGTACAAGACACGTCCGGTAACGTGTTCAAGGAGGGTAGGGCGACTTTAGTGACGGGGCGAGCTGTACTCGAGGGTACCCGCGGGTGGCGGTTCATGTTCCTGCCTACTATGCTGCCGCGGCTGCTGGGAGAGGAAGACGATGATGACGATGAATTACAAGAAAGATCTGAGGAGTTGGATGATGATATTTTCTTGCACATCCATATAAGAGAGACAAAAGATTCTGATGGCATAACCAGGATTTCAATAACTCTTAATAAGAGACATATTCCTATAAAAACTATGACCGAGATGAAATCTCGATATCCCAAAACTGTTTTTGCGTGTAGTGCCTGTGCTGCTGTATATAAAACAGAGGAGGGGCTTAGGCTTCATCAAGAGACTGAATGTATGGAGACTGAAACACCTCTAACTATAGATGCCGATGACACAAGTGATCTATACAGTGTCATCGGTACTGTTAAAGAGAAGCAATATATTTGTAACCAGTGTCATTTGGTATTTAACAAGTTGAATAACTGCCAGCGGCATGTTAAGCTTCATTACAAACAAGATCCGGAGAACTCCAGACAGGAAGTACCAAAGAAGACTGAAGGTGCCTATAAATGCAAGATGTGCCCAAGCACTTATCATCACGCCGCCACACTATCAAAACATATTGTATCTAAGCATATTAAAATAAGGTCTACTTGA

Protein sequence:

>DPOGS216076-PA
MDKETYETYEYLEADSGDNDTSEFFVVQGDGTFLSVDRPIQYVRQVESAKDNLDESQSCAVYDVTPQQFYVDVDESSELISVSDQFILPGGDNNPYSNSFLLQSASNSKEDHVEQMDEDTIEDTKEDVASTTLSEGGSNKANGDCTEITLSDEQYQLLKRKGWILLEINDKVFLLDTLGLHDITADEKLIQKLKNEIEDDNSEDINQNPVKIEETSYIEPKYEMTEGDENNEEETLHFIVEGDQIIADGNSQEVSEYSENDHEILQVETIEEEQDEAIRVEHDYVQLNSVKDTKCKRENDVLRLKTKFSFRDIPSEIVLGKTTTGKKLVARVVKTEAPVDPKQANLKLAQQNGQQQCTTTLEEFKFENLIQQALRGYDTCSVKDISAAETVVEQLLRVPEFKPAIIERRLMITKVHQYVQDTSGNVFKEGRATLVTGRAVLEGTRGWRFMFLPTMLPRLLGEEDDDDDELQERSEELDDDIFLHIHIRETKDSDGITRISITLNKRHIPIKTMTEMKSRYPKTVFACSACAAVYKTEEGLRLHQETECMETETPLTIDADDTSDLYSVIGTVKEKQYICNQCHLVFNKLNNCQRHVKLHYKQDPENSRQEVPKKTEGAYKCKMCPSTYHHAATLSKHIVSKHIKIRST-