Monarch geneset OGS2.0

DPOGS215246
TranscriptDPOGS215246-TA5322 bp
ProteinDPOGS215246-PA1773 aa
Genomic positionDPSCF300047 - 454623-476147
RNAseq coverage1667x (Rank: top 8%)
Annotation
HeliconiusHMEL0139880.062.38% 
BombyxBGIBMGA008811-TA0.050.79% 
Drosophilacp309-PF5e-4333.08% 
EBI UniRef50UniRef50_G6DR860.099.77%Putative uncharacterized protein n=11 Tax=Eukaryota RepID=G6DR86_DANPL
NCBI RefSeqXP_972088.12e-4528.85%PREDICTED: similar to cp309 CG33957-PB [Tribolium castaneum]
NCBI nr blastpgi|3072112582e-4727.61%A-kinase anchor protein 9 [Harpegnathos saltator]
NCBI nr blastxgi|1571132982e-11123.91%hypothetical protein AaeL_AAEL006395 [Aedes aegypti]
Group
KEGG pathway 
Orthology groupMCL25539 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215246-TA
ATGGTGATGAGGCATGAAGGAGCGCAACCTCTCCCACGTACGGCAGACACGATGATGATGATGGTGCAGGTGTTGTTGTCCGACCCTGACGCGGAGCTGTCCAGCTGGCCGCTGGAGCTGGTGGCGCTCAGGGACAGGATCCATCACGACAGGGAGTTATCGAGTGTCCGGGACGAGCTGTCCAGCGGCGAGGGAGACAAATGGAGACAGAGGAGGAACAACAGCTTTGACCAGAACCGTCAGCTGGAGGAGGTCACCAGGGAGAGGGATGGTCTCCGTCGTGTGGCGGGAGTGCTGCAGCGTGCTGTTTCGTCACTGGTCGCCTACTGCGCCAGCGCCGAGGACGAACTGAACAGGACCGTACTGAACAAACTGTTGGGACACCTGGCCGCCGACGACGACACTATTGTGGAGCTGGACTCCCGGCCGTCCACTCCGGTCGCTGAGCTGAGTGTGGTAGGAGAGACGCACGTCCACCTGGCGCCAGACCTGCACTCCATATTGGTGTCGCTGGACGAGGCCGGCGTCAGGGGCTTCCTGCAGCAGCAGAGGGACCTCGGAGACGACATCAAGAGAGAGCTGGACGCCTCGCTCAAGAGGCTTAGGCACGAGGCAAAGGACCTACTGCAGCTGTCCGCGCGCCTCGCCAGCAACAAGGCTCGACAGGAGACGAGGATGTTAGAGAACAGCACTCGGGAGGCAAAGGAGGAAGATGAACGAGAATGTATCAACTGTATGATGCAGAGAAAAAACGTGGAGGAGGCCATGTCCGAATGTCTCCAACGTGAGAATCTGCTGCGGAGCGACCTCGACGGCGCCATGATGAAGATTGCTCACCTCATGGGACCTCCGGACAGGACGCACTCCGACGACGTCGTGGAGGGGTACGGCACGGGCGGGCGCTCCCTGGACGGCCTGACACCGCGGGCCCAGCTCGCTGCGGACCTCGAACACGCGCTAAGGGAGAGGGACGACCTGAAGCAACAGCTGGAGGCGGCCAACCGTCAGCTTCGTTCCACTCGCCAGTTCGTGGAGGAGCAGGCGGCCGAGCGGGAGGCGGAGAGGGACGAGTTCGCCGAGAGACTCGCCGAGCTCAGGGACGAGAACAGCAGGCTGGCCGCTCGCCTGCAGACCAACGCCAGGATAGTGAACGAGGTCGAACAACTCGAAAGTCAGACTCGGGAGATGAATCAGATAATAGCGGACCTGGAGGGGCGGAAGGGCGCCGCGGACGAGGAACTCAAAGCTGCAGAGGAACAAGTGTCCCTGCTGAGAGACATCATAAACAACCTGGAGACTCAGCTGGAGGAGAAGACTGCCCGAGAGGAGGACGTGCTGCGGGAGCTGGGGGACATGAGGAGCACCATAGACGACAGGGACCGGAGGATGAGGGAGCTGCTGGCGGAGCTCGAGCGTTCAAGGGGTCGCGACGCGGAGAGAGAGGCGGCCACCCAGCAGGGAGAGGAGGGGGGCGGTGACGAGCTGCTGGGGACACTCAGAGATGATGCCAAGCTTCTCGAGGAGCAGATACACAGCAGCGTCGTGCGGCTGGAGGGAGCCTACGAGCGAGCCTCCGGATCACTGTCCGAGCGCACCGAAGACGTATCAGTAACGGGGGCGAGGGCGGGCGGGGAGCGGGGTGGAGGAGGGGCGCGGGGACCTGCTCTAGAGGAACTCAGCGGGGTGTGGGACCAGCTGAAGGCGCTAGAGCGGGCCGGGGACGCCGCCCTCAAGAGGATAGATGACCTTCACATGCAGCGGCAGAGACTCAAGGACGTGGCGCAGGAGGTCCGCGCCGAGCGTGACGTGCTCCAGGCACGTATGTCAGAGCAGGCGCTGCGCATATCATCACTGAGTGCGCGTCTCGCGCAACAGCGATGTGACGCTGATGCGCTGGCTCATGACGCCGCGGCACAGCTCTCCGTCAGACTACACGACGCACAGGCTGAGGTTCAAAGATTGACGGAGGAGTTGAGTACCAAAGATAAACAGCTGGCGAGATTGAAGCAGAGCCACGAGGAGAGAGACAAGTATGGAGAAACACAACCGGTCTTCGGGAATTCATGTAACCCTAAAGACAGGGCGGCAATGTTAGAACGGGAATTTTCAAACGCTAAATCAAAGATAGAACAACTTGAAAGTTTAGTACGAAGTCTCGAAACTGATAAGGAAAAACTTCAATCAACTGTCAGAGAACAACAGAGGACACTCGCCGAGAAGGAGGAACAGTTAAATGAGATTATGGCTCTTAATCTGGTGGAGGACCAAAGCGAAGTCATTCAGGCCAAGACGTCCGCCCGGACACTCAGCGACATCGTCTCCATATCAGAGTTCGACGAACAGGATTTAATAATGCGTCGGGCGGAGCTGAAGGGACAGAACGCTAGCATTACAAACACAGACCACAAAGACAAAACCAATTTAAACAAGACCCTACCCCCCGATGTACAAAAACCTAATATGAGTTCATTGAACTTGGACTACGCAGATCATTTCGATGGCACAAACTTTACACCCCGTGCTGACTCACTACCCATACATCTAACATCTACTCAAAATAAGGAGGTTTATAAAAGGAATGCAAATGAAACGACAATATTTGGTGAAGGGATCAAACATTCTACTGCTCAAAATATACCCGATAATTGCTCAATGTATCCCAACAGAGACGTCTCGGAAAGTAAAAACCTAAGTGTTGAACCGAAAAAGATAAACTTCTCAATGGAACCATCTGACAACAGAACTCACGACGAGGAGTTCGCTTCCTTGAAAGATCTGGGAATAACATTAGACGTGAAACAAGAAAACTTCCCAGACATACTGACACACCTCAAACGTGAAATAAGGAAATCTAGAAGTGAGGTAGAACTGTATAAGGCAGAACTCAAGAATGCTGAGGAACAGCTGTGTGAGTTTCCAGCTTTAAAAGAGGAAGTGGAGGAACTGAAAGGGCTCTTAGAGAACACTATGGCTACCATGGAGAACGACAAGAAGTTTTACGAAAATCAATTGGATACCTTTGCTTCTAACAAGAAACTGCTCGAACAAAGACTTAAAGAATTAACACAAGAAGTTCATGAGAAGTCGAAAGACCTGAATCTGCTCAAGGAAGATATTCTACGAAGAGAAACCATGATATTAGAATTGGCTAAGGAGAAAATGAACTTAGCAAATAAGATTTCTGATTTAGAAATCAAAATAGACGACTTAAATAGCAAGAACATGGCTCTAGTAAAATACGAAGCAGAAAATAAGCAGCTCAAAGAAAAGCTAGCTGAGCTACAGAGACTGGAACAGTTGTTATCAGAAAAAAATCAACAGATTGATAGTTTAAATCAACATCTAGATAGATTAGACGACCTACAGAGATGTTTGAATGACAAAACTGAAGAGATGGAGGGACTGAAACAGGCTTTGAATATGAAAACAAAGGAACTGTTCCAGGTGAGAGACTCGGTCAACACATTAAACACTGACATCGCTAAAGTTATAGAAGAAAACGATCAGCTCAGCCAACAAAATAAAGAACTGAAGTTAAAACTGACCAAACTAGAGAAAGAGCAAGAAAACGCCGCGATCAGGCTTCAGAATAATGAGACGGAACTCAACAGGGTCAACTCACAAAACAGCGACCTCGCGGCGAGAATAGAGGAGCTGAAGGTATTACACGACACACTGGCGGATAAAGAGACGGAGATAGAGATATTACACGAAGACATCAACCTGTACCATAACGAGATAGCGGCGTTGAGAGAACAGCTGAAGATGGTGTCGCGGAGCCCGTCGCCCAGGAACAAAAACACTGAAGCCGGCAATACTGTGGACAGGCAGGCCACTAATGATAGGAAGCAGTTGGTGAAAATAAAAAAACAAATATCTCTGTTGCAACACGAGCTTGACTTCCAGAAAAAGGAACTCAATGATAAGGCATTCGAATTAGCTAAGGCTAAACTGGACTTGACGGAAGCTCGGAATAATATAAGTCAGATGAGTAAGCAAATATCTGACAGCGAGCAGCTACAGATGGCCTGGAGCGAGCAGCAGCAGCAGCTGGAAAAACTAGCTCAAGAAAAGGAACAGCTGGAGCGACAGCTGGAGACCGTTCTAGCCAGACTGCGGGAGGAGACCGACGTAGGCGAGTTGAGGCACAAGCTGTGCGCTGAAAGTGAACGATGTGGTCAACTGGAGCGGGAACTGAGGCGACTTCGAGAGACGACTGACAGACCCCGTCCGTCGCCAGGCCGTGCGCGTTCACCCACGGCGGAGCTGGAGCGGGCGGTACGCGACCAGCTAGACTACTCACACGCCTTGGACGACGATATCATGGACCAGATTTTGTCTGCTAGCAGCGACGAACGAGAGGATATTCCGAGACTCGTTCTCAATTCATCCAATCAGTCTACGAGCTCGATGAAATCGACATCGTCGGAGCGCATGCAGCGGCTGCGGAACGACAACGAGAAGCTGCAGCTGAAGTTGGAACACCTGGAGTGTAGATTAAAGGATAAGGACGCGCTCATTTCTGAACTCAACAGAGTACGAGACAAGCTGTCCAGTGAGTGTCAGTCGGGGCGGCTGAGGCTGGAGGCGGAGCGCGACCGCTCGGCGCGTCTGGCCCTGCTGCTGGACTCGCACAAGGACACGGCCAGCTCGCTACACGACCAGGACTCCAGCATGATCGACATGCTCAAGAGACGACTCGAGACCGCCATGAAGTCCGAGTTGGAGGCGCAGCAGCGGGAGAGGGACCTGCTACAACGGCTGCAACAACTGGACCGAACGCTGCCGGCCGCGACCTCCCCCACGGACCAACTCAAACAGGAGTCGGAGGACCGGGCTCGACTGCAGAGCTCGCTGAGCGCGGCACGGCTCCAGCTGGAGGCCGAGCGGCGGCGTGCTGGAGAACTGCACGCGATGCTCGACCACGAGAGGAACAAACACAGAAGAGACCTGGACGAAGCGGCCGCCACGGCAGCCGCCCTCAGGGGGGAACTGGCAGAACTCAGGAGATTGAAACACGAATCCGGCGTGGAGCTGGTGAGGACCAAGGAGCTGCTGCACACGCAGAGCGACACCATCGCACAGCTCGAGAAGAAGTATTCCGCCAGGAGTAAACATAATGATACTATCACGGGCATCGAAAACGAAAAAACAGATCTGGCTCGTGAAATGGCAACCCTCCGACGGTGTCTGACGGACGCACCCCAGGCACAGCTGCAGCAGCTGCTGGAAGAGAGACAACAACTGAGAGACACCATCAGGGATCTAAGAGGACAGCTGGAGGCGAGGAACGACGACAAGGAACAGGCGGTTAGTATGTAG

Protein sequence:

>DPOGS215246-PA
MVMRHEGAQPLPRTADTMMMMVQVLLSDPDAELSSWPLELVALRDRIHHDRELSSVRDELSSGEGDKWRQRRNNSFDQNRQLEEVTRERDGLRRVAGVLQRAVSSLVAYCASAEDELNRTVLNKLLGHLAADDDTIVELDSRPSTPVAELSVVGETHVHLAPDLHSILVSLDEAGVRGFLQQQRDLGDDIKRELDASLKRLRHEAKDLLQLSARLASNKARQETRMLENSTREAKEEDERECINCMMQRKNVEEAMSECLQRENLLRSDLDGAMMKIAHLMGPPDRTHSDDVVEGYGTGGRSLDGLTPRAQLAADLEHALRERDDLKQQLEAANRQLRSTRQFVEEQAAEREAERDEFAERLAELRDENSRLAARLQTNARIVNEVEQLESQTREMNQIIADLEGRKGAADEELKAAEEQVSLLRDIINNLETQLEEKTAREEDVLRELGDMRSTIDDRDRRMRELLAELERSRGRDAEREAATQQGEEGGGDELLGTLRDDAKLLEEQIHSSVVRLEGAYERASGSLSERTEDVSVTGARAGGERGGGGARGPALEELSGVWDQLKALERAGDAALKRIDDLHMQRQRLKDVAQEVRAERDVLQARMSEQALRISSLSARLAQQRCDADALAHDAAAQLSVRLHDAQAEVQRLTEELSTKDKQLARLKQSHEERDKYGETQPVFGNSCNPKDRAAMLEREFSNAKSKIEQLESLVRSLETDKEKLQSTVREQQRTLAEKEEQLNEIMALNLVEDQSEVIQAKTSARTLSDIVSISEFDEQDLIMRRAELKGQNASITNTDHKDKTNLNKTLPPDVQKPNMSSLNLDYADHFDGTNFTPRADSLPIHLTSTQNKEVYKRNANETTIFGEGIKHSTAQNIPDNCSMYPNRDVSESKNLSVEPKKINFSMEPSDNRTHDEEFASLKDLGITLDVKQENFPDILTHLKREIRKSRSEVELYKAELKNAEEQLCEFPALKEEVEELKGLLENTMATMENDKKFYENQLDTFASNKKLLEQRLKELTQEVHEKSKDLNLLKEDILRRETMILELAKEKMNLANKISDLEIKIDDLNSKNMALVKYEAENKQLKEKLAELQRLEQLLSEKNQQIDSLNQHLDRLDDLQRCLNDKTEEMEGLKQALNMKTKELFQVRDSVNTLNTDIAKVIEENDQLSQQNKELKLKLTKLEKEQENAAIRLQNNETELNRVNSQNSDLAARIEELKVLHDTLADKETEIEILHEDINLYHNEIAALREQLKMVSRSPSPRNKNTEAGNTVDRQATNDRKQLVKIKKQISLLQHELDFQKKELNDKAFELAKAKLDLTEARNNISQMSKQISDSEQLQMAWSEQQQQLEKLAQEKEQLERQLETVLARLREETDVGELRHKLCAESERCGQLERELRRLRETTDRPRPSPGRARSPTAELERAVRDQLDYSHALDDDIMDQILSASSDEREDIPRLVLNSSNQSTSSMKSTSSERMQRLRNDNEKLQLKLEHLECRLKDKDALISELNRVRDKLSSECQSGRLRLEAERDRSARLALLLDSHKDTASSLHDQDSSMIDMLKRRLETAMKSELEAQQRERDLLQRLQQLDRTLPAATSPTDQLKQESEDRARLQSSLSAARLQLEAERRRAGELHAMLDHERNKHRRDLDEAAATAAALRGELAELRRLKHESGVELVRTKELLHTQSDTIAQLEKKYSARSKHNDTITGIENEKTDLAREMATLRRCLTDAPQAQLQQLLEERQQLRDTIRDLRGQLEARNDDKEQAVSM-