Monarch geneset OGS2.0

DPOGS210121
TranscriptDPOGS210121-TA4386 bp
ProteinDPOGS210121-PA1461 aa
Genomic positionDPSCF300017 + 1483161-1493733
RNAseq coverage211x (Rank: top 46%)
Annotation
HeliconiusHMEL0107030.066.14% 
BombyxBGIBMGA000228-TA0.057.53% 
DrosophilaCG32397-PA1e-3544.75% 
EBI UniRef50UniRef50_B0WMN71e-4553.53%Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WMN7_CULQU
NCBI RefSeqXP_001662638.14e-4833.87%hypothetical protein AaeL_AAEL012536 [Aedes aegypti]
NCBI nr blastpgi|1571328008e-4733.87%hypothetical protein AaeL_AAEL012536 [Aedes aegypti]
NCBI nr blastxgi|1571328002e-6525.71%hypothetical protein AaeL_AAEL012536 [Aedes aegypti]
Group
KEGG pathway 
Orthology groupMCL26713 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210121-TA
ATGGACATCCGTGTGGTATTTGTTTTGCACGGCGTCATCGACCACTTAACCCAGCACTCGGTGGACTCGGTCGCGTCGTATTCTAGTCAGAATCACGTTGACCGTATAAGGACTCAAAACTTCTCAGCGGCTTCCTGTCAAGACGAATGTGATTTCGGGTCCATTAGGACGAGCCTTTTCAGAAGATCTTCGTCTAATACAGAAATTCCAATAAAACGACCAGAAAGCGTCACATCCAATACAAGTTCTACAAGAAGCCGTCTTAGAGATTTTGTTGAAAGGAAATCGAGTCAAGACGAACATAGACCCGTGGCAAACACAGTTGCTGACATTCAATACTTTGAAAATCCCACAGAAACATTAGAATTTGACAATCCACGAAATTCACTAGAATCCAAAAAGAGTCAAGACGAAAATACTAAAAAGCGGAAATTTTCTAATAAAGATAAAGACTGTGAAAATAAAATCAAAAAGGAGGACAAGTATCAGAGTAAACTCGCTGAATATTACAAGCTTCCGTTACAGCTACCTCAAGACGAATTTTATCAGCATCTCACACGATCTAAGGCTGCTGAGGAATTTTTACAGAAACGCTTTCCAAATTCCGACACTGAGTTCAGTGCTAGTTACGGAAGATTGTGCAAACATAAAGACGTCGAAGGTTCAAATTTACGTCGTAGCAGGAGTCTAGCTGTAATAAGAGAGGAAACATTTACAGATCTTCAGATTCAAAATCATCCCAAAACTAAGCGGTCTCAACTTATACCGCGTGCCCGTTTATTTGACAAACCTTGTTTTAGAGACAGATTAATTGGTCGTGCAAAATACCAAACCAAGGAAGAAGTTTTGGAAGGCATTTATATTGACAGTACTGTATCTTGTTTTGGAGATATAAATAGACAAAGAGTAGACGAAGAACCCTCAAATCCTCCTGATAACGTAGAAAATAATTCTGCAAAGGAGGACGTTGTATCAAGAAACGAGAGCCATCATTCCCGGTCTATTAGCGGAAGTTGGCCTAATTTACATGGTGAAACGAAGCAAACCAACGGCTCTGAAGAAAGTATCGGCCATTCACGTAATCCAAGTGAAATCGATAGCTTAGATTCTAACTACGTTAGAAAACATTATAACTTTGAAGCCCATTTAAAAAACTATAAAGGAAGTTCTCCAGAGACAGATAGATCTATAGCATCGCCTAATAATAGTAAAGATCTATACATTAGTACTGACGAGCCAGACAGTGATAACGAAGAAGACAAGTCTAAAACACAACGCTCTTCGGTTAAAGAACATAATATATTAACTCAAAAAGTTGTTAATGAAAAAATTGTCTTAAATTCTGATGAGATTTATAAAGCAGAATCACCCGATAAGGTATCTTTACAAAGTAAGCAGAGTTGTAACATAGAGAAAGAGAATAAAGCAACAGAAGAAAACAAACAAGAGAAAAAAGATTCAGACAGTCAGTCGACAATTTCTGAAAACGACGAAAACATTTCTAGTCCAGCTGATAGCATTACATCCTATATATCAATATCTATTGCATCTTCAACGGACAAATCACACAAATTAGAATACTTAGCAAATTCACTTGCTGAGAAAATTGACGAATACTGTGATTCACAAAACGCGTCTGTGAATTCACCTACAATCCATTCTGAAACTCAAACGCTTGACACAGATCCTATCTATACCAAAGTTCAAAAACAAACATTTGCTGACTTAAAAAAACATTCTTTTACCAGGAAACCTCGTGAATCCAAAAAAGTAAAAATAAAAACACCCACAAATGAAAGTTATGTAACAAACACATTTATTAAATGCAAAGACTACAATGAAGATAGCTATTGCACATCAACAAAAAATATAACCACTGAACCTTTTGTTACAACTTTTATAGAAAATCAATACTACTCGTTACCGGATATAAATATAAGTAAATGTTTGAGAAAATCAGAGAGAATAGACGCTCAATTAAGAGAAGAAGATCCAGAAGACATACCTTGTGAAAACACATACGAAGTTGCTCATACACACGTCAGAGAAATTTCGCATCATTCAAATGGTGAAAGCTATGGTCAACTTAATAAAATAAGCCCAAAAATCTCACACAAAAGTCAAGTAGACATTGAAGATAACTCTTCCTATTATAAAGAAGAATCGGAACCTCAACTACTTCAAAACTTTACAAAACTGGAAGACGATTTAAAATCCATAATAACTATTGACAGTTCTTATGCAGACGATACTATTTATTACCAATTAGATCACCATCAAAACGAACGAGAAGTAAATGAAATTGAAGGCACAATACGAAATAATAAAATAATTACAAAATCAGAGAGCTTAAAGTTAGTCTCCAATATATCTAACAAACCGGAAATAAAGCTATTGAAGTCACTATCCAACGACAATATAAATACATCTCTTCAAAGAAAAAGAAGTAAAACTGGGATTCATAAACACTATTCACTTCGTCAACGTAATCCAACTGGTACTGAATCTATCCGTATACCAAGCCCACACCAAGTAGAAAAAAGTTTCAGTAAACCAATTCATCGTAAAATTTCAAATCTATCATTTAGTGACCAAATCAAAAATCATTCACTAATTTCGCGAGTTCAATCTTTTAAAACATCGCCAGATTCGAATGTCTCCATTATTCCATTAAGTGGTCATCAAACTATTGTCATTGATCCCCCTACACAATCTATACCAGAACAACAAACTAAATACTCAGTAAAGGAAAAACAGTCTGAAAGCAATGATAATATAAATTCGAAACCAAGTTACCAAAATCCAACGACTATCAAAATAACAACCAATACCGAAGACGGTACTTGCGAAAACAAAATATTTACAAATAATATAACAGGAGATCCTTTTATAACAATCAAACTAAATAAAATCGTTAAAAAGAATATAGAAAACCAGTACCAAAGACCGCCGTCCAAGTTGGTTATAAACGACAGTAACAATAATCTTGCAGACACTAATTTATCAATCACTATGACCCGACCACAAGTTCTTCAAGTCATCGATTCAAAAAATAAAAAGCTGAATGAAGTTGTTGCAGATAAAAATAAAAAAGGTATAAGTGCAGTTAAAATGAATAACTTAGAGAGTAAAAACGACTTTAAAGAATCCAAATTCGAAAATAATAATGTAAAAGAAAAACTTAATAACGCAATAAAAACGGATATAGAGTATCAAGAAAAAATTAACTCGGTAAAAAACTACTGGTCCAAACTTATTGAAAAATCTCCTGATTATCAAAATAAGGAAGACGATAAGGACAAGACAGAAGGAACAGTGGAAAATATAAACGAAGAAAACGCAAATCGCGATGAGAATGACAACAACAATCATAAATGTGTTCCAGAAGTTTCAGTAGGGAGCATTATAAAGACATTAGAAAGTGCAAAAATAGTAGATAGTGTAAAGAAAGTAAATCAAACAAAACTACAGTTGTGGAAAGAAGAAACAGAAAAAGTTAAGAGTGATTCTGAAATTGAAGAAGCACCAGTGGAAAAGATAGTTAAAGACACAACAAGAACTATTTACGATCACCCGAAAATCGAAAAACCATCTACAAAATCCGAACAGGAGATATGTAGAGATACTCCAGAGATCGAAATTGTAGAGTTAAGCAGTGACAATCAAAACCAAAAAACTCAGGCAACCTTAATTAAAGCTAAAGGATATGAGAAGGGTTGTGACGAATTTGACCATGTAAGGTATAAAGTGATGAAATCAGAATTGTTTAAAAACAGCATGATAGCGAATTATCGAAAAGAAGCCCAATTCGATGGTCTTTTACAATATCTCCAGGATTACAGTTTTCAAGAACTACTAGTTAACAATAATATAGTAATAATAGAACCCGTCAGAACTAAAGTTGAACCAGCTCCTAGAAAAAATACAACAGACACATGTAAGATACCACCCACATTACTTAAGAAACCTGATTGTACATCACAAGTGGACAAATCAAAGAATGCCATTCGCAGGCACTTTTTCTATCATCCAATTAGAGTTAATAAAGAAATCATAGAAGAGGAACTTCCAAATCCAGATACAGTGAAAAAGGTACGTAACTTATTTGAGGACACCCTTAAAATGAAGAACGAATCAAATTCTATCCAGGCAACAAACATACCGATGGATAATGTCCAAGATACAAGAGAGGATATGAGGCATGCAACTCAAACTGACAAGCAAGAAAATGACGACCCACACAGCTTTGAAAAGAAGTTTTACTACGTTAACGACAGTGTAGAACAAAGAAACGTCCCAGTGGGAAAATTCGGGGAGATGATTTTTGAAGAATTTGAGGTTTTAGAAAACTGTTACGATAGTTTAAATAGTAATAAGTCCCCTTAA

Protein sequence:

>DPOGS210121-PA
MDIRVVFVLHGVIDHLTQHSVDSVASYSSQNHVDRIRTQNFSAASCQDECDFGSIRTSLFRRSSSNTEIPIKRPESVTSNTSSTRSRLRDFVERKSSQDEHRPVANTVADIQYFENPTETLEFDNPRNSLESKKSQDENTKKRKFSNKDKDCENKIKKEDKYQSKLAEYYKLPLQLPQDEFYQHLTRSKAAEEFLQKRFPNSDTEFSASYGRLCKHKDVEGSNLRRSRSLAVIREETFTDLQIQNHPKTKRSQLIPRARLFDKPCFRDRLIGRAKYQTKEEVLEGIYIDSTVSCFGDINRQRVDEEPSNPPDNVENNSAKEDVVSRNESHHSRSISGSWPNLHGETKQTNGSEESIGHSRNPSEIDSLDSNYVRKHYNFEAHLKNYKGSSPETDRSIASPNNSKDLYISTDEPDSDNEEDKSKTQRSSVKEHNILTQKVVNEKIVLNSDEIYKAESPDKVSLQSKQSCNIEKENKATEENKQEKKDSDSQSTISENDENISSPADSITSYISISIASSTDKSHKLEYLANSLAEKIDEYCDSQNASVNSPTIHSETQTLDTDPIYTKVQKQTFADLKKHSFTRKPRESKKVKIKTPTNESYVTNTFIKCKDYNEDSYCTSTKNITTEPFVTTFIENQYYSLPDINISKCLRKSERIDAQLREEDPEDIPCENTYEVAHTHVREISHHSNGESYGQLNKISPKISHKSQVDIEDNSSYYKEESEPQLLQNFTKLEDDLKSIITIDSSYADDTIYYQLDHHQNEREVNEIEGTIRNNKIITKSESLKLVSNISNKPEIKLLKSLSNDNINTSLQRKRSKTGIHKHYSLRQRNPTGTESIRIPSPHQVEKSFSKPIHRKISNLSFSDQIKNHSLISRVQSFKTSPDSNVSIIPLSGHQTIVIDPPTQSIPEQQTKYSVKEKQSESNDNINSKPSYQNPTTIKITTNTEDGTCENKIFTNNITGDPFITIKLNKIVKKNIENQYQRPPSKLVINDSNNNLADTNLSITMTRPQVLQVIDSKNKKLNEVVADKNKKGISAVKMNNLESKNDFKESKFENNNVKEKLNNAIKTDIEYQEKINSVKNYWSKLIEKSPDYQNKEDDKDKTEGTVENINEENANRDENDNNNHKCVPEVSVGSIIKTLESAKIVDSVKKVNQTKLQLWKEETEKVKSDSEIEEAPVEKIVKDTTRTIYDHPKIEKPSTKSEQEICRDTPEIEIVELSSDNQNQKTQATLIKAKGYEKGCDEFDHVRYKVMKSELFKNSMIANYRKEAQFDGLLQYLQDYSFQELLVNNNIVIIEPVRTKVEPAPRKNTTDTCKIPPTLLKKPDCTSQVDKSKNAIRRHFFYHPIRVNKEIIEEELPNPDTVKKVRNLFEDTLKMKNESNSIQATNIPMDNVQDTREDMRHATQTDKQENDDPHSFEKKFYYVNDSVEQRNVPVGKFGEMIFEEFEVLENCYDSLNSNKSP-