Monarch geneset OGS2.0

DPOGS202137
TranscriptDPOGS202137-TA4791 bp
ProteinDPOGS202137-PA1596 aa
Genomic positionDPSCF300193 - 32080-56508
RNAseq coverage442x (Rank: top 28%)
Annotation
HeliconiusHMEL0146250.071.16% 
BombyxBGIBMGA001510-TA0.067.41% 
DrosophilaCrag-PA0.055.95% 
EBI UniRef50UniRef50_Q7PWX70.048.35%AGAP001102-PA n=1 Tax=Anopheles gambiae RepID=Q7PWX7_ANOGA
NCBI RefSeqXP_322065.30.048.35%AGAP001102-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1187952990.048.35%AGAP001102-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1187952990.048.50%AGAP001102-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[298-483] IPR0011942.5e-64DENN
[552-626] IPR0051128.9e-26dDENN
[161-269] IPR0051133.4e-23uDENN
Orthology groupMCL10740 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202137-TA
ATGGATGAACGACGAGTTGCCGATTATTTTGTAGTAGCTGGCCTACCAGAGGTGCCAGAAATACTGGATGATTCTGATTCTGGACATTTAAAAGGCTATAGCACTAGAGCCCCTATAACTGATATCGGAGTAATGTTCCCGGGGCTGGGGGAGAAGGTGCCAGATGGTTATGAAATGCTGGAACTGACTCCTACAGGTCTGCCAGCTGATTTGAACCATGGCTCGATGAGGTCACCAGAGTGTTTCCTATGTATCAGACGAGGACGGGATAGACCTCCCCTCGTTGATATTGGTGTAATGTACGAGGGTAAGGAACGTCTGATGGCTGATGCGGAAATGGTCCTGCGTTCCGTCGAAGATAGAGTTGCTAACGTGAACAACTCGTCAGCCAAGACGTTTATAACGTACAGACGGGCGCATCCGAACGCGCCGTGTAACGCGCTCGTCGTGGTCGACGTTTGCGTCATAGTCGAGAGCAGAGGAGAGACGCCGCCTCACGCCTTCTGTATGATACCCAAGAATTTGAATAAAGGTCTAATGGGGAGTAACGTTTTCCTCTGTTACAAGAAGTCAATGAACCGCCCGCCGCTAATTGCTTATAAGCCAGAAGTATTGTTTAGATACCCTCAAGTAGACCGTCGTAGTCTAGCGTTTCCTACATCAGTACCATTATTCTGCCTTCCAATGGGAGCTACCTTGGAAGTGTGGCCGAACAACGCGTCATCACCCAAGCCAGTCTTCTCAACCTTTGTACTGACGGTCGCTGACGCTACCGACAAGGTGTACGGTTCCGCGGTGACGTTCTACGAGCGTTACACCAGCCCGCTGTCTGAGAGCCAGATGGACCAGCTGGGTTGGAGGGCCGGTGTCACTCACATGACACACTCGCTACACGCCAACAAGTCAATATGTTTACTATCCAGGTGGCCCTTCAGTGACACGTTCGAGAGGTGGCTGTTGTATATACTTGAGATGTCTTGGAGCAAGGAACCACTAAACATACCTATTGAAAGATACATAACACATTTATTGGAAGAAGTACCGTTCCCCGAGCCCAGGATATTATTACAGTTATCACCAACTAATCCTCACGACCGTGTGATAGTGACCCGGCGGGATGATCAACCTTTAGTGCGGAGCGGGGCCGGCTTCAGACAACTTCTGCTTAATTTAGGACCAGACAACTGTTTGTTACTCCTGGCGTTAGCTATCACAGAACAAAAAATACTTATACACTCTCTTAGGCCAGACACATTAACTGCAGTCTCCGAGGCAGTGTCTAGTTTCCTCTTCCCCTTCAAATGGCAATGTCCCTACATACCTCTATGTCCTTTGGGGCTGGCAGAAGTGCTACACGCTCCTCTGCCTTACTTGATAGGTGTTGACTCAAGGTTCTTTGATCTTTATGAACCACCGCCTGACGTGACATGCGTTGATCTAGATACTAATAATATTACGATATGCGAGTCACAACGGCATATATCATTGAAACTGTTACCTAAAAGACAAGCGAGGGTCCTGAGACAAACATTGGACCAGTTACTATCAAACATACGACCCGCGTCTCCGGTAAATTCATCTGGTGATAAACACAACGGCGAACCAACTACCAGTTTAGATAGAGATTTCCAGAAGAGAAAGAAAGAGCAAGCATTGGAGCTTAAAATCCAGGAAGCCTTTCTAAGGTTCATGGCGGTGACCTTTCAAGGTTATCGTTCATTTCTAATACCTATCACTAAAGCGCCGACCGTGGGTACAACAGACCCGCACGCTCTGTTCCATATGGACTCATTTTTGAGATCAAGGGACAAGACCCACCAGCGTTTCTTCGCTCTGACGATGCGGACACAGATGTTCACTCGTTTCATAGAAGAACGTTCGTTCGTATGCGACGCTGATCAAGCCTTGTCCTTCTTCGACGAATGCATAGAGAGAGTTGCTAGTGAGGAACCCTTACTAGGAATGGACGATAGTAATACGTCTGAGAGGACCGTGTTTGTACTACCTCCGGACCCGCCGGATACCGAACAGCAGTACACGTACAATAAGTTTATATTGGACGAGCAGCTGGTATCTCTATGTCATAGTACCCGCGGGTCATTGACCTGCGCCCCCGCCGCCGCCCTCGCCTCAGTAGAGTCGCTAGCTGACGCCTCACCCATGGCGAGACGAACAAAACAGGAGATAGCGGCCGCGCAACGTATTATGTATATATATATATATATATATATATATATAACTTTACGTTTTGCGTTCGAGCTACTGGAACGAGCTACTAAGCTGAGGGTACCCTGCGATGAGGTGTGTTACCGTGTCATGATGCAGCTGTGTGGTATCCACTCACTGCCGGTGCTGGCTGTGCAGCTATTGTTTCTGATGAAGCGGGCTGGTCTTCAACCTAATGCCCTTACATATGGTTACTACAATAGATGTGTGTTGGAAGCCGCCTGGCACAAGGATATGCCCAGCGGATCTCAACTCATGTGGAACAAAGTTCGTATAGCGATAATGGGAGTGACTTTGTTCCGTAAGGCGGGGGCTTTGAGAGCCAGTCGAGCGGCAGGAGCTGCGGGAAGTACGGGTACGTTGCCTCGTGTGCGTACGGTGGGCGGCGAGGGTGCAGATCTGACTGCATTGGCTCTCGCTGAACCGACGCGTAGCAGATGTAGCCTGGATTCGGCATGCGATTTGAACGCGTCGGTCAGCACCAATACGGCTTCGACGACGGCCTTCGAAGCCCTATACTGTAGAGGTAACATCGTCCGCGCCCCTGCCAGTCACCCGCGGGCTCATCAAATTTCATCGACCGCCGGTATACTCATTTCCGGCCTTCCATCAGATCCAGATCTTAGTTCTACAACCAGACCTAGAAGTAATTCGTTAGGCAGCGAAGAAGTCGAATCTTCTGTGTCTATAACGGAGAAGAGACAGACTATCCACGTAAGTCCCGACAGTCCATCCGATCTCAGGATACTGACACGATCAGAGAGCTTCGCTGGGGACGCACAGATAGTACAGAACCTACAAAGGCTATCGTTTTCTAGTAGCACATCAAATACGAAATCTCGATGTTCGAGGACGCTGAGCTTCCCCGAGGAACCCGAAAAAGAAACGGCGCTCGTCGACAAAGTCGAGAAAACTATCTCGTCACCTCTCAAGGTGTCCCCCCGCACCCCGGTGTTGGCGGATGACCCGCTGGGCGCTCTGTCCGTGGAGCCGAGTAGCCCCGCCCCCGCCCCTCCTACCACGGACGTGCCTCTACCACGACACGAGTTGAGCGTCAGCCCACGACTGTTCCAGAGGAGCAACTCTTTCACTGAGGAACCGGAAACAGTCGGGAAACTCCACAGAAGCGAGACGGCGCCAGCAACAGTGTCATCAAGCCTGGCCTCTATAGGAAATACGCTCAAAATCAGTTTTGGACGTTATTCACCAGCAAGACTGTCGTTGAGGAAGGATAATATGAACATCGGAAAGGCTATGATCGAGAACTATTTCAGTCCAACGAGCATAGCTGGGAAGAAATCGAACGAGCTCTTACAGAGCGGTCTCAGCAGCTTGAAATCAGCAGCTACGAGTATGGCTAAGAAATTCGATGAGATGAAAGAAGTGATATCGGCTAATTCGACTCCGGTCAAAGGCGCTATAGGCAACGCCACGAGCGCCCTCACTAACTTCAGAGGCGACGATGACTCGGGAGACGGCTCATCAGAGGTCAATCAGAATGAGTGGTCGGGCGGCGTGGGCTTCAGGCGCGCGTCCAGTGACGCGGAACTGGCGTGTTCTATGGAGAGAGGTTCTCTGGCTACACTTTTATCACATCTACCCGACAATCTGTATCCCACGCAGTATGATAATTCAAAGTCCGAGAACCCGTCGGTGGAGGTCCGTATGACGTCATGTTCTCAGTGTCACCAGTGCCTGGCGCTGTTGTACGATGAAGACATCATGGCGGGCTGGGCGGCCGACGACTCCAACCTCAACACGCGCTGCACGGCCTGCGGGCGACACACGGTGCCGCTGCTGTCTGTACAGGTGCGGTACACGGAGGTAATCGTACGAATGGATCAAACACAGGCGGAAACATTGACTGTGCCTTATTTAAATCCTTTAGTACTGAGAAAGGAATTTGAATCTATATTAGGAAGAGAAGGCGACGCTTGTTTGGCTGAACGCGAATTTGTGGAGTCGCATCCTATAGTATACTGGAATCTGGTGTGGTTCTTGGAGCGAGCCAATATAGATAATCATTTCCCTGACTTATTATGTCCGAATTTTTCCGTCAAATACCAGAGTTCGGATCCTTTGCCGGATGTGGACAAAATGACAGTCGGATGTCGCGTTGTTTGTTCGTGGGAGGGTGCTCGTGCGGCGGACTGCGAAGCCCCGTCCCTCCACCGGGCGTGGCGCGCGAGGAGGACTCAGCCGCGGTCGAGGCAACTTAGAGCTCTGCTACTGTCACATCACGACAGACCGACAGACTCTATCGTGGCGACCATATTGGACGGTCTCATGAGTAACGATCTATCAGACGCTGTAAGAAAATTGGCGGCGTGGAGAGAGTCGACGTGCGCTAACAAAAGATATCACTCGTATTACAGAGACATTCTATTCCTGGCAATGGCTGCACTCGGAGAGCAGAAGATCAATGTGACGGTGTTCCAGAGGGAATATACGCGAGCCATCGAACAGCTGGGAACAGAGGCTCGGCCGCAAGATCTGCCACCCTCACCTACAGCTGTCTGCTGTAGACATTACTTTAAGAGACTTACACTCGACGTAGACGACTAG

Protein sequence:

>DPOGS202137-PA
MDERRVADYFVVAGLPEVPEILDDSDSGHLKGYSTRAPITDIGVMFPGLGEKVPDGYEMLELTPTGLPADLNHGSMRSPECFLCIRRGRDRPPLVDIGVMYEGKERLMADAEMVLRSVEDRVANVNNSSAKTFITYRRAHPNAPCNALVVVDVCVIVESRGETPPHAFCMIPKNLNKGLMGSNVFLCYKKSMNRPPLIAYKPEVLFRYPQVDRRSLAFPTSVPLFCLPMGATLEVWPNNASSPKPVFSTFVLTVADATDKVYGSAVTFYERYTSPLSESQMDQLGWRAGVTHMTHSLHANKSICLLSRWPFSDTFERWLLYILEMSWSKEPLNIPIERYITHLLEEVPFPEPRILLQLSPTNPHDRVIVTRRDDQPLVRSGAGFRQLLLNLGPDNCLLLLALAITEQKILIHSLRPDTLTAVSEAVSSFLFPFKWQCPYIPLCPLGLAEVLHAPLPYLIGVDSRFFDLYEPPPDVTCVDLDTNNITICESQRHISLKLLPKRQARVLRQTLDQLLSNIRPASPVNSSGDKHNGEPTTSLDRDFQKRKKEQALELKIQEAFLRFMAVTFQGYRSFLIPITKAPTVGTTDPHALFHMDSFLRSRDKTHQRFFALTMRTQMFTRFIEERSFVCDADQALSFFDECIERVASEEPLLGMDDSNTSERTVFVLPPDPPDTEQQYTYNKFILDEQLVSLCHSTRGSLTCAPAAALASVESLADASPMARRTKQEIAAAQRIMYIYIYIYIYITLRFAFELLERATKLRVPCDEVCYRVMMQLCGIHSLPVLAVQLLFLMKRAGLQPNALTYGYYNRCVLEAAWHKDMPSGSQLMWNKVRIAIMGVTLFRKAGALRASRAAGAAGSTGTLPRVRTVGGEGADLTALALAEPTRSRCSLDSACDLNASVSTNTASTTAFEALYCRGNIVRAPASHPRAHQISSTAGILISGLPSDPDLSSTTRPRSNSLGSEEVESSVSITEKRQTIHVSPDSPSDLRILTRSESFAGDAQIVQNLQRLSFSSSTSNTKSRCSRTLSFPEEPEKETALVDKVEKTISSPLKVSPRTPVLADDPLGALSVEPSSPAPAPPTTDVPLPRHELSVSPRLFQRSNSFTEEPETVGKLHRSETAPATVSSSLASIGNTLKISFGRYSPARLSLRKDNMNIGKAMIENYFSPTSIAGKKSNELLQSGLSSLKSAATSMAKKFDEMKEVISANSTPVKGAIGNATSALTNFRGDDDSGDGSSEVNQNEWSGGVGFRRASSDAELACSMERGSLATLLSHLPDNLYPTQYDNSKSENPSVEVRMTSCSQCHQCLALLYDEDIMAGWAADDSNLNTRCTACGRHTVPLLSVQVRYTEVIVRMDQTQAETLTVPYLNPLVLRKEFESILGREGDACLAEREFVESHPIVYWNLVWFLERANIDNHFPDLLCPNFSVKYQSSDPLPDVDKMTVGCRVVCSWEGARAADCEAPSLHRAWRARRTQPRSRQLRALLLSHHDRPTDSIVATILDGLMSNDLSDAVRKLAAWRESTCANKRYHSYYRDILFLAMAALGEQKINVTVFQREYTRAIEQLGTEARPQDLPPSPTAVCCRHYFKRLTLDVDD-