Monarch geneset OGS2.0

DPOGS206140
Transcript        DPOGS206140-TA  4218 bp
Protein           DPOGS206140-PA  1387 aa
Genomic position  DPSCF300028 + 1237372-1253827
RNAseq coverage   279x (Rank: top 39%)
Annotation
Heliconius      HMEL008778        3e-163  86.97%
Bombyx          BGIBMGA000509-TA  0.0     62.67%
Drosophila      ph-d-PA           6e-20   57.35%
EBI UniRef50    UniRef50_E5RWY0   0.0     63.77%  Polyhomeotic n=1 Tax=Bombyx mori RepID=E5RWY0_BOMMO
NCBI RefSeq     XP_002428662.1    1e-40   40.59%  polyhomeotic, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|319803029      0.0     63.77%  polyhomeotic [Bombyx mori]
NCBI nr blastx  gi|319803029      0.0     66.21%  polyhomeotic [Bombyx mori]
Group
Gene Ontology    GO:0005515  3.4e-16  protein binding
KEGG pathway
InterPro domain  [1303-1379]  IPR013761  4.6e-21  Sterile alpha motif-type
                 [1308-1377]  IPR010993  3.4e-16  Sterile alpha motif homology
                 [1314-1377]  IPR021129  3e-12    Sterile alpha motif, type 1
                 [1312-1379]  IPR001660  1.1e-07  Sterile alpha motif domain
Orthology group  MCL22622  Insect specific

Nucleotide sequence:

>DPOGS206140-TA
ATGGAGAGCGTTCAGCTAACGGGGGAGTTGAGGGGAGGTGAGGGGAAGTCTGAAATATTAGAAAATATTCAAAAGAAGAAAGACCAAGATAAAGATCAGCCTCAGAGTCCAGGATTCCAGCAGCAATTGCCAAAGCCACTCGATAAACAGATCCTAGCTGATGTACAGAATGCTTTAGCCCAGCATGTCCCAAAACTTATATCTTCATCGCATACCTCCACTGCATCGATTCAGTCAAAAGACGAGACAACAGAAAAGACAATAAAAGTTGCAAACTCAACTACAATAACATTAATAGAAAACACAAGTCACAGTGTTGAAAAAGACAAAGATAACAATTATCCTAAAAATTATTTCACATTAGAAGCTAAAAATGATAAAAAGGATGAGGCTAACAAAAACAGCATTACTATAACAAGAACAGCACCGAAACAGTCACCTGGTGGTAGTCAAAGTGACAGGCCCAAGGGCAACTCGAAAGCCACAAAGAGGCCTTTACAATACTTAGAAACACTCGCAGAGAAAGCTGGTATCACATTCGAGGATAAATATGAAGCTGCAAATACACTTTTAGCTCTTGACAAACAAAACAGTACCTTCCGCAGACCGGAGATTAAACAGCCAAAGCTTGAACCAGAAACTCATAATCAAGGTGAAGAATATCGTTATAGAAATCAAAAGGAAGAAGATGATAAGTTACAAATACAACAGCAGATAATCCAACAGCAGCAGCAGCAGCAACAGCAATTGCAACAGCTCCAACAACAGCAATTACAGCAACAAATCCAACAACAGTTGCAACAGCAAGTTTTGCAGCGTCAGGCTATACAGCAACACTTCAAAAACCAACAAGATCAACAGGAGCTGTTGCAAAAACAATTTCAACAACAACAACAGCATCACGCTCAACAACAAGCCCAACAACAGCAGCAGCATCAACAACAACAGCAACAGCAGCAGCAGCAACAACAGCAGCAGCAGCAGCAGCAACAACAACAGCAACAATCACAACAACAGCAACCTAAAACTGATATACAGCAATTATCCCAGCAACAAGACCTACAGCAACAGAAATTTCTTCAGGTTGTGAATCAGCCGGTACATACACAGCAGTTTAACGTACCCGGCATCGGCGAGATCAATCTCAGTTTCCTCGCGTCACCACAAAATAACGTTAATCTACTTTACGAAAAAAATCAAAAAACCAGCATGAGCGGAGACATTAAGCCGCAACAGATAGTTGATGGTCAGGCGCAGGTGGTTCAGCAGCAGATCAACATGTCTCAGCACGAGCCGACCCAGCACCATCAGACGGTCACAGTTGTGTCCTCGATGCCAAGCAGCATGGCACCACATCAGGTGCAGCAACAGAGCTCTGGAGGGCTGCAACAACAGGCCGCAATGCCGCCATTACAGAGTCTGCCGAACACCCAACACCCGCAACAGATAAGCGCTGAGTGGGGACACAGCCGCGTGCAGGTCATCCAGCAGCCGCTGCAGAACAGCACGTACCTTCAACAGCTGTATAATGCTCAAGGGCCGCTACTGATGCCGGGGAACATAGCTTTGCACCCGGGAATCAATTCACCCCAGATACAGGTCATTGCTGCGGGGAAGCCGTTCCAGGGAAACCAGCTGGCTCCGCATATGTTGACGACCCAGGGCAAACAAGTACTACAAGGACAGGCTGCCCCGTTCCCGGGATACACGACTATCCCGGCTATCCCGACAACCCAGAACCAGACGTTCGTGTTCAGTCCACTCGGCGTGATCAACTCGCAGTCGAGTATACTACCAGCTCACTCCCAGCCGACTGTGTCTGGGATAGGACAACAGCAAAAAACTTCTGACATGCACAAGGTGATGAGTGGTGGGAAGGTCGGTGGTAAAGTCGGTAGCGCGGTGCCGGTGCAGGCGCAATGCGTCCAGGTGTCCCAGCCCGTGCTCGGCCAACAGCAAGCCCAGATCATCAGTCCGTTACAGACGGGTGGTCAGATGCA
GTTCGCGCCATGGCAGATATCCGGGGCCCTGCCGCAAGTGTGGGCAGGAGGTCTGCAGGCGGGGGCCCTGCCCGCGGGGGGCCTGCTCGCCCCCAACCCTATATTCATTAGGGGGACCCAACCCGACGCCCCATCCATGTTCATACAACACTCACCACAGAATAACGTTCAACACAACAATGTGAGCGTGGCCTGTGCGACGGCGACCACGTCGAAGCCCCGGGCCTCCAGTGAGGGAATGACGAAGACGTCTCGTCCTCTTTCCAACATTCTACCATCTAGCGGCATCAGACCCGCTTCATCTGTCTCCACCCAGACTAATACTAACCAAGCACAGAATCAGGCAAAGCAACGCGGCAAGCCAGGCGTACGATCCCCAGCACCGGCAGCCAAACAAGATGCTGCTAACCAAACAAACAAAATGCAGCATCAAATGCAGCAAACTAAACAGTTGCTGGTTATGAATTCTAGTGGACAGATGGCTCAAATTTCGAGCGTGTCGGAAAAACAGACGATTAACAAGAACATTCAGCAGCAGACAATCATACAGCAACAGCAGTCCATTCAGCAGCAGCAACAGCAGCAACAACAGCAGCAGCAGCAACAACAGCAGCAGCAGCAACAACAACAACAACTACTTCAACAGCAACACCAAACCCAGATTGTACAGCATTATCAACAACAAGGAATGACTTTGGGCATGCAGCAGAGTACACTTCCGGTTGGTTCTGTGTCCCAAACTTTGCCAATGACTGGGATGCCGCAAACTGTGTCTATGGTGCAGCAATTACACTCGTTGAGCGGTCTCGGCCAAACAAGTGCTTTAGTGAGTCAATTGAACAGTACTCAACCACCAAACTCACTATCCCAAGTGCCGTCACTTCAACAGCCACAGTTGGTGAGCAGTGGTCTCGTTCAGACGTCTGTTGGCATGGCAGTTCAACAGCCAACCGGCTTGAGTCATACCACCCAGATGCAAGCGCCCGCGTTGGCGCTCGATGGATCGCTGCTGACTCCACTCGTCGTCTCACCGGCGATATTACACCACGAGGTAACAACAAACAATCTAAGTTCGCCGCAACTGACTCTACAGGGCCAGGGGACGGTTGGTCTTCTAGCGGCACCGCTGCAGGGGTCCGTGCTGCCCTTGTCCCAGACCCTCTCGCAGACCTTGACTCAGACCCTACTCCATGTGAAGAGTGAGGAAGACAAGACACAACAAATGCCGCCACCGCAAAGCTCCGTTGTTCCGCAGTCATCACAGCCAATGGATACCAGCGACGCTACATCAACAGTATCATCACCAAGTCCTACCACCACCACGGTCAGCGCAGACGCTGCTGTCTCTACCACTACCGCTTCCGGTCCAAAAACTCCAACCGACACTCCAAAACCGAGCCCAAGTAAGGAGTCTCCGGAACAACCAGCGACCACAGCTTCACCAGCCACCAGCAGCGCCCAGAGCACCACCTCTCTAACGCCACAGGTGATGACAACTCTGGCTTGCTCTACAACAGCAACTGTTCCCACTTCCATAACCACGCCGGTAACAAGTAACTCGCTGTTCAAACCAGCTCAATGTCCTCCTCGTCATATCAACCAGCAGACAGCTCATGATAAAACCTTGCCGAAAGCCATGGTGAAGCCGAATATACTAACCCACGTTATTGAGGGCTATGTTATTCAGGAAGCTGGTGAACCATTTGCTGTTAATAGACCTCTCCGCGAGTGGGGCACGGATAAGGAGCAGGACAAAGAGAATAAACTGCCCTCAACAGACGAACCGCCCAGAAAGAAACAAATGTTAGAAAACGGCAGTAGTCTACCAAGGATATCATCAAGCAATGAGTCCAGTGAAAGCTCACAGAGCTCCAAGTCTGATCCCACACCGCAGCCCGAGGCGCCGTCGGAGGAGTCCCCGAAAATACCCAACGCTAACAAGTGGACCGTGTCCGAGGTGTGCGACTTCATACGCAGTATCCCAGGCTGTGCCGGTTACG
CGGACGAGTTCCTTATGCAGGAGGTCGATGGGGAGGCTCTGCTGCTCATTAAGCCTGAACACCTGGTTATGGCGCTCTCTATGAAGCTGGGACCAGCATTAAAGATAGTCGCATGCATTGACTCGCTGCGGCCGGAAAGCGAACAGACAAATGATCATGACTGAGGTAATATATACATACATATATATACTATATATATATATATGTATATAGCTTAA

Protein sequence:

>DPOGS206140-PA
MESVQLTGELRGGEGKSEILENIQKKKDQDKDQPQSPGFQQQLPKPLDKQILADVQNALAQHVPKLISSSHTSTASIQSKDETTEKTIKVANSTTITLIENTSHSVEKDKDNNYPKNYFTLEAKNDKKDEANKNSITITRTAPKQSPGGSQSDRPKGNSKATKRPLQYLETLAEKAGITFEDKYEAANTLLALDKQNSTFRRPEIKQPKLEPETHNQGEEYRYRNQKEEDDKLQIQQQIIQQQQQQQQQLQQLQQQQLQQQIQQQLQQQVLQRQAIQQHFKNQQDQQELLQKQFQQQQQHHAQQQAQQQQQHQQQQQQQQQQQQQQQQQQQQQQQSQQQQPKTDIQQLSQQQDLQQQKFLQVVNQPVHTQQFNVPGIGEINLSFLASPQNNVNLLYEKNQKTSMSGDIKPQQIVDGQAQVVQQQINMSQHEPTQHHQTVTVVSSMPSSMAPHQVQQQSSGGLQQQAAMPPLQSLPNTQHPQQISAEWGHSRVQVIQQPLQNSTYLQQLYNAQGPLLMPGNIALHPGINSPQIQVIAAGKPFQGNQLAPHMLTTQGKQVLQGQAAPFPGYTTIPAIPTTQNQTFVFSPLGVINSQSSILPAHSQPTVSGIGQQQKTSDMHKVMSGGKVGGKVGSAVPVQAQCVQVSQPVLGQQQAQIISPLQTGGQMQFAPWQISGALPQVWAGGLQAGALPAGGLLAPNPIFIRGTQPDAPSMFIQHSPQNNVQHNNVSVACATATTSKPRASSEGMTKTSRPLSNILPSSGIRPASSVSTQTNTNQAQNQAKQRGKPGVRSPAPAAKQDAANQTNKMQHQMQQTKQLLVMNSSGQMAQISSVSEKQTINKNIQQQTIIQQQQSIQQQQQQQQQQQQQQQQQQQQQQQLLQQQHQTQIVQHYQQQGMTLGMQQSTLPVGSVSQTLPMTGMPQTVSMVQQLHSLSGLGQTSALVSQLNSTQPPNSLSQVPSLQQPQLVSSGLVQTSVGMAVQQPTGLSHTTQMQAPALALDGSLLTPLVVSPAILHHEVTTNNLSSPQLTLQGQGTVGLLAAPLQGSVLPLSQTLSQTLTQTLLHVKSEEDKTQQMPPPQSSVVPQSSQPMDTSDATSTVSSPSPTTTTVSADAAVSTTTASGPKTPTDTPKPSPSKESPEQPATTASPATSSAQSTTSLTPQVMTTLACSTTATVPTSITTPVTSNSLFKPAQCPPRHINQQTAHDKTLPKAMVKPNILTHVIEGYVIQEAGEPFAVNRPLREWGTDKEQDKENKLPSTDEPPRKKQMLENGSSLPRISSSNESSESSQSSKSDPTPQPEAPSEESPKIPNANKWTVSEVCDFIRSIPGCAGYADEFLMQEVDGEALLLIKPEHLVMALSMKLGPALKIVACIDSLRPESEQTNDHD-
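
Note: the transcript and protein records above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code and a coding region that starts at the first base of the transcript. The names `translate` and `CODON_TABLE` are hypothetical helpers, not part of the geneset tooling, and the `-` stop marker follows the convention used in the protein record above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate from the first base; stop at the first stop codon.

    The stop codon is written as stop_symbol, matching the trailing '-'
    in the protein record. Trailing bases that do not fill a codon are ignored.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of DPOGS206140-TA, copied from the record above.
print(translate("ATGGAGAGCGTTCAGCTAACGGGG"))  # MESVQLTG
# Bases around the stop codon near the 3' end of DPOGS206140-TA.
print(translate("AATGATCATGACTGAGGTAAT"))     # NDHD-
```

The stated lengths are consistent with this reading: 1387 codons plus one stop codon cover 4164 bp, which leaves 54 bp of the 4218 bp transcript after the stop.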