Monarch geneset OGS2.0

DPOGS202638
Transcript         DPOGS202638-TA    4827 bp
Protein            DPOGS202638-PA    1608 aa
Genomic position   DPSCF300039 - 788438-804187
RNAseq coverage    143x (Rank: top 54%)
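The lengths and coordinates listed for this gene can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes the transcript is coding sequence only (no UTRs, one stop codon) and that the scaffold coordinates are inclusive. The record states neither, and the variable names are hypothetical.

```python
# Values copied from the record for DPOGS202638.
transcript_bp = 4827
protein_aa = 1608
locus_start, locus_end = 788438, 804187

# Assumption: the transcript is CDS only, so its length should equal
# one codon per residue plus one stop codon.
expected_cds_bp = (protein_aa + 1) * 3

# Assumption: coordinates are inclusive, so the span needs a +1.
locus_span_bp = locus_end - locus_start + 1

print("CDS length consistent:", expected_cds_bp == transcript_bp)
print("Genomic span (bp):", locus_span_bp)
print("Span not accounted for by the transcript (bp):", locus_span_bp - transcript_bp)
```

Under these assumptions the transcript length agrees with the protein length, and the difference between the genomic span and the transcript length gives a rough upper bound on total intron length.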
Annotation
Heliconius         HMEL014520         0.0   80.31%
Bombyx             BGIBMGA000839-TA   0.0   72.04%
Drosophila         CG12325-PA         0.0   53.34%
EBI UniRef50       UniRef50_B4GGA0    0.0   53.46%   GL17309 n=9 Tax=Neoptera RepID=B4GGA0_DROPE
NCBI RefSeq        XP_623489.1        0.0   54.04%   PREDICTED: similar to CG12325-PA [Apis mellifera]
NCBI nr blastp     gi|328782245       0.0   53.92%   PREDICTED: periodic tryptophan protein 2 homolog [Apis mellifera]
NCBI nr blastx     gi|380020249       0.0   54.48%   PREDICTED: LOW QUALITY PROTEIN: periodic tryptophan protein 2 homolog [Apis florea]
Group
Gene Ontology      GO:0005515    7.6e-64   protein binding
KEGG pathway
InterPro domain    [636-1041]    IPR011046   7.6e-64   WD40 repeat-like-containing domain
                   [880-1091]    IPR015943   2.5e-42   WD40/YVTN repeat-like-containing domain
                   [1485-1584]   IPR007148   2.7e-16   Small-subunit processome, Utp12
                   [377-416]     IPR001680   2.6e-09   WD40 repeat
                   [508-546]     IPR019781   1.1e-08   WD40 repeat, subgroup
Orthology group    MCL14261      Single-copy universal gene

Nucleotide sequence:

>DPOGS202638-TA
ATGAAGTATAATTATAAGTTTCAAAATTTACTGGGCACAGTGTATCGTCATGGTGATATATTGTTCACTAATGATGGAAACTGTGTCATCAGTCCCGTTGGAAACAGGATAACCATTTACAATCTGAAACAGAATAAAAGCAATACTCTCCCTGTGGAAAGTCACTACAATTATACAGCCATCGACATATCTCCAAATGGATCAGTACTTCTTGCTATCAATGAAAAGGGTGAAGCACAAATGATCAGCCTCGTAACCTGCACGGTCATACACAGATATAAGTTCAAGCAGCAAGTTAATGCTGTCAAATTCAGTCCTGATGGGAAATTGTTTGCTGCTTGTTGTGATGACACAGTGTTCATAATGACTGCACCCAGTGCATTTACGGGAGAGTTCCGTTCATTCATAATGAGACGTGTGTTTAAAAAATCACATGACGAAGTCACTTGCCTGGACTGGTCTAGTTGTGGAAAGTTACTTGCGGTGGGATCCAAAGACACAACAACCAAAATATACACAGCCGAGTACTTGGACAACCTAAATATGTACTCTTTAAGTGGTCATACAGACAAGATTGTTGGTGTATTCTTTGAGCAGAAGAGTTTAGACCTTATAACTGTGAGCCGTAATGGTCAGGTTTGCTTGTGGGATGCCAGTCTGGATTCAGATAGCCTGGTTACTTCAGAGGTACAAATATCACATAAGAAGAGACGGAAATTGCAAAAGGAAGCCGAATTAGTTGAGGATGAAGTTGATGAAGAGAATATAGTTGAAAAGGACAAAGAATATGAGAGTGATAAAGATGTTGAAATAGAAGAGGAACAAAAGACAAATGACGGTAAAAAACTCCAATACAACAAGCTGGGGAGGCATTATATTGGGGATTCCATAAGGAATGGTAACCATAAGGTGAAACTAACGGCTGCGGCATACCACAGGGGGACCAAGATATTAGTGACAGGTTTCTCCACTGGTATATTCTTCCTTCACGAGATGCCAGATGTGAATCTCATCCACTCCTTGAGTATATCAGAACACAGGATTGGCAGCATCTCAGTATCTCACCAGGGGGACTGGATAGCGTTTGGTTGTCCCAACATTGGACAACTGTTGGTTTGGGAGTGGCAGAGCGAGCAATATGTTATGAAGCAGCAGGGCCACTCGCTAGACATGACCTGCCTCGCGTATTCGCCTGACGGGCTCTACATAGTCACAGGCGGCTATGACGGGAAGGTCAAAGTATGGAATACCAGCTCGGGCTTCTGCTTCGTTACATTCAGTGAACATAAGTCGACGGTGACCGGGATAACGTTCAGTGCCAATAAGAAATTCTTCGTGTCTTCATCTCTGGACGGCACCGTGAGATGTTACGATCTGACGAGGTATCGTAACTTCCGTACTTTCTCGTCTCCGACCCTGGTTCAGTTCGGCTGCGTGTCCTTGGACAGCAGCAGTGAACTGTGTGCTGCTGGAGGACAGGACGTCTTCGAGATATACCTGTGGTCCGTCAAATTTGGGCGACTTTTGGAGGTGCTCGCAGGTCATGCAGCTCCAGTGGCTAGTTTAGCTTTCAGTCCACTTCTGTCTAGTTCCAAACTGGCCTCCGCCTCCTGGGACAAGACGGTAAAGATATGGAACTGTATAGAAACAAGCTCGGACTGTGAAACTATACAACTGGGTTCGGACGCACTGCAAGTGAGCTTCAGACCTGATGGAGAAGAGATAGCAGTATCGACGTTAGACGGTAACATATCATTCTTCAACGCCACCACTTGCGACCAGACTGCCAGTTTGGAGGGGAGAAATGATCTGGGAGCCGGCAGGGCCGACACCGATCTGGTTACACCGGAGAAGCTGTTGAAGACCAAGAATAAAAGCAATACTCTTCCTGTGGAAAGTCACTACAATTATACAGCTATCGACATATCTCCAAATGGATCAGTACTTCTTGCTATCAATGAAAAGGGTGAAGCACAAATGATCAGCCTCGTAACTTGCAC
GGTCATACACAGATATAAGTTCAAGCAGCAAGTTAATGCTGTCAAATTCAGTCCTGATGGGAAATTGTTTGCTGCTTGTTGTGATGACACAGTGTTCATAATGACTGCACCCAGTGCATTCACGGGAGAGTTCCGTTCATTCATAATGAGACGTGTTTTTAAAAAATCACATGACGAAGTCACTTGCCTGGACTGGTCTAGTTGTGGAAAGTTACTTGCGGTGGGATCCAAAGACACCACAACCAAATTATACACAGCCGAGTACTTGGACAACCTAAATATGTACTCTTTAAGTGGTCACACAGATAAGATTGTTGGTGTATTCTTTGAGCAGAAGAGTTTAGACCTTATAACTGTGAGCCGTAATGGTCAGGTTTGCTTGTGGGATGCCAGTCTGGATTCAGATAGCCTGGTTACTTCAGAGGTACAAATATCACATAAGAAGAGACGGAAATTGCAAAAGGAAGCCGAATTAGTTGAGGATGAAGTTGATGAAGAGAATATAGTTGAAAAGGACAAAGAATATGAGAGCGATAAAGATGTTGAGATAGAAGAGGAACAAAAGACAAATGATGGTAAAAAACTCCAATACAACAAGCTGGGGAGGCATTATATTGGGGATTCCATAAGGAACGGTAACCATAAGGTGAAACTAACGGCTGCGGCATACCACAGGGGGACCAAGATATTAGTGACAGGTTTCTCCACTGGTATATTCTTCCTTCACGAGATGCCAGATGTGAATCTCATCCACTCCTTGAGTATATCAGAACACAGGATTGGCAGCATCTCGGTATCTCACCAGGGGGACTGGATAGCGTTCGGTTGTCCCAACATTGGACAACTGTTGGTTTGGGAGTGGCAGAGCGAGCAATATGTTATGAAGCAGCAGGGCCACTCGCTAGACATGACCTGCCTCGCGTATTCGCCTGACGGGCTCTACATAGTCACAGGCGGCTATGACGGGAAGGTCAAAGTATGGAATACCAGCTCGGGCTTCTGCTTCGTTACATTCAGTGAACATAAGTCGACGGTGACCGGGATAACGTTCAGTGCCAATAAGAAATTCTTCGTGTCTTCATCTCTGGACGGCACCGTGAGATGTTACGATCTGACGAGGTATCGTAACTTCCGTACTTTCTCGTCTCCGACCCTGGTTCAGTTCGGCTGCGTGTCCTTGGACAGCAGCAGCGAACTGTGTGCTGCTGGAGGACAGGACGTCTTCGAGATATACCTGTGGTCCGTCAAATTTGGGCGACTTTTGGAGGTGCTCGCAGGTCATGCAGCTCCAGTGGCTAGTTTAGCTTTCAGTCCACTTCTGTCTAGTTCCAAACTGGCCTCCGCCTCCTGGGACAAGACGGTAAAGATATGGAACTGTATAGAAACAAGCTCGGACTGTGAAACTATACAACTGGGTTCGGACGCACTGCAAGTGAGCTTCAGACCTGATGGAGAAGAGATAGCAGTATCGACGTTAGACGGTAACATATCATTCTTCAACGCCACCACTTGCGACCAGACTGCCAGTTTGGAGGGGAGAAACGATCTGGGAGCCGGCAGGGCCGACACCGATCTGGTTACACCGGAGAAGCTGTTGAAGACCAAAGCTTTCACTACGATATGCTACTCAGCGGATGGCACGTGTATCCTGGGGGCAGGAAACTCCAAACACATATGTCTGTACAGTATTAAGGAGGGTGTACTCATCAAGAAGTTTGTGATCACGCAGAACAAGTCCTTGGACGCTATTAATGACTTTATAAATCGTCGGAACATCACCGAATTTGGTAATATGGCGCTGGTTGAAGAGAGGGAGGAGTTGGAAGGAGGGGAGGTTAGGGTGAGGCTGCCGGGGGTCGGCGGGGGAGATATGGCTGATAGGAGGCTGAAACCTGAGGTTCGTGTGTGGTGTGTTCGTTTCTCTGGTGCTGATGAAAGCTTTGCAGCAGCATGTACCGAGGGACTGCTGTTATATGGAACAAGAACGGGGAGTGGGTTCA
GGCCATATCGTCTAGAAACAGGTTCCACGCCGGCTGCAGTGAAAAATCTATTGTCGGAGAGATCATGGGGCTTCGCCCTCATTGGAGCCCTACAGCTCAACGACAACACTCTCATACAGCAATGCGTAGAAGCTGTCCCCCCGAATGACATCGAACTAACAGCAAAGAGTTTGGAAGAAGATTACATGATACGTCTTCTTAACTCGATCGCAAGTCTTCTAGAAGATAGTCCCCATCTAGAACATTTGCTCATCTGGGTTAGGAGTCTCGTCACAGACAAGAAGAAATTCCCGCCCAGCGTGTTGCTAGCCTTAGAGAAGGCGCTCACGGTGAAATATTCGCAGATTAATAAAATGCCATATCGTCTAGAAACAGGTTCCACTCCGGCTGCAGTGAAAAATCTATTGTCGGAGAGATCATGGGGCTTCGCCCTCATTGGAGCCCTACAGCTCAACGACAACACTCTCATACAGCAATGCGTAGAAGCTGTCCCCCCGAATGACATCGAACTAACAGCAAAGAGTTTGGAAGAAGATTACATGATACGTCTTCTTAACTCGATCGCAAGTCTTCTAGAAGATAGTCCCCATCTAGAACATTTGCTCATCTGGGTTAGGAGTCTCGTCACAGACAAGAAGAAATTCCCGCCCAGCGTGTTGCTAGCCTTAGAGAAGGCGCTCACGGTGAAATATTCGCAGATTAATAAAATATGTGAGTTCAATAAATACACAATACGCTGTATAAAGTCAGTTGGCGCTTTGACTCTGAAGGATGAGGACGGTTCCAGCGGACGAGGCACTTTCACTGACGATTCAGATTCTGATTAA

Protein sequence:

>DPOGS202638-PA
MKYNYKFQNLLGTVYRHGDILFTNDGNCVISPVGNRITIYNLKQNKSNTLPVESHYNYTAIDISPNGSVLLAINEKGEAQMISLVTCTVIHRYKFKQQVNAVKFSPDGKLFAACCDDTVFIMTAPSAFTGEFRSFIMRRVFKKSHDEVTCLDWSSCGKLLAVGSKDTTTKIYTAEYLDNLNMYSLSGHTDKIVGVFFEQKSLDLITVSRNGQVCLWDASLDSDSLVTSEVQISHKKRRKLQKEAELVEDEVDEENIVEKDKEYESDKDVEIEEEQKTNDGKKLQYNKLGRHYIGDSIRNGNHKVKLTAAAYHRGTKILVTGFSTGIFFLHEMPDVNLIHSLSISEHRIGSISVSHQGDWIAFGCPNIGQLLVWEWQSEQYVMKQQGHSLDMTCLAYSPDGLYIVTGGYDGKVKVWNTSSGFCFVTFSEHKSTVTGITFSANKKFFVSSSLDGTVRCYDLTRYRNFRTFSSPTLVQFGCVSLDSSSELCAAGGQDVFEIYLWSVKFGRLLEVLAGHAAPVASLAFSPLLSSSKLASASWDKTVKIWNCIETSSDCETIQLGSDALQVSFRPDGEEIAVSTLDGNISFFNATTCDQTASLEGRNDLGAGRADTDLVTPEKLLKTKNKSNTLPVESHYNYTAIDISPNGSVLLAINEKGEAQMISLVTCTVIHRYKFKQQVNAVKFSPDGKLFAACCDDTVFIMTAPSAFTGEFRSFIMRRVFKKSHDEVTCLDWSSCGKLLAVGSKDTTTKLYTAEYLDNLNMYSLSGHTDKIVGVFFEQKSLDLITVSRNGQVCLWDASLDSDSLVTSEVQISHKKRRKLQKEAELVEDEVDEENIVEKDKEYESDKDVEIEEEQKTNDGKKLQYNKLGRHYIGDSIRNGNHKVKLTAAAYHRGTKILVTGFSTGIFFLHEMPDVNLIHSLSISEHRIGSISVSHQGDWIAFGCPNIGQLLVWEWQSEQYVMKQQGHSLDMTCLAYSPDGLYIVTGGYDGKVKVWNTSSGFCFVTFSEHKSTVTGITFSANKKFFVSSSLDGTVRCYDLTRYRNFRTFSSPTLVQFGCVSLDSSSELCAAGGQDVFEIYLWSVKFGRLLEVLAGHAAPVASLAFSPLLSSSKLASASWDKTVKIWNCIETSSDCETIQLGSDALQVSFRPDGEEIAVSTLDGNISFFNATTCDQTASLEGRNDLGAGRADTDLVTPEKLLKTKAFTTICYSADGTCILGAGNSKHICLYSIKEGVLIKKFVITQNKSLDAINDFINRRNITEFGNMALVEEREELEGGEVRVRLPGVGGGDMADRRLKPEVRVWCVRFSGADESFAAACTEGLLLYGTRTGSGFRPYRLETGSTPAAVKNLLSERSWGFALIGALQLNDNTLIQQCVEAVPPNDIELTAKSLEEDYMIRLLNSIASLLEDSPHLEHLLIWVRSLVTDKKKFPPSVLLALEKALTVKYSQINKMPYRLETGSTPAAVKNLLSERSWGFALIGALQLNDNTLIQQCVEAVPPNDIELTAKSLEEDYMIRLLNSIASLLEDSPHLEHLLIWVRSLVTDKKKFPPSVLLALEKALTVKYSQINKICEFNKYTIRCIKSVGALTLKDEDGSSGRGTFTDDSDSD-