Monarch geneset OGS2.0

DPOGS204277
Transcript: DPOGS204277-TA; 5007 bp
Protein: DPOGS204277-PA; 1668 aa
Genomic position: DPSCF300046 + 29607-44414
RNAseq coverage: 394x (Rank: top 30%)
Annotation
Heliconius: HMEL003320; 0.0; 96.15%
Bombyx: BGIBMGA007554-TA; 0.0; 81.87%
Drosophila: kto-PA; 0.0; 44.62%
EBI UniRef50: UniRef50_Q7QCA2; 0.0; 48.81%; AGAP002523-PA n=2 Tax=Anopheles gambiae RepID=Q7QCA2_ANOGA
NCBI RefSeq: XP_392792.3; 0.0; 51.59%; PREDICTED: similar to kohtalo CG8491-PA [Apis mellifera]
NCBI nr blastp: gi|347968015; 0.0; 48.81%; AGAP002523-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|350401915; 0.0; 52.18%; PREDICTED: mediator of RNA polymerase II transcription subunit 12-like protein-like [Bombus impatiens]
Group
Gene Ontology: GO:0006357; 1.8e-15; regulation of transcription from RNA polymerase II promoter
    GO:0016592; 1.8e-15; mediator complex
    GO:0001104; 1.8e-15; RNA polymerase II transcription cofactor activity
KEGG pathway 
InterPro domain: [462-832]; IPR021990; 3.9e-124; Mediator complex, subunit Med12, LCEWAV-domain
    [97-156]; IPR019035; 1.8e-15; Mediator complex, subunit Med12
Orthology group: MCL11016; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS204277-TA
ATGGGGATTATGTACGAAAAAAGACCTTTGAAACGGCCTCGGCTGGGGCCTCCTGATGTTTATCCACAAGAACCACGACAGAAGGAAGATGAACTTACCTCAGCCAATGTAAAACACGGATTTACAACAACCCCTCAATCTAGTGATGAATTTGGTACTGCTAGAAACTTCAACTATTCCGCATCAAAGATTGGACAATTCTTTTCTGGAATATTGTCCAAGAAGGAAGAACTTAACACTCTACCTGACTGTGGAAGAAAGAGACAGCAGGTTAACCCCAAGGACAACTTTTGGCCAGCCACAGCTCGTACTAAACCGCAGATTGAAGCATGGTTCAAAGATTTAGCTGGTAACAAACCTCTATCACAACTTGCAAAGAAGGCACCCAATTTTAACAAAAAGGAAGAAATATTTATAACATTAACTGAGTATCAAGTTGCCATGCCAAGAGCTGCTTGGTTCATCAAGCTCAGCTCAGCATACACTGTTGCAGTGTCTGAGGCTAAAATCAAAAAGAGACAACTTCCTGATCCAACCACCGAATGGACTACAACCCTGATTAAGTTCCTAAAAGATCAAATACCAAAGTTAGCAGAACATTACCAAAGTGCTGTACCTGGCAGCGTATCCGATAAGACCCCTCCGTCACAATCAGGTCAAGGCACGCCTAGTCACAGCTGTTTAGGAAATCCACCGGGAGGGTCCACACCAAATAGTTCAATGGTACCAAATTCCATGCACTCACCAGGTCTAGGCTCGTCCACTCCAAATGCCAATGAAAATACTGACTGGCGTCAAGCATTGCGGCAGTGGAATTACTGCTGCCGCCTGGCTCGCTATCTTCTTGATGAGGGTCTATTGGATAGACATGATTTCCTGACTTGGATCATCGAATTACTCGATAAAAGGGCTCCTGATGATGGACTTCTAAGGTTATTCCTGCCTCTAGCATTGCAACATATAAGTGAGTTTGTGTGTTGTGAGGGTCTTTCGCGTCGGTTGGCGACGTCGTGTGCAAAAAAAGCAGCAGCTATTTGTTCCGTGTTGTCGGATACTACACTGCGGGCTCTTAACCAACCTGCTGTATTCAATAAGACCGCAGAGAAAAGTGAAACTACCGGTGCGCCCTTGTCACCGACGGCTGAGGCGAATGGCGATGTGAAACCAAATGTCTCCCAGATAAAGCGTTCTGTGACCCCAGCGGATCAGAACCAAGGAACACCTAATCCAAACCATTCTTTGGGAACTCCAAATCCTAACCATTCCCTAGGGACCCCTAATCCGTTGGGCACCCCAAACCCTCAACAGCCAATGGATGTCCAAGTGAAGGAAGAAGCTAGTTCACCGCCGAGGGACACTAAGGCGACCTTAACAGCGGTCATAGCAGCTGCATTGAACCCCGTACAAGTTGGCTTTGGGGAGATACTGAATTGTGCCTACCACAGGGATGTCATCATACAGTTGGCCACAATTCTTCAGATAATCTGCATTGAGTGCCCAACAGCGCTAGTGTGGTCTGGTTGTGGGTCGTCACTCCAGGGCTCTCCGTTGGATCTGCTGCCACTGCCGCCCTCCGCCCTGCCCATGCCGCAGATGGACTCCACGCTATCGCAAGAGCATCGGCGGATGGCTTATGAATCTGAGCAAGAAATCGCTGCAAGAAGCAAGAAAGCGGAGAGTAAATGGTGTACTGATAAATGGCAAACCAGCTCTCAAGCCCGGGTGTTGGCAGTGCTAGAGGCTCTGGACAGGCACTGTTTCGACCGTGTCGACCCCAATAATAACTTGGACACGTTGTACAAAGAAGTATTTGCAAACTGCACTCCAACCAACAAGGAAGGAACTCTTGACCCTAAAGATCCTGAATATGCGTGTTCGTACGCGGTGGTGCGTGTTCTGTGCGAGTGGGCGGTGTGCGGCGCGAGGTGGGGGGAGCACCGGGCGATGGCGGCAGCGGCCTTGCTGGACCGACGACAGCACCATCACCACCACGACCACCA
AGGTTCCGATGACAAGGAGTCTGTTGGTTCTGGGACTGGAATTTATAATGGACCACCGATATTCCAGAATTTACTACTTCGTTTCTTGGATAACGACGCGCCCGTCTTAGATGAAAGTCCAAACGCGCCGCCAGGGAATAGACAGCAGTTCGCGAATTTGGTACATCTGTTCGGTGAACTGATAAGGCGGGATGTGTTCTCACACGATGCTTACATGTGCACGCTTATTTCTAGAGGAGACCTAATATCTCCTACGGAACCAACATCAACGGGTGGAACACATACAGTGGCGCCGCCAGCGAACAGTACCGGAACTAATCATAACATAGACGAAGACATATTTGCTGGTATAGACCTCAAGCCTAAGATGGAGGAGAATGTTCGAATGGACTTGGACGACTCCAAAATAGACGATGACCTTGACAAGCTTCTCCAACACATAAAGGAAGATCAGCAGAACTCAATGGACGCCCCCGACAGTCCAAAGGATCCCGGGGAACCCACACATGGAGTCAATTCACCAATGATGGGAGCGAGCGGGATCGGTATTCCTGGAATGCAATCCATTGGAATGGGTCCAATGTCTGTGCCAGGTGTTCGCGGGGCTTCGTTGTCCCGACACTATCACTACACGACTCACTTTCCCTTGCCGCCCGCGGAGCCTGAACACACGCCGCACGACGCTAACCAAAGACACATTTTGCTCTACGGAGTTGGACGGCAGAGGGATGACGCCAAGCACGCCGTCAAGAAAGTGACCAAAGGCAAACGTAATTATATATTTATGTTATGTTTTACAGAAATTTGCAAATTATTCTCCAAGAAGTTCTCCATTGACGTAGCCGAGGGTGGTAAAATAAAGAAGCATTCGAGAAGCGAATTCAACTTCGAGGCTGTTACTCAGAAGTTTCAAGCGATGTCGATGTATGAACAAGGTGCTGTCTCATGGGCCGTTGGTGGCGCGGTGTGCGAGGCGCTGGCGGCGTACGCGGCCGGTGCTACCACCTACTTGCCACAACCGGAACACGTGGCCTTCGCCCTTGACCTCATGGAGATAGCGCTCAATGTTCACGGCCTCATTGAGACATGTATACAAATACTCAAGGAGCTGTCTGAAGTCGAGGCCGCTCTGATAACTCGTGGGGCTCCCAGCAGCGGGTTAGCCGCTCCCCGTGCTTACACGTCCGCGTTAGCACTATATACCGTCGGAGCCCTAAGGAGATACCATTCCTGTCTCTTGTTATGCGTGGAACAAACGTCAGCTGTGTTCGAGCAGCTGTGCCGCCTTGTCAAGTGTGTGGTCAACCCTGGCGACTGTGGTTCCGCCGAGAGATGCGTCCTCGCTCAGCTCCACGATTTGTACAAAGCCGCCGCTCATCTGTTCCACGCACCACACGCTGATACTTTTGCGAACGCGTATCCTAAAATAAAACAAGCTCTCTACTCACCTCTGACTCCAACGCCATCTAACTATCAGTACAATCCCCAATTTCTCAGCGAGTTCTTCACTAATCCCCGTAAAGGTAAAATAGAGATTGCATGGGCGCGGCAGGTGGCGGAGTCCCCGGCGAACAGATACAGCTTTGTTTGTTCCGCCATGTTGGCTGTGTGCCGTGAAGTTGATAACGATCGCGTGAACGAGCTTGGCGTTGTATGCGCGGAGATGACAGCGTGGTGTAGCAGTCTGGCGGCGGAGTGGCTGGGAGCGCTGGTGGCGCTGTGCGGGGCGCAACATTACCCTCAGCACGCCGCACCGCCACCACTCTATCCTGACCTGCTGCATCATCGGGACCTGCACGACGCTGCTGCACACGACGCACTCGCAGTCTTCACTTGCATACTAGTCGCTCGTCACTGTTTCTCGCTGGAGGACTTCGTCCGTCACGCAGCTCTACCGTCGTTGGTGAAGGCATGTGGGGGCGGCGGGCCGCTGCCACCCAACGCGCCCTCGCCAGACGCCGGCGCGAGGCTCACCTGCCACCTACTGCTGAGGCTCTTCA
AAACAGTCGACACTCCGCAACCAGGACTGTACAGCGTGTCCACATCCCCCGGTCCTGTAGGTACTGGTGCTGGCGTGCGCCTGTCGTGCGACAGACACCTGCTCGCCGCCGCACACAAGAACATCGGCGTGGGACCAGTCCTAGCCACACTTAAAGCCATACTAATGGTGGGTGATTCAACAGCCCGGGATGGGATGAAACTCAGTGGAAAGAAATCCAGTGAACTGTCTCACATACTAGGGACCAGCGACACCGTACCCACGGACGCACAACTTGATCTCATGTCGATGGTAGATACAGAGATGAGTGGGGGCCATAGAACTCCTAGAGGCAGTGGTGGTGTTGGTTCCACTCTCCTGGATTCGTCTCAATCGTTGTCGTCCCTGGCGAGGAGAGTCCTCGCTGAAATATGCTCCGAGGAGTGGGTCCTGCAGAAATGTCTCCAGAATCCAGATGAACTTTACCAACCCGATATGCTACTAGACTCAATGCTGACCCCACGGCAGGCGCAAAGGTTGCTTCATATGATATGTTATCCAGATTCAGCAAGCCACACACATCCTGACCTTGATCAGAAAACTATGATAACGAGGCTTCTAGAGAACCTGGAGCAGTGGTCGCTTCGAATGTCGTGGCTGGACTTACAGCTGATGTTCAAGCAGTTCCCTAGCGGCTCGTCGGAGTTGAACGCCTGGCTAGACACCGTGTCACGGGCTGTGATCAACGTGTTCCAGCAGCCAGCACCTCCACCACCGGACAAAGACAGGATGGGAACTGAGAGAGGCGTGGTGAGCACCAGCAAGTGGTCGGAGTCGGTGTGGCTGGTGGCGCCTCTGGTGGCCAAACTACCAGCTGCGGTTCAAGGACGCGTTCTCAAACAAGCTGGACAGATCTTGGAAAGCGGCTGGGGCGCTGGATGCAGTGGCAGCTGCTCGAGCGGCGGGGGGAGCGGCGGCGGCTCGCACAGAGACTGCAAGAGTCATCAGAGCCCTAGCTACAAAGGGTAA

Protein sequence:

>DPOGS204277-PA
MGIMYEKRPLKRPRLGPPDVYPQEPRQKEDELTSANVKHGFTTTPQSSDEFGTARNFNYSASKIGQFFSGILSKKEELNTLPDCGRKRQQVNPKDNFWPATARTKPQIEAWFKDLAGNKPLSQLAKKAPNFNKKEEIFITLTEYQVAMPRAAWFIKLSSAYTVAVSEAKIKKRQLPDPTTEWTTTLIKFLKDQIPKLAEHYQSAVPGSVSDKTPPSQSGQGTPSHSCLGNPPGGSTPNSSMVPNSMHSPGLGSSTPNANENTDWRQALRQWNYCCRLARYLLDEGLLDRHDFLTWIIELLDKRAPDDGLLRLFLPLALQHISEFVCCEGLSRRLATSCAKKAAAICSVLSDTTLRALNQPAVFNKTAEKSETTGAPLSPTAEANGDVKPNVSQIKRSVTPADQNQGTPNPNHSLGTPNPNHSLGTPNPLGTPNPQQPMDVQVKEEASSPPRDTKATLTAVIAAALNPVQVGFGEILNCAYHRDVIIQLATILQIICIECPTALVWSGCGSSLQGSPLDLLPLPPSALPMPQMDSTLSQEHRRMAYESEQEIAARSKKAESKWCTDKWQTSSQARVLAVLEALDRHCFDRVDPNNNLDTLYKEVFANCTPTNKEGTLDPKDPEYACSYAVVRVLCEWAVCGARWGEHRAMAAAALLDRRQHHHHHDHQGSDDKESVGSGTGIYNGPPIFQNLLLRFLDNDAPVLDESPNAPPGNRQQFANLVHLFGELIRRDVFSHDAYMCTLISRGDLISPTEPTSTGGTHTVAPPANSTGTNHNIDEDIFAGIDLKPKMEENVRMDLDDSKIDDDLDKLLQHIKEDQQNSMDAPDSPKDPGEPTHGVNSPMMGASGIGIPGMQSIGMGPMSVPGVRGASLSRHYHYTTHFPLPPAEPEHTPHDANQRHILLYGVGRQRDDAKHAVKKVTKGKRNYIFMLCFTEICKLFSKKFSIDVAEGGKIKKHSRSEFNFEAVTQKFQAMSMYEQGAVSWAVGGAVCEALAAYAAGATTYLPQPEHVAFALDLMEIALNVHGLIETCIQILKELSEVEAALITRGAPSSGLAAPRAYTSALALYTVGALRRYHSCLLLCVEQTSAVFEQLCRLVKCVVNPGDCGSAERCVLAQLHDLYKAAAHLFHAPHADTFANAYPKIKQALYSPLTPTPSNYQYNPQFLSEFFTNPRKGKIEIAWARQVAESPANRYSFVCSAMLAVCREVDNDRVNELGVVCAEMTAWCSSLAAEWLGALVALCGAQHYPQHAAPPPLYPDLLHHRDLHDAAAHDALAVFTCILVARHCFSLEDFVRHAALPSLVKACGGGGPLPPNAPSPDAGARLTCHLLLRLFKTVDTPQPGLYSVSTSPGPVGTGAGVRLSCDRHLLAAAHKNIGVGPVLATLKAILMVGDSTARDGMKLSGKKSSELSHILGTSDTVPTDAQLDLMSMVDTEMSGGHRTPRGSGGVGSTLLDSSQSLSSLARRVLAEICSEEWVLQKCLQNPDELYQPDMLLDSMLTPRQAQRLLHMICYPDSASHTHPDLDQKTMITRLLENLEQWSLRMSWLDLQLMFKQFPSGSSELNAWLDTVSRAVINVFQQPAPPPPDKDRMGTERGVVSTSKWSESVWLVAPLVAKLPAAVQGRVLKQAGQILESGWGAGCSGSCSSGGGSGGGSHRDCKSHQSPSYKG-