Monarch geneset OGS2.0

DPOGS206472
Transcript: DPOGS206472-TA; 4728 bp
Protein: DPOGS206472-PA; 1575 aa
Genomic position: DPSCF300070 + 246342-253861
RNAseq coverage: 140x (Rank: top 55%)
Annotation
Heliconius: HMEL011937; 0.0; 87.17%
Bombyx: BGIBMGA005423-TA; 0.0; 77.32%
Drosophila: MED1-PA; 7e-114; 42.75%
EBI UniRef50: UniRef50_D6WVF3; 0.0; 42.94%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVF3_TRICA
NCBI RefSeq: XP_970198.2; 0.0; 42.93%; PREDICTED: similar to peroxisome proliferator-activated receptor binding protein [Tribolium castaneum]
NCBI nr blastp: gi|270012110; 0.0; 42.94%; hypothetical protein TcasGA2_TC006213 [Tribolium castaneum]
NCBI nr blastx: gi|270012110; 0.0; 42.98%; hypothetical protein TcasGA2_TC006213 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0006357; 2.6e-63; regulation of transcription from RNA polymerase II promoter
  GO:0016592; 2.6e-63; mediator complex
  GO:0001104; 2.6e-63; RNA polymerase II transcription cofactor activity
KEGG pathway:
InterPro domain: [34-439] IPR019680; 2.6e-63; Mediator complex, subunit Med1, metazoa/fungi
Orthology group: MCL17352; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206472-TA
ATGGCGACTGTAAAAGGACAATTAGATAAATCTAAAGAACTCCAAAAAGAGATACTTATGGAAAAACTTCGATCGAAAACGTCACAGTATAAAAATTTAAGTGAAACTTCCAAAGCTGTTCGTATGGCTTTGTTGGATAAGAGATGGGCAGTGGATAATGCGGATCGTATGATTCTACAGAAGTGCTTGGATAGCTTACAACATTGTATTAAAGTCAGTTCCCTGCAAAGCCTTATCGAAAGACTGGAGTGTCTGTCAAGACAGCTCGGACTGAAATTTGTTGTGGGAACATCAGGTGTTAACTTATTTATATCTTCAGATATGTTTTATTTGGAAATATTAGTTGAATCATCGGGATCTGTGAAAGATGTGAAAATACACCATGAGGGCAAGATTGAGCAGCAAAGTTGCGAAGAGCTGGTGCAGTGTATATCACGTGGAGACTTCGCCGACTTCACAGCACAACTGGAAGGACTGACGGCAGTTTATCAATTGAATGCTGAAAAGAAAGTTAAATGTAAAGCGTTCAGTGCTCTACAAAGTTTAGAGGAAGATTTATGTACGTTGAGACAGTTGCAAAGTTTCATTAAGGATCCATGGGCACAGGTCCATAAAAGCCCTATAGGGTTTTTGCAGAGACGGCGAGGAGGTCATGCGTTGCGTCTCACCTATTTTGTGTCGCCGTATGAGCTTCTGGATAAAGAGAAAGGTTTGCAGCCCTTGACAGTTGATTTACTAACTTCTGGTAAACCATCTTCATCAAGCGGATTGGCTTCACTTGTGTCACCATCTCATACACCTATAGGGCATTCAGCGACGGTACTCTTAGAAGGTTCTGCAGCCAATAAGCTCCAACTATCACCGATCATTGCTGGTGGACAGCGACCAGGGAAAGGGAATGGTCCAGTGTATGCACAGCTTGTTCCACAGAATAGTGCTATGTTACCAGCATGTTTCACTCTTAAATTATCTCAACCCACTCCTATGTGTGCTGGTCTTGCCAAATTGATACACGCAACTACAGATGTTGAAGTCGCTGGTGATTGGGCTAACGCTAAGCCTATGTTGGGTCTTGTGGCCCAACAAGCTTATAATAAACTTGCTCCTGGTAAAACAGTTGAACTAAATCTCAGCAAAGGACTCTTTGTTAGTCTGCCGGAGCAGTGGCAGTGTTACTTCCTATGTGAAACGCGAGGCTTGGACGGCTGTATGGTGACCAGCGTTCCCTTCACTCATCCGAGTCACGTGCCACCCCTACTGTCTGTGCTGAGGCAACAAGCGCTGTTTAACGCGTTACTCGCCAGCTGCGTCAGGCTGCAGACTAAACAAGTGATGGAGTTGGAAAGCGTCCTGATGTTCGAAGTGAGCGCCCTCTCTTGGCAGCATATCTCAATATCTCTAGAACACCCTGTAGATGAGAGTATGGCTACAGTGGAACTCGATCTAGCCGATCCAGCGGCTCCCCGCGCCTCGCTTTACACCCTCAACTCACCTCCCAGAGATCATATAGATGACTATATAACGAAAGTACTACAGAAGAGCCACTCCATACCAGTGACACTTAGGTCACTTATAAAAATATGGGAGCGTGAACGAGCTACGAAGCAGATAAATGGTAGCTACTCCGCTTTGGAGTCAAATTTTAGCTGTGGCCTGGGGGGCATCGACCCCGGGGGTGACGTTAAACACGAGCCCGGCAGTATGCATGGCCGGGCTATGTACCCTGCCAATGTTAGCCACCCACATCAAGGAGGTACATATCTTTCGGAACAGCAACACCACTTGTCTATGATGTGTCAAGACAATCCCAGCGATCTAAAACCCGCTGGTTCAGAAGAGCGAAAATCAAAAAAGCGTAAAGTGGAGCCAGGTTTATGGCCTTCTGGCCAAAAAAAGGGTAAACCTGGTATGAATGATATGATGGACTCTGATAGCGAGAGCGACAATTCAGATGGTTACGGCAATGAAGATGACACTATGAACAGCAATTCAACAAATGA
TGACATAGAGGGTCCATCGTCTGTGTCCCAGAATGAATTCGCTTCAGACTTGGAATTGTACGGTATGGACACTCCAGACGCACTGGGCGGACAAGAAAACAGAACATCGTCTGATATGGACAACTCCAACGACGGTGATGTCGAAGATATAATCAGGAATTCTTTTCACAAGTCTGAGCACAAAAGGCAAAAACTTAAAAGTAAAGATTCGGAGTCGAGAAGAAGTACATCGTCGTTACTGTTGGACCTGACTGAAGGAAAGTCTTACATGCCTTCCTCTGTAAGCATAACTCCTATAGGATCAACTTCATCGAACCCCCCCAGTTCGGGATCTGGGTCCAATATATCATCAATGTTATGTCTGGATAGAAGACCCGGTATAGAAATTATACCGATAACAGCAGCTCCACCGGCTGCTCTGCCCAGTTCTATAACTATCACGCCAATAACGTCTTCGCAGATAAAGTCACTTGATGATAGAATGAGATCCGACAAAAAATCAAGCAGATCAGGCGAGGAACGTAGTAAAGAAAAGAAAAAGAAGAGGCGTCGAGACGATTCTATGGGCCCGCCGGAAAAGATCCCACCCAAACAAGATCCTTTAACCAAACCTGTATCCGTAAGTATTAAACCCACTGACGGTTCTCCAATGCGTCCGACTTCCCCAAACTCCATCTTGAGAAAATTTAGTCCAAGTCCCACACAAGGCAGGTCTGTATCTATCTCCAAATCACCAAGTCCCAGCAGTGTTAAGGGCATGGGAAAACCATCAGGTACGTCCAGCCACCACAGTAGTCCTAGACACTCACCCGTTCAAAACAGTCCAAAACATTTAACAGGGTATTCGAGTCCAAAAAACCACAGTATTTCGTCGCCGAAACATAGTTCGTCCGGCTCGGGCAAACCTAGTATGTCAACTTTAAAATCTGCAACTAGTGGCTCCCCTTCCGGGAAATCCAGTGCGACGGGTTACGATCTGTCAAAAAAGTTATCTAAAGAAAGTTACGGTTCCAGTTCAGCTACCTCTCGCGACAAAGAAAAGAAACATTCCAGTTTATCTTTTTCCAGTTCCGATAGAAGCAGTCCCAAATTAAAAAACCCAATGAAATTGAAACAGTTAGAAATAACCCCAGTCTCTTGTGATAGTCCCGTAACCGAGCCTCTGATATCCCCATCTAACATAGAAGTAAACAAATCTAACGCTCCAAGCCAAGCTCGAAACAGGAAAGGTTCTCTCAGTGCGGTAATAGACAAGTTAAAATCTGCTCAACACTGTGGTGCTGAATCAGAAATTACTATAAAATCAAGTTCAAGTAGTAGTAGTAAATTTTCTGACCCAAAAAGCAGTTCTAGTAACGTTAAAATAAATGAGAGTAAAAATCAAGAATATATGGTGAAACCGAGTCTTGATGGCATGAAGATAACAATAAACAAAACTAGATCGAAAGAATCGTCGAGCAGTAGTTCGAAGTTAAATTATTCGAGCTCAAGTTCTAGCAAACAGTCATCACCCCAACTCACTCCGCCCAGCCAAGGTTCGCCTAAAACTCACACCGGTCTGAAACCCGGTGTAATAAGTGGACCCGCTTCAAAAAAAACTCAAGTTATGCAGTCATCCAAATCTGCTAGCATAGCATCAGGCTCCACACCTACCAAAGCCAATTTGAGTCCATCCGAGCAGTCCGGCTCTAATAGCAACAATACTCAAAAAGTGCCTTATACCAAATCAAGTAGCAGTTCCACTGGCATCAACTTGTCTTCGAAATCAGTATCAAAAAGTTCCGGTTCCCCAAAAAGCACAAGTTCAGATCTTGCTAAAATTATGAAAGAACGAGAAAAAGCTAGAACAAAATTTCTTAGTAATTCCGAAAAATCTATATTTACTTCAAAATCTGAGCGTCACTCGAGTCCAAGTAGTTCTAGAGATGATGTAGACGGTGACAGATTTAAAAATTCTAAAGACGCCAATTTCCTAGTCGAAGGTTTAATGAAGCCGCTTG
ATACTAGTAAATTTCAAATACCCAAATTGAAAAATCAAAATACTCCCGATAAAAACAGTCCCGTATCGCAGTACGACAGCCGTTCCATGGTTAACGATTTTTCCCGATCTGTAGATCAAAGTAAATATCCATTTCCCCACTACGATAACTCGAACAGGAATATAGATACCATGCAAAAATCAGTATATCCCCTCAATGTGCCGAAACCTTTATTAAACATGGGTGGTATAAGCCCGAAAACTATAGTGGCCGGTAAAAGTGACCAATCAGAGTATTCCAAAGCGATGTCAGATAGCATAGCGAAAGGAGAAAGCCCACAACGCATTAAAGATGAAGTGAAAGAGGGTTATTCGGGTGCATACCCATTGATGCCTTTAGATTTCAAAACGGACATGGCCAAGGGGTTTCCGGCGCCGAAGGGAAGTTCGGAGGATAGGAAGGGGGCGGATTCTTGTATGAAACCACCAGCGAGTGGGACGGTGGCTCGTAGTGGCGCTACGACACCGCAGGAAGCTGCAGAAATGTTACTAGACTTCTCATCATCATCTGGTAGTAAAGTATCGGGTGTGGTCTCAGTCCGTCCAGTGTACCCGGCCTCGCCAGCCTTATTGCAATTAGCGAAGAGTCCCGCCTCGTCACCCCTGGTAGCTCCTTCCCCCCATTCCAACTCTCCTTGCATCACCGACGACGAACTCATGGACGAAGCCCTGGTGGGCCCCGGGAAGTAA

Protein sequence:

>DPOGS206472-PA
MATVKGQLDKSKELQKEILMEKLRSKTSQYKNLSETSKAVRMALLDKRWAVDNADRMILQKCLDSLQHCIKVSSLQSLIERLECLSRQLGLKFVVGTSGVNLFISSDMFYLEILVESSGSVKDVKIHHEGKIEQQSCEELVQCISRGDFADFTAQLEGLTAVYQLNAEKKVKCKAFSALQSLEEDLCTLRQLQSFIKDPWAQVHKSPIGFLQRRRGGHALRLTYFVSPYELLDKEKGLQPLTVDLLTSGKPSSSSGLASLVSPSHTPIGHSATVLLEGSAANKLQLSPIIAGGQRPGKGNGPVYAQLVPQNSAMLPACFTLKLSQPTPMCAGLAKLIHATTDVEVAGDWANAKPMLGLVAQQAYNKLAPGKTVELNLSKGLFVSLPEQWQCYFLCETRGLDGCMVTSVPFTHPSHVPPLLSVLRQQALFNALLASCVRLQTKQVMELESVLMFEVSALSWQHISISLEHPVDESMATVELDLADPAAPRASLYTLNSPPRDHIDDYITKVLQKSHSIPVTLRSLIKIWERERATKQINGSYSALESNFSCGLGGIDPGGDVKHEPGSMHGRAMYPANVSHPHQGGTYLSEQQHHLSMMCQDNPSDLKPAGSEERKSKKRKVEPGLWPSGQKKGKPGMNDMMDSDSESDNSDGYGNEDDTMNSNSTNDDIEGPSSVSQNEFASDLELYGMDTPDALGGQENRTSSDMDNSNDGDVEDIIRNSFHKSEHKRQKLKSKDSESRRSTSSLLLDLTEGKSYMPSSVSITPIGSTSSNPPSSGSGSNISSMLCLDRRPGIEIIPITAAPPAALPSSITITPITSSQIKSLDDRMRSDKKSSRSGEERSKEKKKKRRRDDSMGPPEKIPPKQDPLTKPVSVSIKPTDGSPMRPTSPNSILRKFSPSPTQGRSVSISKSPSPSSVKGMGKPSGTSSHHSSPRHSPVQNSPKHLTGYSSPKNHSISSPKHSSSGSGKPSMSTLKSATSGSPSGKSSATGYDLSKKLSKESYGSSSATSRDKEKKHSSLSFSSSDRSSPKLKNPMKLKQLEITPVSCDSPVTEPLISPSNIEVNKSNAPSQARNRKGSLSAVIDKLKSAQHCGAESEITIKSSSSSSSKFSDPKSSSSNVKINESKNQEYMVKPSLDGMKITINKTRSKESSSSSSKLNYSSSSSSKQSSPQLTPPSQGSPKTHTGLKPGVISGPASKKTQVMQSSKSASIASGSTPTKANLSPSEQSGSNSNNTQKVPYTKSSSSSTGINLSSKSVSKSSGSPKSTSSDLAKIMKEREKARTKFLSNSEKSIFTSKSERHSSPSSSRDDVDGDRFKNSKDANFLVEGLMKPLDTSKFQIPKLKNQNTPDKNSPVSQYDSRSMVNDFSRSVDQSKYPFPHYDNSNRNIDTMQKSVYPLNVPKPLLNMGGISPKTIVAGKSDQSEYSKAMSDSIAKGESPQRIKDEVKEGYSGAYPLMPLDFKTDMAKGFPAPKGSSEDRKGADSCMKPPASGTVARSGATTPQEAAEMLLDFSSSSGSKVSGVVSVRPVYPASPALLQLAKSPASSPLVAPSPHSNSPCITDDELMDEALVGPGK-
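
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record: it checks whether the transcript length (4728 bp) equals three bases per residue of the 1575 aa protein plus one stop codon, and computes the genomic span from the listed coordinates. The function names are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of `protein_aa` residues.

    Each residue takes one codon (3 bp); optionally add one stop codon.
    """
    return 3 * (protein_aa + (1 if include_stop else 0))


def genomic_span(start: int, end: int) -> int:
    """Span length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
TRANSCRIPT_BP = 4728
PROTEIN_AA = 1575
GENOMIC_START, GENOMIC_END = 246342, 253861

cds_matches = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP
span_bp = genomic_span(GENOMIC_START, GENOMIC_END)

print("transcript length matches protein + stop codon:", cds_matches)
print("genomic span (bp):", span_bp)
print("genomic span minus transcript length (bp):", span_bp - TRANSCRIPT_BP)
```

Under these assumptions the transcript length matches a coding sequence with a terminal stop codon (the trailing `-` in the protein sequence), and the genomic interval is longer than the transcript, which would be consistent with intronic sequence in the gene model.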