Monarch geneset OGS2.0

DPOGS204229
TranscriptDPOGS204229-TA1800 bp
ProteinDPOGS204229-PA599 aa
Genomic positionDPSCF300046 - 640847-645995
RNAseq coverage248x (Rank: top 42%)
Annotation
HeliconiusHMEL0151570.074.20% 
BombyxBGIBMGA007510-TA0.091.25% 
DrosophilaMED14-PA4e-15749.75% 
EBI UniRef50UniRef50_E0VE601e-17853.95%CRSP complex subunit, putative n=3 Tax=Pediculus humanus corporis RepID=E0VE60_PEDHC
NCBI RefSeqXP_002424404.12e-17953.95%CRSP complex subunit, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838581347e-17955.36%PREDICTED: mediator of RNA polymerase II transcription subunit 14-like isoform 1 [Megachile rotundata]
NCBI nr blastxgi|3838581341e-17255.45%PREDICTED: mediator of RNA polymerase II transcription subunit 14-like isoform 1 [Megachile rotundata]
Group
Gene OntologyGO:00063576.3e-54regulation of transcription from RNA polymerase II promoter
GO:00165926.3e-54mediator complex
GO:00011046.3e-54RNA polymerase II transcription cofactor activity
KEGG pathway 
InterPro domain[23-213] IPR0139476.3e-54Mediator complex, subunit Med14
Orthology groupMCL11072 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204229-TA
ATGGTGCCAGTTGCGATTGAAGGATGCAATCAGGGGACCGAAGGTGGAGGTCCCCGTGGTGGGTCTATTTCTTTGGGACTACTAATTGATTTTATAGTTCAAAGAACTTATGATGAACTTACGGTTTTGGCTGAACTGTTACCTCGCAAAACAGATATGGAAAGGAAGATCGAAATCTACAAATTCAGTGCTAGAACAAGGCAACTGTTTGTTCGTTTATTGGCTTTAGTGAAGTGGGCAAGCAGCGCCACAAAAGTAGATAGATCAGCTCATATTATGGCATTCTTGGATAAACAAGCTCTCCTGTTTGTTGAAACAGCAGATGTGTTAGCGCGAGTTGCAAGAGAAACTTTGGTTCATGCAAGATTGCCAACATTCCACATGGCTGCAGCAGTCGAAGTGCTCACTCTTGGCACATACAGTCGTTTGCCAGCCGTAATCCGTGAACGCTTGGTACCACCGCCGCCGCTCACACCTGCTGAAAGACGTTCAACATTAAGGGCACTCGCACATGTTGTCCGACAAAGACTGACAACAGCTTCACTGCCCAGTGATGTTAGGAACTTGAAGGTTGAAAATGGAAGGGCTACATTCACTGTAGGTCAGGAGTTCAGCGTTTCCTTGACGGTTATGGGTGACGCACCAAATGTGCCTTGGCGTTTATTAGATATTGCTATATTAATACAAGATAACGAAACGGGAGAGGGAAAGCCATTAGTACATACATCCCAATTGAACTGGCTCCGTGGAGTCGCCCAGGCGAGGCTAGCGGCTGCGGGTCTCAGCGGCGCTTTAACAGCTCTACGATTCTTCTGCCGGTCGCTCTCTCTGGAACTGTTATATACACAGACATTAAGATTATGCCGCGATCGATTGGCCCGACATCTCCAAGTGGATAGATATATACCTGGACAGAAGTTACAGGTTTCATATTGGAGGGAGTTGGGTTGCGAGCTCGGTTACCGTCTTATCGTTGGTGCGGAGGGAGAGTCACTGTGCGTGTGGCACGTACCAGCGCTGGCGGGCGGCGAGCGTGTGGCGGCCGCGCTCACACCGCACGCGCCGTCCATGGAGCGTCTCCTCGCTCACACCGTACACGTGAGATCGAGACAGAGACTTAATGATCTCAAAGTTCTCCTCAATGATCTTGGGGTGGAATGTTCTGTGGGCGGATGGCCGTGTTCCCTGGCGTGTTCAGTGGTGGCGCCGTGTCTTCGTGCGGAGCAGCTCCTAGTGTCTGTGGGTGCTCACGGGGGTCGCCTGCGAGCGCGTGTCCCCGCATACCCCGCCACCCCTCGCATGCCAGAACTGGCGGCCGCGCTGGCCGCCACGGACCGAACTCTTGTTAGACAACTGCTAACACAACTGAGGTTCTGGTTGGTGGCACGGCGATGTGAGAAGTCCCTGCAACACCTGCCAGCCAGTGTGTGTGAGCATCTGCCGTTCCTGCACGGACCCGACCACCCTCTCAACAAACTGTCGCCGGACAGACTGTACGTCACCTTACACAGACACACGGATCACATACTAGTGGTAGAGATGAAGGAGGTGGCAGCGGGCGGCACGGCCACCACCAGCTGTAGTAGCAACACCGGCAGCGCCTGCAGTGTAGCGCTGGCCTTCCACCTGGCCGGGGTGAGGCGCTGTACTCCTGACGAGTGCGAGGATGAGTCCTCTGGCACGACGTCCAACACGCCGGCTACAGCTCGTGCTTACCTCAAGTTGCATTCGCTGGTCGAGTTGGACACGTTCACATTGACCCACGGGCCCTTCACACCTCTCGACACCCCAGGCAAGTAG

Protein sequence:

>DPOGS204229-PA
MVPVAIEGCNQGTEGGGPRGGSISLGLLIDFIVQRTYDELTVLAELLPRKTDMERKIEIYKFSARTRQLFVRLLALVKWASSATKVDRSAHIMAFLDKQALLFVETADVLARVARETLVHARLPTFHMAAAVEVLTLGTYSRLPAVIRERLVPPPPLTPAERRSTLRALAHVVRQRLTTASLPSDVRNLKVENGRATFTVGQEFSVSLTVMGDAPNVPWRLLDIAILIQDNETGEGKPLVHTSQLNWLRGVAQARLAAAGLSGALTALRFFCRSLSLELLYTQTLRLCRDRLARHLQVDRYIPGQKLQVSYWRELGCELGYRLIVGAEGESLCVWHVPALAGGERVAAALTPHAPSMERLLAHTVHVRSRQRLNDLKVLLNDLGVECSVGGWPCSLACSVVAPCLRAEQLLVSVGAHGGRLRARVPAYPATPRMPELAAALAATDRTLVRQLLTQLRFWLVARRCEKSLQHLPASVCEHLPFLHGPDHPLNKLSPDRLYVTLHRHTDHILVVEMKEVAAGGTATTSCSSNTGSACSVALAFHLAGVRRCTPDECEDESSGTTSNTPATARAYLKLHSLVELDTFTLTHGPFTPLDTPGK-