Monarch geneset OGS2.0

DPOGS214478
TranscriptDPOGS214478-TA1989 bp
ProteinDPOGS214478-PA662 aa
Genomic positionDPSCF300122 - 337637-343491
RNAseq coverage373x (Rank: top 32%)
Annotation
Heliconius% 
BombyxBGIBMGA013412-TA2e-6462.45% 
DrosophilaMED26-PA2e-1654.67% 
EBI UniRef50UniRef50_E2BZR16e-2564.37%Mediator of RNA polymerase II transcription subunit 26 n=10 Tax=Formicidae RepID=E2BZR1_HARSA
NCBI RefSeqXP_001120925.15e-2359.77%PREDICTED: similar to Mediator complex subunit 26 CG1793-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3071976442e-2464.37%Mediator of RNA polymerase II transcription subunit 26 [Harpegnathos saltator]
NCBI nr blastxgi|3071976443e-2452.27%Mediator of RNA polymerase II transcription subunit 26 [Harpegnathos saltator]
Group
Gene OntologyGO:00056345.1e-18nucleus
GO:00036775.1e-18DNA binding
GO:00063515.1e-18transcription, DNA-dependent
KEGG pathway 
InterPro domain[11-98] IPR0179235.1e-18Transcription factor IIS, N-terminal
[9-83] IPR0036171.4e-12Transcription elongation factor, TFIIS/CRSP70, N-terminal, sub-type
Orthology groupMCL31010 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214478-TA
ATGAGTAATAACGCTCATGAGCTCAGTAATCGACTTCTGAATGCGTTGGATAGCAATTACAATGTCGTCGACATGCAGACTGTTAATGAAATTATTTCCATTCTGGAGAAATTTAATATCACAAAAGAGCTGTTGGAGACCACCAGACTCGGTAAGCATGTGAACGAGCTCCGTCGTAAGACAAGCGAGCCGACACTGGCTCGCCGCGCCAAGGTGCTGGTAAAGCGCTGGCGGGATCTCGTCATACCTAGCACACACTCACCCGCACATACTGGCTCAAATCGTTCGTCTGCTGAGCGTCAAGTTAGGAGGCTCGGTGGAACTCCACTCACTTCACCGGCACTCTCCAGGAACCTGGTGAGTCCAGCTGTTCCAAGTCCAAGACCACCTTCTAGACCAGCGTGGGGTGGCTACGAGTCGGACTCGCAGGACGTGATCCTGGTGGACGACGACCCCCCTCCCCCTGTGACCCCCTCCCTAGCCCCTCTCCCACCACACAAACCGGCAACACCTCCTGTAAAAAGGGCGCTGTCACCCGAACCTCTATACGATGAAAAGAAGGCGAAAAGAGATAAAAAACCTAAGAAAAGAAGGGGTCATGGTAGAGCGGGCTCGGGGACTGAGACGACAGCGGAGCCCCAGAGTCAACACAACTGGGTGTCCAGGAACGGGACAGGACCTGAACGGAGACGGAATGGATGGAAACGAGACCCACCAGACCCTTACGCTGCGCTCGTTAATAGACTGCCTCCCGCCGGGGCTAAGAAGGTGAAGACGACCAAGGAACTGCTGGAACAGATCCAGTCCCGCGGCAGCAGTCGACCTTCCCGCCCGGCCTCCCCGCGCTCCCCGCACTCCCCGGCCTCCCCTGCCTCCCCCGACGTCATGCTCATCGAGCCTGACGTGTGCCCTATCAAAGTCGAGTCCCCGCTTCGGAACGGCGGCGGAGATTCCGCGAAGACGAAAATGGATCCCGAGCCAGAGGAGCGTGAGGAGGTCCCGCCGCTAGAGGAGCCGCGCGCCTGGGCGGAGTGCACGTGCGGCGAGGAGGAGGCGTCAGCGGACTGTCCTGCCGCTGGGCGCCCGGCGCTCCGGCCGCTCCACGTCCGGGCGCTGCACAACGTGCTGCTTCCGGGGATGAACGGCACGCGAGCCCCGCTCTTCCCTCACCGGTTCGCCGTCCGCCCTCCCGCGCGCGACGACGACCCCACGCTGTTCTCCAGCGTGGTGCCCCTCTATAACTACTCAGACTACGCGGACGACCACTGTGTCAAAAACATGTCCCGAGTGCCGATCTGTGAGCGCCTGCCGTGGACGGAATTCGCGCCCTCGCCGCCGCCTCCGTCTCCGCCGCCTTCGCCGCCGCCACTGCGACCCTACCCTTCGCCGCTCCGCGAAGAAGTCTTCCCCGACTTGCCGGAACCCGACGATGATTCCGCGGACTCCTCCACCACGCCCAACGTGATCGAAGCTACGAGCGAGGACGAGCCCAGAGAGAAGCCGGTCATCGAGCCAGCTATGGTCGGCCTCGCCTACATGGAGCCCGGGGAAAGGTTGAAGGCGCCGCTGCTGGACGACGACGACGACTACCAGCTCGGGGAACCGCGCCGGGAGGTGACGGTCGAGAAGAGCTTCGTCAGTGAACCCCAGGAACATCGCACGGTCGAGAGGACGAAAGAGGACATAGACGGCCGGACGTTGTACGGCCTGTGCGAGGCGGCCCTGGCCAGCGTGCCCTACCGGTACTCGCAGGCGGCGGCGGCGCCGGTGTTGACGCTGCCGCAGCTCATCGAGAGGGAGAACGAACCTCCCGATCACGTGGCGGAGGACGAAGCCGCCAAGCTGCTGGAGGGCGCGCTGGAGACCGCTGCCGCCGACCCGGCCCTCGACGAGCTGGGCCGACCTCAGCGACCTCTGTCGTTCGCGGAATGGCACGAGTGCGCCCGCCTCGGGGACCTGGTGGCCCTGCCGTACGTCGTCATCGACTGA

Protein sequence:

>DPOGS214478-PA
MSNNAHELSNRLLNALDSNYNVVDMQTVNEIISILEKFNITKELLETTRLGKHVNELRRKTSEPTLARRAKVLVKRWRDLVIPSTHSPAHTGSNRSSAERQVRRLGGTPLTSPALSRNLVSPAVPSPRPPSRPAWGGYESDSQDVILVDDDPPPPVTPSLAPLPPHKPATPPVKRALSPEPLYDEKKAKRDKKPKKRRGHGRAGSGTETTAEPQSQHNWVSRNGTGPERRRNGWKRDPPDPYAALVNRLPPAGAKKVKTTKELLEQIQSRGSSRPSRPASPRSPHSPASPASPDVMLIEPDVCPIKVESPLRNGGGDSAKTKMDPEPEEREEVPPLEEPRAWAECTCGEEEASADCPAAGRPALRPLHVRALHNVLLPGMNGTRAPLFPHRFAVRPPARDDDPTLFSSVVPLYNYSDYADDHCVKNMSRVPICERLPWTEFAPSPPPPSPPPSPPPLRPYPSPLREEVFPDLPEPDDDSADSSTTPNVIEATSEDEPREKPVIEPAMVGLAYMEPGERLKAPLLDDDDDYQLGEPRREVTVEKSFVSEPQEHRTVERTKEDIDGRTLYGLCEAALASVPYRYSQAAAAPVLTLPQLIERENEPPDHVAEDEAAKLLEGALETAAADPALDELGRPQRPLSFAEWHECARLGDLVALPYVVID-