Monarch geneset OGS2.0

DPOGS206783
TranscriptDPOGS206783-TA1131 bp
ProteinDPOGS206783-PA376 aa
Genomic positionDPSCF300001 - 5544809-5546092
RNAseq coverage146x (Rank: top 54%)
Annotation
HeliconiusHMEL0134242e-12160.34% 
BombyxBGIBMGA000646-TA6e-12858.45% 
DrosophilaTaf7-PA2e-4135.45% 
EBI UniRef50UniRef50_E2BX265e-7443.44%Transcription initiation factor TFIID subunit 7 n=7 Tax=Endopterygota RepID=E2BX26_HARSA
NCBI RefSeqXP_968514.11e-7343.73%PREDICTED: similar to transcription initiation factor TFIID subunit 7 [Tribolium castaneum]
NCBI nr blastpgi|3071990212e-7343.44%Transcription initiation factor TFIID subunit 7 [Harpegnathos saltator]
NCBI nr blastxgi|3071990212e-7443.20%Transcription initiation factor TFIID subunit 7 [Harpegnathos saltator]
Group
Gene OntologyGO:00063676.5e-21transcription initiation from RNA polymerase II promoter
GO:00056696.5e-21transcription factor TFIID complex
KEGG pathwaytca:6569244e-73 
 K03132 (TFIID6, TAF7)maps-> Basal transcription factors
InterPro domain[55-136] IPR0067516.5e-21TAFII55 protein, conserved region
Orthology groupMCL34676 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206783-TA
ATGAACAGAGATAAGAGAGAACCAGAGTATCCTGCGGAGCTAGAATCTCAGTTTATTCTTCGTTTGCCGGAGGATCCAGCAAAAGAATTGCGGGAGCTACTTAAAACTGGAGATAACATAAAAAGTCGTCTAACAATTCAAATAGATAACGATATGAGAAATGGTGAGATGATGATATGTAAAGAGGAAGCCGATCAAACACCTGCAGAAGAAGAATCACCTTCAAAAAATAAAAAAAAGGATCCCTATAAAGTTGATAAGAAATTTCTTTGGCCGCATGGTGTGACACCGCCGACCAAAAATGTAAGAAAGCGTCGCTTTCGCAAAACTTTAAAGAAAAAATATGTAGAAGCACCTGAAATTGAAAAAGAAGTAAAAAGATTGTTAAGAGCTGATAATGAAGCCGTCAGTGTCACTTGGGAAGTTGTGAAAGAAGAGGATGATCAACTTAAGCCAGAACCTGTTACCCCAACACCAACACCAAAAGCAGAGAAAAAGACCAAAGCAGCTGAGAGAGCAGCAAAGAAAGCTGCTGCTGCTGCTGCTGCTGCTGCTTCTGCTGCAGCTGCAGCCGCCAGTGCCAATAACTCTGAGTCTTCTAATGTTGTTGATATATTTGGTAGTGCTGTAAGTGACAGTGATGCAGAAGATGACAACATTAATGTAGAGTTGGAGGATAGTCACTTCTCAACTTATGACAGCCGCCTATCAGACAACAGTTCAGTTTTAGGCTTGGGAGATATGCAAACAAAAAAAGAAAGCTATCCAATAGAATTTGATTCGCAAATGTTTCAAAGTGGTCAAAGCCATAAAAGATCTAATAGAAGCCGAGCCACAACTCCAGCAACTTCAACAGTTACCAAGAGCGGAGGGTTGTCGTCTGAAGAAGATGGAGAATATTCAAGAGATATGTCTAAAGATAATATGTCAGTGAGAATTGAGGAACTCCGAACTGAACTAGAAGAGTTGAAACAGCGCAAACAGAGGATGCAGTATGAAATTGCTGGAATGGAAAATCTAGCTTTACGACAAAGATTTCAAGAGATTCTACATACATTAAACCAGGATGTTATGTATAAGGAAATGGAATATCAAGGCCTCATCACATTACAAAATTCTGAAGATATATGA

Protein sequence:

>DPOGS206783-PA
MNRDKREPEYPAELESQFILRLPEDPAKELRELLKTGDNIKSRLTIQIDNDMRNGEMMICKEEADQTPAEEESPSKNKKKDPYKVDKKFLWPHGVTPPTKNVRKRRFRKTLKKKYVEAPEIEKEVKRLLRADNEAVSVTWEVVKEEDDQLKPEPVTPTPTPKAEKKTKAAERAAKKAAAAAAAAASAAAAAASANNSESSNVVDIFGSAVSDSDAEDDNINVELEDSHFSTYDSRLSDNSSVLGLGDMQTKKESYPIEFDSQMFQSGQSHKRSNRSRATTPATSTVTKSGGLSSEEDGEYSRDMSKDNMSVRIEELRTELEELKQRKQRMQYEIAGMENLALRQRFQEILHTLNQDVMYKEMEYQGLITLQNSEDI-