Monarch geneset OGS2.0

DPOGS208479
TranscriptDPOGS208479-TA1137 bp
ProteinDPOGS208479-PA378 aa
Genomic positionDPSCF300064 - 1155557-1156693
RNAseq coverage143x (Rank: top 54%)
Annotation
HeliconiusHMEL0021882e-13464.25% 
BombyxBGIBMGA000646-TA1e-11556.51% 
DrosophilaTaf7-PA5e-6340.09% 
EBI UniRef50UniRef50_E2BX262e-8648.40%Transcription initiation factor TFIID subunit 7 n=7 Tax=Endopterygota RepID=E2BX26_HARSA
NCBI RefSeqXP_624942.19e-8948.94%PREDICTED: similar to TBP-associated factor 7 CG2670-PA isoform 2 [Apis mellifera]
NCBI nr blastpgi|3320170874e-8847.89%Transcription initiation factor TFIID subunit 7 [Acromyrmex echinatior]
NCBI nr blastxgi|3320170876e-8947.09%Transcription initiation factor TFIID subunit 7 [Acromyrmex echinatior]
Group
Gene OntologyGO:00063671.3e-50transcription initiation from RNA polymerase II promoter
GO:00056691.3e-50transcription factor TFIID complex
KEGG pathwayame:5525633e-88 
 K03132 (TFIID6, TAF7)maps-> Basal transcription factors
InterPro domain[14-173] IPR0067511.3e-50TAFII55 protein, conserved region
Orthology groupMCL14279 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208479-TA
ATGAATCGTGAAAAACGAGATCCCGATTACCCTGTAGAATTAGAAACTCAATTCATTATGAGAATGCCAGAAACGCCCGGAAAGGCTTTGAGTGAATTAATTAAATCAGGAGAAAATTTTAAGAATAGACTTACGATTCAAATAGAAAACGATATGCGACATGGAGAAGTAAGATTTGACCAATGGGTGCTACATGCTAAAATTGTCGATTTACCTACCATAGTAGAGTCCTGGAAGACCATTGACAGGAAAAGTCTATATAAAACTGCTGACCTCTGCCAATTGATGATATGTAAAGAAGAAGCAGATTCTTGCACTGAAGAAGAATCACCAACCAAAAATAAAAAGAAAGATCCCTTGAAAGTAGACAAGAAATTTCTTTGGGCTCATGGAATTACTCCACCAACTAAAAATGTTCGAAAAAGGCGCTTCAGGAAAACACTAAGAAAGAAATGTACAGAAGGACCTGAGATAGAAAAAGAGGTTAAAAGACTATTAAGAGCTGATAATGAAGCTGTTAGTTTTACTTGGGAAGTAATAAAAGAGGAAGATGAAACTCCTAAAGGCTCTAAAAACGAGGCTACATTGCCTAAAGTGGAGAAAGGCAAGAGCAAGAAAGATACTACACACACCACTCCCAAAACTAATCAGCCATCTAAAGTTGAAGATATTTTTGGTGATGCTTTAAGTGACAGTGATGTTGAAGAAGAAAATATCAGTGTTGATGTAGAAGATAGCAGGTTGTCATTCTATGAAGAACCCTTGTCCGAAAACAATTCTATAAATGCCGGAGACATTTCTAAGGGATCTAGTTTTGCTACACAATTTAAATCTGAAATGTTTGAATCTCCACCAAAGATGTCATCGGCTAACAGGAATCAGTCAACTAAGTATGATAGCAAGCAAACTGGAGAGCAATCTTCAAGCAGTTATCCCAACACTTCTAGTTTCAAAATGCAAGAGCTTTTTACTGAACTAGAAGAACTCAAACAGAGAAGGCAAAGGACACAACTAGAAATAGCTGGTATGGAGAATATGACATTAAGGCAACGGTTCCAAGATATCCTGAAAACCCTTAACAAGGAGATAATTACTAAAGAAGTTGAATACAACAGATTAAAATCTCATTTAAAATAA

Protein sequence:

>DPOGS208479-PA
MNREKRDPDYPVELETQFIMRMPETPGKALSELIKSGENFKNRLTIQIENDMRHGEVRFDQWVLHAKIVDLPTIVESWKTIDRKSLYKTADLCQLMICKEEADSCTEEESPTKNKKKDPLKVDKKFLWAHGITPPTKNVRKRRFRKTLRKKCTEGPEIEKEVKRLLRADNEAVSFTWEVIKEEDETPKGSKNEATLPKVEKGKSKKDTTHTTPKTNQPSKVEDIFGDALSDSDVEEENISVDVEDSRLSFYEEPLSENNSINAGDISKGSSFATQFKSEMFESPPKMSSANRNQSTKYDSKQTGEQSSSSYPNTSSFKMQELFTELEELKQRRQRTQLEIAGMENMTLRQRFQDILKTLNKEIITKEVEYNRLKSHLK-