New model in OGS2.0 | DPOGS208479  |
---|---|
Genomic Position | scaffold672:- 3676-4812 |
See gene structure | |
CDS Length | 1137 |
Paired RNAseq reads   | 177 |
Single RNAseq reads   | 520 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000646 (2e-91) |
Best Drosophila hit   | TBP-associated factor 7, isoform B (5e-47) |
Best Human hit | transcription initiation factor TFIID subunit 7 (7e-38) |
Best NR hit (blastp)   | PREDICTED: similar to TBP-associated factor 7 CG2670-PA isoform 2 [Apis mellifera] (1e-87) |
Best NR hit (blastx)   | PREDICTED: similar to transcription initiation factor TFIID subunit 7 [Tribolium castaneum] (3e-72) |
GeneOntology terms    | GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0006367 transcription initiation from RNA polymerase II promoter GO:0005669 transcription factor TFIID complex GO:0016251 general RNA polymerase II transcription factor activity |
InterPro families   | IPR006751 TAFII55 protein, conserved region |
Orthology group | MCL12668 |
Nucleotide sequence:
ATGAATCGTGAAAAACGAGATCCCGATTACCCTGTAGAATTAGAAACTCAATTCATTATG
AGAATGCCAGAAACGCCCGGAAAGGCTTTGAGTGAATTAATTAAATCAGGAGAAAATTTT
AAGAATAGACTTACGATTCAAATAGAAAACGATATGCGACATGGAGAAGTAAGATTTGAC
CAATGGGTGCTACATGCTAAAATTGTCGATTTACCTACCATAGTAGAGTCCTGGAAGACC
ATTGACAGGAAAAGTCTATATAAAACTGCTGACCTCTGCCAATTGATGATATGTAAAGAA
GAAGCAGATTCTTGCACTGAAGAAGAATCACCAACCAAAAATAAAAAGAAAGATCCCTTG
AAAGTAGACAAGAAATTTCTTTGGGCTCATGGAATTACTCCACCAACTAAAAATGTTCGA
AAAAGGCGCTTCAGGAAAACACTAAGAAAGAAATGTACAGAAGGACCTGAGATAGAAAAA
GAGGTTAAAAGACTATTAAGAGCTGATAATGAAGCTGTTAGTTTTACTTGGGAAGTAATA
AAAGAGGAAGATGAAACTCCTAAAGGCTCTAAAAACGAGGCTACATTGCCTAAAGTGGAG
AAAGGCAAGAGCAAGAAAGATACTACACACACCACTCCCAAAACTAATCAGCCATCTAAA
GTTGAAGATATTTTTGGTGATGCTTTAAGTGACAGTGATGTTGAAGAAGAAAATATCAGT
GTTGATGTAGAAGATAGCAGGTTGTCATTCTATGAAGAACCCTTGTCCGAAAACAATTCT
ATAAATGCCGGAGACATTTCTAAGGGATCTAGTTTTGCTACACAATTTAAATCTGAAATG
TTTGAATCTCCACCAAAGATGTCATCGGCTAACAGGAATCAGTCAACTAAGTATGATAGC
AAGCAAACTGGAGAGCAATCTTCAAGCAGTTATCCCAACACTTCTAGTTTCAAAATGCAA
GAGCTTTTTACTGAACTAGAAGAACTCAAACAGAGAAGGCAAAGGACACAACTAGAAATA
GCTGGTATGGAGAATATGACATTAAGGCAACGGTTCCAAGATATCCTGAAAACCCTTAAC
AAGGAGATAATTACTAAAGAAGTTGAATACAACAGATTAAAATCTCATTTAAAATAA
Protein sequence:
MNREKRDPDYPVELETQFIMRMPETPGKALSELIKSGENFKNRLTIQIENDMRHGEVRFD
QWVLHAKIVDLPTIVESWKTIDRKSLYKTADLCQLMICKEEADSCTEEESPTKNKKKDPL
KVDKKFLWAHGITPPTKNVRKRRFRKTLRKKCTEGPEIEKEVKRLLRADNEAVSFTWEVI
KEEDETPKGSKNEATLPKVEKGKSKKDTTHTTPKTNQPSKVEDIFGDALSDSDVEEENIS
VDVEDSRLSFYEEPLSENNSINAGDISKGSSFATQFKSEMFESPPKMSSANRNQSTKYDS
KQTGEQSSSSYPNTSSFKMQELFTELEELKQRRQRTQLEIAGMENMTLRQRFQDILKTLN
KEIITKEVEYNRLKSHLK