New model in OGS2.0 | DPOGS203366  |
---|---|
Genomic Position | scaffold6:+ 30869-40689 |
See gene structure | |
CDS Length | 2799 |
Paired RNAseq reads   | 1515 |
Single RNAseq reads   | 4052 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003891 (0.0) |
Best Drosophila hit   | TBP-associated factor 4, isoform E (2e-80) |
Best Human hit | transcription initiation factor TFIID subunit 4 (1e-60) |
Best NR hit (blastp)   | transcription initiation factor [Aedes aegypti] (2e-100) |
Best NR hit (blastx)   | transcription initiation factor TFIID subunit, putative [Pediculus humanus corporis] (2e-145) |
GeneOntology terms    | GO:0006355 regulation of transcription, DNA-dependent GO:0006367 transcription initiation from RNA polymerase II promoter GO:0016251 general RNA polymerase II transcription factor activity GO:0005669 transcription factor TFIID complex GO:0005634 nucleus GO:0016986 transcription initiation factor activity GO:0003700 sequence-specific DNA binding transcription factor activity GO:0007517 muscle organ development GO:0048813 dendrite morphogenesis GO:0045944 positive regulation of transcription from RNA polymerase II promoter |
InterPro families    | IPR003894 TAFH/NHR1 IPR009072 Histone-fold IPR007900 Transcription initiation factor TFIID component TAF4 |
Orthology group | MCL14702 |
Nucleotide sequence:
ATGGCGTCAGCGGAGTTTTTAGAGCAAGCTTTGTCTACAGACGTTGATGAGAATGCAGTT
AACGCGATAGTAGGTTCCCTAGAAAATCATTTAGTGACGTCCGTTCCGTCAATATCGTCA
CAGAACAATTTGTTGACTGTTATTCCAAGTCAACTTAGCCTTGCAACAAGTGAAAATACC
ATTATTGGACAAAAATATAACAAAGAGAATAGTGATGGCGATATAGGGAGTGTAAACTTT
AGACCAAATATTGTTTCAAGTTCCTCGTTTAGTTTACCCTCAACTTTTATTAACCAAACA
AGCCTGTCTCAAAACATTTCAAATGGTACTGATTTGGTAAAAGTTATAAGTTCTCAACCG
CTAACTTTATCCGTCTCTGATAATAGTGTTGTGTTCTCAGCGCCATCATACGCAAACGGT
TGTCCTTCTTTGCCGTTATCCCAAGCTCAGATAATTCAGACTGTACAAGGAAGTAGTGCA
ATAAATCAGCCAATTAATAAATCTATTACTATGCAAAATCCTCCTTTGGTTATAAAACAG
GGAACGACTTCTGGTCAAGTCAGTATGCAAGCCAATATGGTACCAATGACAGTGAATTCT
AGCATGCCGGGTTCTATTTCTAACGTGATGACTATTAATAAGCCAGGAGGGCAGAACGTC
GTTGTCACAACACAGAATCTCGGTACAGGCCAACCTGCTATATTGCCCAATGTTCAAATT
TTAAACATGAGGCCGGGTGCGCCTGCGGTGGCGGCTCAAAAATCGGTCGCAACTGTGTCC
CCGCGCGTTGTTATCGGAACTCCTCAGGTTGTTGGACAGAGAGCAGCTGCCCCTGGAATA
ACGCTGCAAACACTACAAAGTCTACAACAGGGGCAGCAGGGTCATTTGTTATTAAAGACT
GAGAACGGTCATTACCAGCTGTTAAGAGTGGGTCCAGGGCCCGGGGCCAGCACGCTGGCG
CCGCAGCAACAGACGATGCGACTGTCCACAGTGCCGGCACATCCCGGGGTGTCAACGGTG
TCCACGAGCGTGCCGGCCCCGGTACAGATACCTGGTCAGATGCCGCAGGGGCCGGTCGCT
ACCCCGGTGCCAGCGGCATCTGTGACCGTGCCTCTGCCTTCGCCACAATCACTCCAACCC
ACAGTCACTACTCAGAAGCCGTTGGACAACACTAAGGAGAAGTGTCGCAACTTCCTGGCC
AACCTGCTGGACCTGTCTAGCAAGGAGCCGAAGTCCGTGGAGAGGAGTGTCAGGAACCTC
ATACAGGAACTGATCGACGCTCAGGTGGAACCGGAGGAGTTCTGTGATAGGTTGGAAAGA
CTCCTGAACGCCAGCCCACAACCCTGCCTCATCGGCTTCCTGAAGAAAAGTCTACCGTTG
CTGCGTCAGTCCCTCGTCACCAAGGAGCTGGTGATAGAGGGCATCAACCCGCCGTCTCCG
CACGTGGCGTTCTCAGCGATATCGCCGCAAGCACCCAACACGGCGGTAGCCACCAGCAAC
ATACAGATGCCGGGCCTAACGTTAGTGGTTCGGCAGCCAGATGACGAAGGCAGCTCAAGT
CCGACCCTAACGCCCCTCCTGCCTCCCGTGATGCCGGTCATCCCGCCCCAACCACCATCA
CCGAAACAGATTAACATAGTCGCCGTGCAGTCGGCGCAGGTTTGTCGCTCGCGTTACACC
AGCCTTAGCACCAGTAGAGTCATCGGAATCATTCGCTACGGTGGTTCCACACGTGATGCG
GCTCTTTATATAACTTATATGTCAATCCGTCTCCATCATTGCGCTCTTCCACATCAGCCG
AAGCCTCAGCCCAAGTCTGGTGGTACGATAGCGGTGCTCCAGAACATTCCAGTGCATCCG
AAGATCAACGTCAGCAAAGTGGGCAAGACTATGACGGTGAACAGTAAGGCTGGCTTCACG
CGACCCACGGGCTCCGCTAACACGGGCCTCTCGACTGTGCTCACGGCGGGGAAGTCTCTG
CTGCGGGACAGGGAGAGGAGATCAGCTCAGTTCTCGCAGAGCTTCGTGGACGACAAGATG
GCCGGCGATGACGACATCAATGACGTAGCAGCCATGGGAGGAGTAAATCTCGCTGAGGAG
AGCCAGCGGATATTGGGCTCCACGGAAATGATCGGAGCACAGATCAGATCCTGTAAAGAC
GAGACCTTAGTACCAATGGCGGTGATGCAGGCCAGGATACGTGCGGTGTCTCTGAGACAC
GGCCTGGAGGAGCCCCCGGCGGAGGTTGGGGCCTTACTGAGCCACGCGCTGCAAGAACGA
CTCAAATCGCTACTAGAGAAGCTAGCGGTCATATCACAGCACCGGATAGACACGCATGTC
AAGATGGATTCGCGTTATGAAGTGACTCAGGATGTGAAGGGCCAGCTGAAGTTCCTTGAA
GAACTGGACAGAGTGGATAAGAAGAGACGAGAGGACTCAGAGAGGGAGATGTTGTTGAGA
GCAGCCAAGTCGCGATCCAAGAACGAGGACCCTGAACAGGCCAAGCTTAAGGCGAAGGCC
AAGGAGATGCAGCGCGCTGAGTTAGAGGAGCTGAGGCAGCGGGAGGCGAATCTGACAGCA
TTACAGGCGATCGGGCCAAGGAAGAAGCCGCGAACTGACGGCTCAGCAGCTGGAGATAAT
CTGGGATCCAGCGGTCAGAGTACTGGACCTTCCGGCCGAGGTCAGCTCCCACAGCGAACC
CGTCTGAAGAGAGTCAACATGCGAGATATGCTGTTCATGATGGAACAAGAGCCTGAATAT
AGACACTCGGCGCTACTATACCGTGCCTACCTCAAGTAA
Protein sequence:
MASAEFLEQALSTDVDENAVNAIVGSLENHLVTSVPSISSQNNLLTVIPSQLSLATSENT
IIGQKYNKENSDGDIGSVNFRPNIVSSSSFSLPSTFINQTSLSQNISNGTDLVKVISSQP
LTLSVSDNSVVFSAPSYANGCPSLPLSQAQIIQTVQGSSAINQPINKSITMQNPPLVIKQ
GTTSGQVSMQANMVPMTVNSSMPGSISNVMTINKPGGQNVVVTTQNLGTGQPAILPNVQI
LNMRPGAPAVAAQKSVATVSPRVVIGTPQVVGQRAAAPGITLQTLQSLQQGQQGHLLLKT
ENGHYQLLRVGPGPGASTLAPQQQTMRLSTVPAHPGVSTVSTSVPAPVQIPGQMPQGPVA
TPVPAASVTVPLPSPQSLQPTVTTQKPLDNTKEKCRNFLANLLDLSSKEPKSVERSVRNL
IQELIDAQVEPEEFCDRLERLLNASPQPCLIGFLKKSLPLLRQSLVTKELVIEGINPPSP
HVAFSAISPQAPNTAVATSNIQMPGLTLVVRQPDDEGSSSPTLTPLLPPVMPVIPPQPPS
PKQINIVAVQSAQVCRSRYTSLSTSRVIGIIRYGGSTRDAALYITYMSIRLHHCALPHQP
KPQPKSGGTIAVLQNIPVHPKINVSKVGKTMTVNSKAGFTRPTGSANTGLSTVLTAGKSL
LRDRERRSAQFSQSFVDDKMAGDDDINDVAAMGGVNLAEESQRILGSTEMIGAQIRSCKD
ETLVPMAVMQARIRAVSLRHGLEEPPAEVGALLSHALQERLKSLLEKLAVISQHRIDTHV
KMDSRYEVTQDVKGQLKFLEELDRVDKKRREDSEREMLLRAAKSRSKNEDPEQAKLKAKA
KEMQRAELEELRQREANLTALQAIGPRKKPRTDGSAAGDNLGSSGQSTGPSGRGQLPQRT
RLKRVNMRDMLFMMEQEPEYRHSALLYRAYLK