New model in OGS2.0 | DPOGS213695  |
---|---|
Genomic Position | scaffold487:+ 14253-15639 |
See gene structure | |
CDS Length | 852 |
Paired RNAseq reads   | 63 |
Single RNAseq reads   | 247 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010619 (4e-59) |
Best Drosophila hit   | TATA binding protein (5e-58) |
Best Human hit | TATA-box-binding protein isoform 1 (1e-58) |
Best NR hit (blastp)   | TATA binding protein, putative [Aedes aegypti] (1e-72) |
Best NR hit (blastx)   | TATA binding protein, putative [Aedes aegypti] (2e-69) |
GeneOntology terms    | GO:0005737 cytoplasm GO:0045449 regulation of transcription GO:0006350 transcription GO:0006355 regulation of transcription, DNA-dependent GO:0003702 RNA polymerase II transcription factor activity GO:0005488 binding GO:0006367 transcription initiation from RNA polymerase II promoter GO:0045941 positive regulation of transcription GO:0003700 sequence-specific DNA binding transcription factor activity GO:0003677 DNA binding GO:0005634 nucleus GO:0006366 transcription from RNA polymerase II promoter GO:0006383 transcription from RNA polymerase III promoter GO:0045120 pronucleus GO:0010843 promoter binding GO:0000120 RNA polymerase I transcription factor complex GO:0005669 transcription factor TFIID complex GO:0005515 protein binding GO:0001939 female pronucleus GO:0001940 male pronucleus |
InterPro families    | IPR012294 Transcription factor TFIID, C-terminal/DNA glycosylase, N-terminal IPR000814 TATA-box binding protein IPR012295 Beta2-adaptin/TATA-box binding, C-terminal |
Orthology group | MCL19487 |
Nucleotide sequence:
ATGGATTCAGATCTTTCTGTTCCTGACTGCCCTATGGCGGAAATCGTGAGTGATGTTAAT
GTACAAAATATGCAAACTGATGGACACACTCCAGTTAATGAAAAACAAAACCCACAACGA
GGGAATACGAAGCTTTTAACAAGTGGGACACCTAAGCCTCATTCCTTAGAAAATGTGCCA
TCAACTCCACAAATTTCAGGAGATATAACTTTGACACCAACACATCGAACATTCACTCCA
CAAACTCCATCTGTGAATCCACATAATTCTATGAGTGCCATTACCCCAATGGCAAGTGCA
GTTAATCAAGCAAAAAATAGTATAAAATTTCAAAATTGTATTTCTACAGTAAGTTTAGAT
TGTGAACTGAATTTGTTAGACATATACTGTAGAACAAGGTTTTCAGAATACAACCCTGCT
AGATTTAATGGAGTCGTTATGAAGATTTTGGAACCGCGAGCCACAGCCCTAGTATTTAGA
TCTGGTAAAATAGTCTGTACGGGAGCCAAAAATGGACATGACTCATATATCGCAGCTAGA
AAATTTGCAAGAATTATTCAGAAACTTGGTTTTCCGGTGAAATTTGTTGATTTCAAAGTT
CTTAATTTTCTAGCAACAGCGGATTTAAGATTTCCCATAAAACTGGAAGCGCTACAGCAA
GCTCACGGTCAGTTCACTTCATATGAACCGGAACTTTTCTCTGGCCTCGTTTATAGAATG
ATACGACCAAGGGTTGTGTTGCTAATATTTGTTAATGGAAAAATGGTTATAACAGGCGCT
AAAACTAATCAAGAAGTTTATGAAGCAGTTGACATAATACACCCCATTTTAAGAAGTTAC
AAGAAAAATTGA
Protein sequence:
MDSDLSVPDCPMAEIVSDVNVQNMQTDGHTPVNEKQNPQRGNTKLLTSGTPKPHSLENVP
STPQISGDITLTPTHRTFTPQTPSVNPHNSMSAITPMASAVNQAKNSIKFQNCISTVSLD
CELNLLDIYCRTRFSEYNPARFNGVVMKILEPRATALVFRSGKIVCTGAKNGHDSYIAAR
KFARIIQKLGFPVKFVDFKVLNFLATADLRFPIKLEALQQAHGQFTSYEPELFSGLVYRM
IRPRVVLLIFVNGKMVITGAKTNQEVYEAVDIIHPILRSYKKN