DPGLEAN04269 in OGS1.0

New model in OGS2.0  DPOGS213695
Genomic Position  scaffold487:+ 14253-15639
CDS Length  852
Paired RNAseq reads  63
Single RNAseq reads  247
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010619 (4e-59)
Best Drosophila hit  TATA binding protein (5e-58)
Best Human hit  TATA-box-binding protein isoform 1 (1e-58)
Best NR hit (blastp)  TATA binding protein, putative [Aedes aegypti] (1e-72)
Best NR hit (blastx)  TATA binding protein, putative [Aedes aegypti] (2e-69)
GeneOntology terms
GO:0005737 cytoplasm
GO:0045449 regulation of transcription
GO:0006350 transcription
GO:0006355 regulation of transcription, DNA-dependent
GO:0003702 RNA polymerase II transcription factor activity
GO:0005488 binding
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0045941 positive regulation of transcription
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0006366 transcription from RNA polymerase II promoter
GO:0006383 transcription from RNA polymerase III promoter
GO:0045120 pronucleus
GO:0010843 promoter binding
GO:0000120 RNA polymerase I transcription factor complex
GO:0005669 transcription factor TFIID complex
GO:0005515 protein binding
GO:0001939 female pronucleus
GO:0001940 male pronucleus
InterPro families
IPR012294 Transcription factor TFIID, C-terminal/DNA glycosylase, N-terminal
IPR000814 TATA-box binding protein
IPR012295 Beta2-adaptin/TATA-box binding, C-terminal
Orthology group  MCL19487
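
The genomic position and CDS length fields above can be compared with a few lines of Python. This is a minimal sketch, not part of the record: the function name `parse_position` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state. Under that assumption the genomic span is longer than the CDS, which would be consistent with intronic or other non-coding sequence inside the model.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised position: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold487:+ 14253-15639")

# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1
cds_length = 852  # value of the "CDS Length" field

print(scaffold, strand, span, span - cds_length)
# prints: scaffold487 + 1387 535
```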

Nucleotide sequence:

ATGGATTCAGATCTTTCTGTTCCTGACTGCCCTATGGCGGAAATCGTGAGTGATGTTAAT
GTACAAAATATGCAAACTGATGGACACACTCCAGTTAATGAAAAACAAAACCCACAACGA
GGGAATACGAAGCTTTTAACAAGTGGGACACCTAAGCCTCATTCCTTAGAAAATGTGCCA
TCAACTCCACAAATTTCAGGAGATATAACTTTGACACCAACACATCGAACATTCACTCCA
CAAACTCCATCTGTGAATCCACATAATTCTATGAGTGCCATTACCCCAATGGCAAGTGCA
GTTAATCAAGCAAAAAATAGTATAAAATTTCAAAATTGTATTTCTACAGTAAGTTTAGAT
TGTGAACTGAATTTGTTAGACATATACTGTAGAACAAGGTTTTCAGAATACAACCCTGCT
AGATTTAATGGAGTCGTTATGAAGATTTTGGAACCGCGAGCCACAGCCCTAGTATTTAGA
TCTGGTAAAATAGTCTGTACGGGAGCCAAAAATGGACATGACTCATATATCGCAGCTAGA
AAATTTGCAAGAATTATTCAGAAACTTGGTTTTCCGGTGAAATTTGTTGATTTCAAAGTT
CTTAATTTTCTAGCAACAGCGGATTTAAGATTTCCCATAAAACTGGAAGCGCTACAGCAA
GCTCACGGTCAGTTCACTTCATATGAACCGGAACTTTTCTCTGGCCTCGTTTATAGAATG
ATACGACCAAGGGTTGTGTTGCTAATATTTGTTAATGGAAAAATGGTTATAACAGGCGCT
AAAACTAATCAAGAAGTTTATGAAGCAGTTGACATAATACACCCCATTTTAAGAAGTTAC
AAGAAAAATTGA

Protein sequence:

MDSDLSVPDCPMAEIVSDVNVQNMQTDGHTPVNEKQNPQRGNTKLLTSGTPKPHSLENVP
STPQISGDITLTPTHRTFTPQTPSVNPHNSMSAITPMASAVNQAKNSIKFQNCISTVSLD
CELNLLDIYCRTRFSEYNPARFNGVVMKILEPRATALVFRSGKIVCTGAKNGHDSYIAAR
KFARIIQKLGFPVKFVDFKVLNFLATADLRFPIKLEALQQAHGQFTSYEPELFSGLVYRM
IRPRVVLLIFVNGKMVITGAKTNQEVYEAVDIIHPILRSYKKN
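
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the record itself does not name the table used. The helper `translate` is a hypothetical name, and only the first and last lines of the nucleotide sequence are translated here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

first_line = "ATGGATTCAGATCTTTCTGTTCCTGACTGCCCTATGGCGGAAATCGTGAGTGATGTTAAT"
last_line = "AAGAAAAATTGA"

print(translate(first_line))  # MDSDLSVPDCPMAEIVSDVN
print(translate(last_line))   # KKN*
```

Both results agree with the start and end of the protein sequence, and the 852 nt CDS corresponds to 283 codons plus one stop codon, matching the 283 residues listed.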