DPGLEAN04261 in OGS1.0

New model in OGS2.0DPOGS213670 
Genomic Positionscaffold487:- 44192-45448
See gene structure
CDS Length921
Paired RNAseq reads  179
Single RNAseq reads  486
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010619 (6e-143)
Best Drosophila hit  TATA binding protein (1e-101)
Best Human hitTATA-box-binding protein isoform 1 (1e-92)
Best NR hit (blastp)  RecName: Full=TATA-box-binding protein; AltName: Full=TATA-box factor; AltName: Full=TATA-binding factor; AltName: Full=TATA sequence-binding protein; Short=TBP; AltName: Full=Transcription initiation factor TFIID TBP subunit (6e-153)
Best NR hit (blastx)  RecName: Full=TATA-box-binding protein; AltName: Full=TATA-box factor; AltName: Full=TATA-binding factor; AltName: Full=TATA sequence-binding protein; Short=TBP; AltName: Full=Transcription initiation factor TFIID TBP subunit (8e-146)
GeneOntology terms











  
GO:0042797 tRNA transcription from RNA polymerase III promoter
GO:0006359 regulation of transcription from RNA polymerase III promoter
GO:0005666 DNA-directed RNA polymerase III complex
GO:0042796 snRNA transcription from RNA polymerase III promoter
GO:0008134 transcription factor binding
GO:0000126 transcription factor TFIIIB complex
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0005669 transcription factor TFIID complex
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0016251 general RNA polymerase II transcription factor activity
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0005515 protein binding
InterPro families

  
IPR012294 Transcription factor TFIID, C-terminal/DNA glycosylase, N-terminal
IPR000814 TATA-box binding protein
IPR012295 Beta2-adaptin/TATA-box binding, C-terminal
Orthology groupMCL12753

Nucleotide sequence:

ATGGATCAAATGCTTCCAAGTCCATATAACATACCAGGTATTGGTACTCCCTTGCACCAA
CCTGAAGAAGATCAACAGATTTTACCAAATGCTATGCAACAGCAACAACAACAACAACGC
ACAACAACAACTTCCTTGGTATCAATGGGTTCATCGCCGCTCGTGGGTTTTGGCGCCTCT
ATAATGGGCACACCTCAGAGAACGATGCATACGTATGCTCCAACAGCCAGCTATGCAACA
CCTCAACAGATGATGCAACCTCAAACACCGCAAAACTTAATGTCTCCATTGATAACGGGT
TCAAGTATAGCCGGTCAACAGATCCTAAACCAAATGAGTCCTGCACCCATGACACCTATG
ACTCCACATTCTGCAGATCCTGGAATATTACCTCAGTTGCAAAATATAGTTTCCACAGTA
AATCTTAATTGCAAATTAGACCTTAAAAAGATAGCCCTACATGCCCGCAATGCTGAATAT
AACCCTAAACGTTTTGCTGCCGTCATTATGAGGATACGAGAACCAAGGACTACAGCATTG
ATATTTTCTTCTGGCAAAATGGTTTGCACCGGTGCCAAGAGTGAAGAAGACTCCCGTCTT
GCTGCAAGAAAATATGCCAGAATTATACAAAAGCTAGGATTTACGGCAAAATTTTTGGAT
TTTAAAATTCAAAACATGGTTGGAAGTTGCGATGTTAAATTTCCAATTCGCCTGGAAGGC
TTAGTCCTAACACATGGACAATTTAGCTCTTACGAACCTGAACTCTTCCCTGGACTCATC
TACCGAATGGTGAAACCTAGAATAGTTTTACTGATATTTGTATCAGGAAAAGTGGTACTA
ACAGGTGCAAAAGTTCGCCAAGAAATATATGAAGCTTTTGATAATATTTACCCAATATTG
AAAAGTTTTAAGAAACAATAA

Protein sequence:

MDQMLPSPYNIPGIGTPLHQPEEDQQILPNAMQQQQQQQRTTTTSLVSMGSSPLVGFGAS
IMGTPQRTMHTYAPTASYATPQQMMQPQTPQNLMSPLITGSSIAGQQILNQMSPAPMTPM
TPHSADPGILPQLQNIVSTVNLNCKLDLKKIALHARNAEYNPKRFAAVIMRIREPRTTAL
IFSSGKMVCTGAKSEEDSRLAARKYARIIQKLGFTAKFLDFKIQNMVGSCDVKFPIRLEG
LVLTHGQFSSYEPELFPGLIYRMVKPRIVLLIFVSGKVVLTGAKVRQEIYEAFDNIYPIL
KSFKKQ