DPGLEAN14516 in OGS1.0

New model in OGS2.0DPOGS202410 
Genomic Positionscaffold1305:+ 31880-39901
See gene structure
CDS Length1497
Paired RNAseq reads  1760
Single RNAseq reads  4210
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003305 (4e-31)
Best Drosophila hit  transcription factor IIFalpha (2e-73)
Best Human hitgeneral transcription factor IIF subunit 1 (2e-44)
Best NR hit (blastp)  GK12178 [Drosophila willistoni] (9e-135)
Best NR hit (blastx)  PREDICTED: similar to Transcription initiation factor IIF alpha subunit (TFIIF-alpha) (Transcription factor 5 large chain) (TF5A) [Apis mellifera] (4e-79)
GeneOntology terms








  
GO:0005674 transcription factor TFIIF complex
GO:0016251 general RNA polymerase II transcription factor activity
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0005634 nucleus
GO:0016986 transcription initiation factor activity
GO:0006366 transcription from RNA polymerase II promoter
GO:0016563 transcription activator activity
GO:0003677 DNA binding
GO:0045941 positive regulation of transcription
GO:0003824 catalytic activity
InterPro families

  
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR011039 Transcription Factor IIF, Rap30/Rap74, interaction
IPR008851 Transcription initiation factor IIF, alpha subunit
Orthology groupMCL11762

Nucleotide sequence:

ATGACGACACCAGGAACTTCACAACCGGCCACGGTACAAGAATTCAAAATCAGGGTGCCA
AAGAACGTGAAGAAGAAATACCACGTGATGAGATTCAACGCGACCCTCAACGTTGACTTC
GCGAAGTGGACTCACGTGAAGATGGAGAGAGAGAACAACATTAAGGAGTTCAAGGGAACG
GAAGAGGAAATGCCAAAGTTCGGCGCTGGTTCAGAATACGGCAGGGATGTGAGGGAAGAG
GCTCGACGGAAGAAATTTGGTATCATCTCGCGGAAATACAAACCTGAAGATCAACCCTGG
ATACTGAAAGTAGGCGGGAAAACTGGCAAGAAGTTCAAAGGTATCCGCGAGGGCGGTGTC
TCTGAGAACGCAGCCTACTACGTCTTCACGCACGCCGCTGACGGAGCTATCGACGCCTAC
CCTCTACAAGAATGGTACAATTTCCAACCGATCCAGCGCTACAAGGCGCTTTCCGCCGAA
GAAGCGGAACAGGAATTTGGAAGACGTAACAAGGTGATCAATTACTTCTCACTGATGTTC
CGTAAACGTATGAGAGGGGACGACGCGGCCGACGAAGACGATCCCGATGACAAGAAAACC
AAGGGGGCGAAAGCTAAGAAGGATCTGAAGATATCTGAAATGGACGAGTGGATAGATTCC
GACGACGAGTCCTCGGATTCAGAAGGAGACAAAGACAAAGAGAAGGAGGACAGCGACTCC
GGCACCAAAAAGAAGAATAAGAAGAAAGCGGTGCCAAAGAAGAAGAAGAAGGTCAATGAT
GAGGCGTTTGAAGAGAGTGATGATGGAGACGAGGAGGGCAGGGAGAGAGATTATATATCA
GACTCATCGGAGAGTGAGTCCGACCATGAGACGAAAGCCAACAAGGAGCTGAAGGGAGTC
GCCGAGGAAGACGCTCTGAGGAAACTGCTGACATCGGACGAGGGTACGGACTCGGAGCAG
GAACAGAAACAAGAGTCGGAGGGAGAAGACGAGCCCACCAAGGAGGGGGAGGAGAGAGCG
AGCAAACTCACCAAGAAGAAGAAGAAGGAAGACGCCAAGAGAGACACCAGCAGCGACTTC
AGCTCAGACTCCGACACCGACCCCGAGAACAGCAGCAAGAAACAGAAGAAAGGAAAGAAC
AATGACGCGAAAAACAACAACGCGGGTGGTAGCGCGAGCACGTCTCGCTCGTGCACTCCC
ACACCTTCAAACGCGATGTCCGCCGTGGCCGCAGCCAACGCCGCCAACAACCAGCCCGCC
AAGAGAGCGAAGCTGGACCCCTCGTATACGGAGTGCGGCGTCACGGAGGAGGCCGTGCGC
CGTTACTTGACTAGGAAGCCGATGACCACCACGGAGCTGCTGACCAAGTTCAAGTCCAAG
AGGAGCGGCGTGTCCTCCGAGAGGCTCGTGGAGACCATGACGCAGATCCTCAAGAGGATC
AACCCCGTCAAACAGAACATCAACGGCAAGATGTATCTTAGCATCAAACAGACGTGA

Protein sequence:

MTTPGTSQPATVQEFKIRVPKNVKKKYHVMRFNATLNVDFAKWTHVKMERENNIKEFKGT
EEEMPKFGAGSEYGRDVREEARRKKFGIISRKYKPEDQPWILKVGGKTGKKFKGIREGGV
SENAAYYVFTHAADGAIDAYPLQEWYNFQPIQRYKALSAEEAEQEFGRRNKVINYFSLMF
RKRMRGDDAADEDDPDDKKTKGAKAKKDLKISEMDEWIDSDDESSDSEGDKDKEKEDSDS
GTKKKNKKKAVPKKKKKVNDEAFEESDDGDEEGRERDYISDSSESESDHETKANKELKGV
AEEDALRKLLTSDEGTDSEQEQKQESEGEDEPTKEGEERASKLTKKKKKEDAKRDTSSDF
SSDSDTDPENSSKKQKKGKNNDAKNNNAGGSASTSRSCTPTPSNAMSAVAAANAANNQPA
KRAKLDPSYTECGVTEEAVRRYLTRKPMTTTELLTKFKSKRSGVSSERLVETMTQILKRI
NPVKQNINGKMYLSIKQT