New model in OGS2.0 | DPOGS202410  |
---|---|
Genomic Position | scaffold1305:+ 31880-39901 |
CDS Length | 1497 |
Paired RNAseq reads   | 1760 |
Single RNAseq reads   | 4210 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003305 (4e-31) |
Best Drosophila hit   | transcription factor IIFalpha (2e-73) |
Best Human hit | general transcription factor IIF subunit 1 (2e-44) |
Best NR hit (blastp)   | GK12178 [Drosophila willistoni] (9e-135) |
Best NR hit (blastx)   | PREDICTED: similar to Transcription initiation factor IIF alpha subunit (TFIIF-alpha) (Transcription factor 5 large chain) (TF5A) [Apis mellifera] (4e-79) |
GeneOntology terms    | GO:0005674 transcription factor TFIIF complex; GO:0016251 general RNA polymerase II transcription factor activity; GO:0006367 transcription initiation from RNA polymerase II promoter; GO:0005634 nucleus; GO:0016986 transcription initiation factor activity; GO:0006366 transcription from RNA polymerase II promoter; GO:0016563 transcription activator activity; GO:0003677 DNA binding; GO:0045941 positive regulation of transcription; GO:0003824 catalytic activity |
InterPro families    | IPR011991 Winged helix-turn-helix transcription repressor DNA-binding; IPR011039 Transcription Factor IIF, Rap30/Rap74, interaction; IPR008851 Transcription initiation factor IIF, alpha subunit |
Orthology group | MCL11762 |
Nucleotide sequence:
ATGACGACACCAGGAACTTCACAACCGGCCACGGTACAAGAATTCAAAATCAGGGTGCCA
AAGAACGTGAAGAAGAAATACCACGTGATGAGATTCAACGCGACCCTCAACGTTGACTTC
GCGAAGTGGACTCACGTGAAGATGGAGAGAGAGAACAACATTAAGGAGTTCAAGGGAACG
GAAGAGGAAATGCCAAAGTTCGGCGCTGGTTCAGAATACGGCAGGGATGTGAGGGAAGAG
GCTCGACGGAAGAAATTTGGTATCATCTCGCGGAAATACAAACCTGAAGATCAACCCTGG
ATACTGAAAGTAGGCGGGAAAACTGGCAAGAAGTTCAAAGGTATCCGCGAGGGCGGTGTC
TCTGAGAACGCAGCCTACTACGTCTTCACGCACGCCGCTGACGGAGCTATCGACGCCTAC
CCTCTACAAGAATGGTACAATTTCCAACCGATCCAGCGCTACAAGGCGCTTTCCGCCGAA
GAAGCGGAACAGGAATTTGGAAGACGTAACAAGGTGATCAATTACTTCTCACTGATGTTC
CGTAAACGTATGAGAGGGGACGACGCGGCCGACGAAGACGATCCCGATGACAAGAAAACC
AAGGGGGCGAAAGCTAAGAAGGATCTGAAGATATCTGAAATGGACGAGTGGATAGATTCC
GACGACGAGTCCTCGGATTCAGAAGGAGACAAAGACAAAGAGAAGGAGGACAGCGACTCC
GGCACCAAAAAGAAGAATAAGAAGAAAGCGGTGCCAAAGAAGAAGAAGAAGGTCAATGAT
GAGGCGTTTGAAGAGAGTGATGATGGAGACGAGGAGGGCAGGGAGAGAGATTATATATCA
GACTCATCGGAGAGTGAGTCCGACCATGAGACGAAAGCCAACAAGGAGCTGAAGGGAGTC
GCCGAGGAAGACGCTCTGAGGAAACTGCTGACATCGGACGAGGGTACGGACTCGGAGCAG
GAACAGAAACAAGAGTCGGAGGGAGAAGACGAGCCCACCAAGGAGGGGGAGGAGAGAGCG
AGCAAACTCACCAAGAAGAAGAAGAAGGAAGACGCCAAGAGAGACACCAGCAGCGACTTC
AGCTCAGACTCCGACACCGACCCCGAGAACAGCAGCAAGAAACAGAAGAAAGGAAAGAAC
AATGACGCGAAAAACAACAACGCGGGTGGTAGCGCGAGCACGTCTCGCTCGTGCACTCCC
ACACCTTCAAACGCGATGTCCGCCGTGGCCGCAGCCAACGCCGCCAACAACCAGCCCGCC
AAGAGAGCGAAGCTGGACCCCTCGTATACGGAGTGCGGCGTCACGGAGGAGGCCGTGCGC
CGTTACTTGACTAGGAAGCCGATGACCACCACGGAGCTGCTGACCAAGTTCAAGTCCAAG
AGGAGCGGCGTGTCCTCCGAGAGGCTCGTGGAGACCATGACGCAGATCCTCAAGAGGATC
AACCCCGTCAAACAGAACATCAACGGCAAGATGTATCTTAGCATCAAACAGACGTGA
Protein sequence:
MTTPGTSQPATVQEFKIRVPKNVKKKYHVMRFNATLNVDFAKWTHVKMERENNIKEFKGT
EEEMPKFGAGSEYGRDVREEARRKKFGIISRKYKPEDQPWILKVGGKTGKKFKGIREGGV
SENAAYYVFTHAADGAIDAYPLQEWYNFQPIQRYKALSAEEAEQEFGRRNKVINYFSLMF
RKRMRGDDAADEDDPDDKKTKGAKAKKDLKISEMDEWIDSDDESSDSEGDKDKEKEDSDS
GTKKKNKKKAVPKKKKKVNDEAFEESDDGDEEGRERDYISDSSESESDHETKANKELKGV
AEEDALRKLLTSDEGTDSEQEQKQESEGEDEPTKEGEERASKLTKKKKKEDAKRDTSSDF
SSDSDTDPENSSKKQKKGKNNDAKNNNAGGSASTSRSCTPTPSNAMSAVAAANAANNQPA
KRAKLDPSYTECGVTEEAVRRYLTRKPMTTTELLTKFKSKRSGVSSERLVETMTQILKRI
NPVKQNINGKMYLSIKQT
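
The reported CDS length (1497 nt) is consistent with the 498-residue protein plus one stop codon: 3 × (498 + 1) = 1497. The check can be sketched in Python as below. The `translate` helper and the constant names are illustrative only and are not part of the database. The sketch assumes the standard nuclear genetic code and a CDS given in frame from the start codon.

```python
from itertools import product

# Standard genetic code in the conventional compact layout:
# codons enumerated with bases in the order T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First and last lines of the nucleotide sequence listed above.
first_line = "ATGACGACACCAGGAACTTCACAACCGGCCACGGTACAAGAATTCAAAATCAGGGTGCCA"
last_line = "AACCCCGTCAAACAGAACATCAACGGCAAGATGTATCTTAGCATCAAACAGACGTGA"

print(translate(first_line))  # MTTPGTSQPATVQEFKIRVP
print(translate(last_line))   # NPVKQNINGKMYLSIKQT*
```

Passing the full nucleotide sequence (with or without line breaks) to `translate` should give the protein sequence above followed by a terminal `*`.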