New model in OGS2.0 | DPOGS215600  |
---|---|
Genomic Position | scaffold446:+ 111023-114316 |
See gene structure | |
CDS Length | 1242 |
Paired RNAseq reads   | 617 |
Single RNAseq reads   | 1517 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000355 (5e-171) |
Best Drosophila hit   | transcription factor IIEalpha (8e-134) |
Best Human hit | general transcription factor IIE subunit 1 (5e-85) |
Best NR hit (blastp)   | PREDICTED: similar to Transcription factor IIE CG10415-PA [Apis mellifera] (3e-162) |
Best NR hit (blastx)   | PREDICTED: similar to Transcription factor IIE CG10415-PA [Apis mellifera] (7e-141) |
GeneOntology terms    | GO:0006367 transcription initiation from RNA polymerase II promoter GO:0016251 general RNA polymerase II transcription factor activity GO:0005673 transcription factor TFIIE complex |
InterPro families    | IPR021600 Transcription factor TFIIE, alpha subunit, C-terminal, metazoa IPR017919 Transcription factor TFE/TFIIEalpha, HTH domain IPR002853 Transcription factor TFIIE, alpha subunit IPR013083 Zinc finger, RING/FYVE/PHD-type |
Orthology group | MCL14239 |
Nucleotide sequence:
ATGACTGAGGAACGCTTGGTGACTGAGGTGCCAAGCAGCTTGAAGCAGTTGGCAAGGCTG
GTGGTGAGAGGTTTTTACACCATCGAAGATGCCTTGATCGTGGACATGTTGGTTCGAAAC
CCTTGTATGAAGGAAGATGATATCTGTGAACTGTTAAAGTTTGAAAGAAAAATGTTGAGG
GCTCGAATAGCTACACTAAAAAATGACAAGTTCATACAAGTAAGGCTTAAAATGGAAACT
GGTTTGGATGGTAAAGCACAAAAAGTGAACTACTACTTTATAAACTACAAAACGTTCGTG
AACGTAGTAAAATACAAATTGGATTTGATGCGCAAACGCATGGAAACTGAGGAGAGAGAC
GCCACCAGTCGCGCCAGCTTCAAATGTCCTTCTTGTGGGAAGACATTTACAGACCTCGAG
GCTGATCAACTTTACGACATGATGACCCAAGAGTTCAGGTGTACATTTTGTAACCAAGTT
GTGGAGGAAGACCAGTCAGCACTGCCGAAGAAAGATTCAAGGTTGCTGTTAGCGAAGTTC
AACGAACAGTTAGAGACGCTTTACATCTTACTGAGGGAGGTCGAAGGCATTAAATTGGCT
CCAGAGATATTGGAACCCGAACCTGTCGATATCAACACCATCAGAGGATTAACCGCTAAG
CATATGACGAGTCGTCCTGGTGGTGGCGAGTGGTCGGGCGAGGCGACCCGCAGTCAGGGT
CTGGCGGTGGAGGAGACAAGAGTGGACATCACCATCGGGGACGGCGACAGGACGGACACC
GCCGCCGCCCGCAAGGAGCGACCCGTCTGGATGGTGGAGAGCACCATCACCACCGGCGAA
CAGAGCGAGTCTTCATTGGTGTCTGGTGAGGTCGAGCGATCAACCGGCAAGCAACCAGCC
AAGGAGAAGGGAGATGACATCATGTCGGTGTTGTTGGCACACGAGAAACAAAATACAGGG
AACGTCGCAGCGAATGCCTTAAAGGGAGCGGATCCTGAGAGCTCGGATTCGAGCGACAAC
GAATCAAAGGATCCCTATAAACTTAAAGACGAGATCGCAGCTGTTGCTGAAATGGACAGC
GAAGATTCCGAATCAGATGACAACGCCCCTACCGTATTGGTGAACGGCAAGCCGGTAGCT
CTGACTAGTATAGACGATGACGTCATCGCTCGTATGACTCCCGCAGAGAAGGAAACATAC
ATACAAGTGTATCAAGAGTACTACAGCCATATGTATGACTAG
Protein sequence:
MTEERLVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLR
ARIATLKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRMETEERD
ATSRASFKCPSCGKTFTDLEADQLYDMMTQEFRCTFCNQVVEEDQSALPKKDSRLLLAKF
NEQLETLYILLREVEGIKLAPEILEPEPVDINTIRGLTAKHMTSRPGGGEWSGEATRSQG
LAVEETRVDITIGDGDRTDTAAARKERPVWMVESTITTGEQSESSLVSGEVERSTGKQPA
KEKGDDIMSVLLAHEKQNTGNVAANALKGADPESSDSSDNESKDPYKLKDEIAAVAEMDS
EDSESDDNAPTVLVNGKPVALTSIDDDVIARMTPAEKETYIQVYQEYYSHMYD