New model in OGS2.0 | DPOGS211027  |
---|---|
Genomic Position | scaffold226:- 80720-85735 |
See gene structure | |
CDS Length | 2412 |
Paired RNAseq reads   | 1205 |
Single RNAseq reads   | 3077 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006506 (3e-39) |
Best Drosophila hit   | elongin A, isoform B (3e-57) |
Best Human hit | transcription elongation factor B polypeptide 3 (1e-32) |
Best NR hit (blastp)   | PREDICTED: similar to Elongin A CG6755-PA, isoform A [Apis mellifera] (2e-85) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC001477 [Tribolium castaneum] (3e-71) |
GeneOntology terms    | GO:0003711 transcription elongation regulator activity GO:0008023 transcription elongation factor complex GO:0045449 regulation of transcription GO:0016021 integral to membrane GO:0005515 protein binding GO:0003677 DNA binding GO:0016944 RNA polymerase II transcription elongation factor activity GO:0008159 positive transcription elongation factor activity GO:0006368 RNA elongation from RNA polymerase II promoter |
InterPro families    | IPR017923 Transcription factor IIS, N-terminal IPR010684 RNA polymerase II transcription factor SIII, subunit A IPR003617 Transcription elongation factor, TFIIS/CRSP70, N-terminal, sub-type |
Orthology group | MCL14145 |
Nucleotide sequence:
ATGGCGTCTGTTTTAGATTTAGTTAAACATTACCAACGATCTATAGAAAAATATCCCAAT
GACGAACAGAAAATATTAAAGAGTATAGATAAGTTATACCACTTGAAAGTAACTGTACAG
CATTTGCAAGATACTGGTGTTGGCCGCACAGTCAATGCTCTGCGAAAGGAACCAGGAGAA
ATTGGACAAGCTGCTAGAGCTCTTGTGTTAAAATGGAAGGTTATGGTGGCTGCGGAAGAA
AGTGATCATGAAGATCATAATGACGACACCCAAAACTACAGTAGTCATGATAATGGCAGG
GATTATGACAGTAACCCAAGCAAATCTACAAGTAAACATGATACATCTGAGAAATCAAAT
AGAAGATCAAAGACTGAAGAGAAGTACCATAAGCAAACAAATGGGGATTATAGTGGAAAT
AAGAGAAAATATCAAAGTAGTGAGGAGGAAGACCACGACAATACAAAAAAATCTAAATAC
TCACAAGATAACGGCTATAATATAAAATCAGAATCAAGAAAGAAAATAGAGTCATCCGAA
AGTGAAAACAGTGAGGATGAATCATCACAAAGTGATAGTGGCAGTGAAGATACAAAGTCA
GAAGATGAAGAAGAAGAGATCCCAGATACTGAAGCAAAGGTAGTGAAAAATCAACACAAA
GAGTCATATAAACCATCCCATTTGAGACAGTCATCTAGTTCACATCAAAGCAAACATAAG
CATGAATCAAGACATGATAGAGAAGATTCAAAGCAAAGCAAGGAACATAATGACAGTTCT
GATAAAAGACATTCATCTGACAAACCCAAAGACAAGTCCCACAGTTCTCACTCCCACAAA
AGTTCTAAAGGGCATGAAAAAAGTTTAGACAAAAAAGACAAAGAAAATAGACATTCAGAC
AAAGAGAAGCAAATAAAAGAAGACTCAAGTAAACACAGTACTAAACATAGATCTGGTAGC
AGTTCAGATAAAACCAAGTCAAGCGGAACTGACAAACACAAGTCTAGTGAGAGTTCACAC
AAACATAAGTCCAGCAAGAGTGAAGATAAACACAGATCAAGTAGCAGTTCAGAGAAAAAT
AAAAAACTTGTACCCGAATATCATAAAAGTCACTCAGAGAGAACATCAAGTAGTACAGAC
AAACAATCCTCACAGATATCTGATAAACACAAAAAATCTTCTCATAAAGATGTTTCTAAT
GATAAACATAAATCTAAAGAACATGATGATAAAAAAGATGAACAAGTCAAAGAAAAGTCA
AAAGAAAAACATAGTTCCTCAAAAGACAAAGAAAGTCACAAAAGTGAGAAGAAACATTCT
TCTAAAGAATCAGGTGACAGCAAACGAAAATCTGATAGCAATCATAACAGCGATAGTTCC
AAGAAAAGTAAACATAAGTCTAGTTCTAGTAAACAATCTAACAAGTCAAGGGAGGACAAG
GAGAAACAACCAAAAAGAACAGAAGATAGCGATGATGGTATAGATTGTGGCTCAGGTGCC
AGTTTTGCCGAAGCACTTGGCATGATAAGTCCGTCAAAGCCAAAGAAAAAATCTATATTT
TCTAAAGATAATATGCAATCTCCACGCTCTCCTAGTGACAATCTCAACCCTCCTAATTTG
CTAGCACCTAGTGCTAAATTGGCGCCATTGCCCTCTTTAGAAATATCTGCCTTACCAGAG
ATATCACCCAATTATCGGCCACGCCCACCTCCGAAATTCCTACCACACTTCAGCGATGAA
GACGCTATGAGTAGCGCAATATCGTCGAAAAATCAGAGAACAAAAGTCTATTCTGGCAAC
AAAGTTATAGGAAAAATCACAACATTATATGAAATGTGTGTCCATGTCCTGCAAGAACAT
ATTGATGCCCTTGAATACACTGGTGGGGTTCCATATGAAATATTAAAACCAGTTGTGGAT
AAAGCAACTCCACAGCAGTTATTTGTTTTGGAACATTACAACCCATACCTCATGGACGAC
ACTGATCATTTGTGGCAGAAATTCTGTGAGAAAAGTTTTAGGAACAAGAAACGACAGGAA
ATGGAGACTTGGAGGGAAATGTATATTCGATGCCAAGAAGAACAAGAAATTAAGCTTAAA
TCACTCACTGCCAACATCAAAATGACTCAAGAGGCAAAGAAGGCGCCCATAAAGCAAACT
AAAATGGCCTATGTTGATACTGTAGTGAAACCACCTCGTAATGTTGCAAAGAAACAGGCA
CAACACGGTACAGCATTTGCTGCTACTGCCAGCCCTGCTGCTAGGGTTGCCTCTCTTTCT
GCAGCACCTAATGTATTAAAAGGTGGCAGGGCTGCCCCAGCCCCGGTTATAACAAACTCA
TCGAACTTCAAGCCCAAGAAAGCACCGCTTATGCAAAAAGCACTGCAATTTATGCGCGGA
AGAAAACGATGA
Protein sequence:
MASVLDLVKHYQRSIEKYPNDEQKILKSIDKLYHLKVTVQHLQDTGVGRTVNALRKEPGE
IGQAARALVLKWKVMVAAEESDHEDHNDDTQNYSSHDNGRDYDSNPSKSTSKHDTSEKSN
RRSKTEEKYHKQTNGDYSGNKRKYQSSEEEDHDNTKKSKYSQDNGYNIKSESRKKIESSE
SENSEDESSQSDSGSEDTKSEDEEEEIPDTEAKVVKNQHKESYKPSHLRQSSSSHQSKHK
HESRHDREDSKQSKEHNDSSDKRHSSDKPKDKSHSSHSHKSSKGHEKSLDKKDKENRHSD
KEKQIKEDSSKHSTKHRSGSSSDKTKSSGTDKHKSSESSHKHKSSKSEDKHRSSSSSEKN
KKLVPEYHKSHSERTSSSTDKQSSQISDKHKKSSHKDVSNDKHKSKEHDDKKDEQVKEKS
KEKHSSSKDKESHKSEKKHSSKESGDSKRKSDSNHNSDSSKKSKHKSSSSKQSNKSREDK
EKQPKRTEDSDDGIDCGSGASFAEALGMISPSKPKKKSIFSKDNMQSPRSPSDNLNPPNL
LAPSAKLAPLPSLEISALPEISPNYRPRPPPKFLPHFSDEDAMSSAISSKNQRTKVYSGN
KVIGKITTLYEMCVHVLQEHIDALEYTGGVPYEILKPVVDKATPQQLFVLEHYNPYLMDD
TDHLWQKFCEKSFRNKKRQEMETWREMYIRCQEEQEIKLKSLTANIKMTQEAKKAPIKQT
KMAYVDTVVKPPRNVAKKQAQHGTAFAATASPAARVASLSAAPNVLKGGRAAPAPVITNS
SNFKPKKAPLMQKALQFMRGRKR