New model in OGS2.0 | DPOGS212037  |
---|---|
Genomic Position | scaffold858:15889-19392 (- strand) |
CDS Length | 3504 |
Paired RNAseq reads   | 2235 |
Single RNAseq reads   | 5421 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010005 (1e-83) |
Best Drosophila hit   | bip2 (7e-28) |
Best Human hit | transcription initiation factor TFIID subunit 3 (2e-19) |
Best NR hit (blastp)   | PREDICTED: similar to bip2 CG2009-PA [Tribolium castaneum] (9e-51) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL010857 [Aedes aegypti] (2e-46) |
GeneOntology terms | GO:0005669 transcription factor TFIID complex; GO:0006357 regulation of transcription from RNA polymerase II promoter; GO:0016251 general RNA polymerase II transcription factor activity; GO:0008270 zinc ion binding; GO:0005515 protein binding |
InterPro families | IPR011011 Zinc finger, FYVE/PHD-type; IPR019786 Zinc finger, PHD-type, conserved site; IPR006565 Bromodomain transcription factor; IPR001965 Zinc finger, PHD-type; IPR013083 Zinc finger, RING/FYVE/PHD-type; IPR019787 Zinc finger, PHD-finger |
Orthology group | MCL40480 |
Nucleotide sequence:
ATGTCAGAGGCGTACGCCCGAGAGATATTACGGAGGAATGTTGCCCAAGTATGCCAAACT
ATAGGATGGAATGGGATAAACTCTACGCCACTCGACATTTTAGTGCATGTTTTGGAAAAG
TACATTTGTACTTTGGGCACTCAAGCTAACCGATACGCCGAACAATTTAATAGGACTGAA
CCAAACTTGAATGACCTAGGGTTAGTGTTTCGTGACCTTCACATCCAATTGCCAGAATTA
GGAGAGTATACTAGATCTGTGCCTCCCGTTCCACCTCCTGTTAAAACAGAAAGATTCCCA
AAACCTAAAGAATCTAATTTAAATTTCCTTAAGCCTGGCAGTTATGAGGTAGTTACAAGA
CCTATGCATGTGCATGAGCACTTGCCTCCTATGTACCCAGAGAAAGAAAGAGATACACCT
GTTGTTGCAGGAACAGTTGAAATTCGGCAAAATGGTATTGATAATGTTGATGCTAATGTG
TCCTGTACAAGTCCTGAAATATCTGTCACAGACAGTCCAGAAAAACCTAAAGATATATTT
AAGAGGCCTATTGATCCAGTTTCATTACCAAATAGTAAAAGACCAAGGTTACGACTGGAT
GAAGAGGAAAGGACAAGGGAAATTAGCAGTGTTATGATGACTATGTCAGGTTTTCTTTCA
CCAGCTAGAGAAGGTAAATTACCAGAAGCTAAGCCTCCTACTATTATTTCTGAAAGACAT
CATGACAAGCATAAAGTGAATTCACACCATTCAAATGCAATTAAAGTACCAATGTTAGAT
AAAATTGATAAGAAATCGAAGAAGAGTAAGTTAATTAATGGAAAAATTATGAAAAGTAAA
AGAAAAGATAAGAGTCATAAAGGTGAGGGTAGTAAATCTAAAGATAGTAGTAAATTGGAG
AGGTATCCTCCGGGATATCCAATGAAAAGTAAAGACGTTCATCCAACGCATCATAATCAC
GTGACAATGCCTGCGCCAAGGCCTTTACAAGCACCTGTAAGACCAACAATGCCACCACCA
CCTATACCTTTACCAACACCGCCACCTGTTACAATACAAGAACCTATCAGGCCAGTTATA
AAACAGGAACCAATAGATCCACCCCCACCTGTCACAACTCGGCCATCTCTACCTGCTGAA
GATTCAATTCCGGTACCTAAAAAAATACCAATTTCAAAATCTCCGCTCGTTTCCAATTCT
TTAACTACTCACAAACATCAATCTTCCACAATTTCTCTTATTCCGGATGTTCAAATCAAA
AAAGAGGTTATAGATGAAGAAGAAAAGTTAGCATCTCAGCCAGATAGATCAAAGATTAAT
ATATTTAAAAGAATATCGAACAAATCTAAAGAAGAAAAGCATACACCAGAAGTTGTTCCT
GAAAAATTATTTTCACAGCCAGATACAACAATTTCAAGGTTGCAGAATTCATCACATGAA
ATAAGTGAAAACCGAGTTAAAAGTGCCGAATATGTGAACAATAATAACAGTCCAGTTGAT
TTGTCGCGAGAAATAGATATTAGGTCTCACGAAATTATCAACATTGATGATGATTCATTG
GATGCTCAACCAGTGCCTCATTCTAGAAATACATCCCCTGAGCCAAAATCAGTTGGCCTT
TCAATAAATAAATCGTTACAAGTACCTTTTCCCAAAGATATTGCTAGTGTCAGTCCAAAA
TTGAAGAAGGAAAAGAAACATAAAGATAAAAAAGATAAAGCAGCGAAATTAGAAGCAAAA
CTCAAAAAACAGCATCAACAGTTGGCCTTTGAAATGATGCCAATGGTAGAAAAGAAAAAG
TCTAAAATTAAAAGTGAAAAGTCAGTTAAGTCTAATAGGCTTAAAAATGATTTAAAATTA
CCACAAATGCCACCTGGTTTTCCATTCTTTCCTAACATGCCACCAGGACGGGGAATGATG
CCAGGTCCGGGATTAATACCTAGTCATGGCTTAATACCTGGTGGGGACTTTTTAGCTGGT
TTGACTAACAACCCTGCACTAAGAGGTTTACAACCCCCAAATATCCTTGGTAATCCTTTT
GCTGTAGGGGCAGGCGGACCAGGGCTTATACCAGGATCAAGTTTTCTACCAGGTGGTCTC
GGCCCCGGTATCCCCAATCATCTTATGCCGATGGGTAACTTTCCTCATTCTTCGCGATCG
TCTCCTGTAAAAATACCGCCAATGCTTAGACGTCCAAGTTTAGAGGTTATACCTGTTGAA
AACGAAGAAGACAGGATGATGCATAAATCAGCAATGACGGGGCGCGACAAAGATCGTCAT
GATAAGCACAAATCTCCAACCATTCCAAATATTTTACAAAAACAGAAATCTAAATCAAAT
AAGGATCATAAGTCAAGTATATATAAAATGCCACCTGTACAGCCGGATATAACGATTGAA
TTGAATCCTCCTAAAGAGCCAGTTAGACCTGAACCACCACGAGAAAATCCAACGCCACTT
CGAATACCCACACCAGAACCACAAGCCGTTGTTTCTAAACCTGAACCGCTCCCAATTCCT
GAACCAACGCCAGTCAAACATTCTGCTCCAGAAATTTCTCAAGACCCGGATAATATAGAG
AAGAAGAAAGACAAGTCTCATAAAAAGGAAAAACGAGATAAGGATGGCATTAAAATAAAA
AAGAAGAAGGATAAAAAAGACAAAAATAAAGATAGGTCTGAAAAGAAAAAGGATAAGGAA
GAGAGACAGGAAATTAAAGATAGAATAAAGAAAGAAAAGAAAGAGAAAAAGAAAGAAAAG
TCGGCAGATGGTCTCGTGCCTAAACTTACCCTTAAACTAGCTTCTTCCAACTCAAATTCA
CCGATGCCACCCAGCTCTCCAGATGTATATAAACTAAATATAAAGCCTGTTGTAAAGAAA
GAAGAGGAAGAGACATCTCCTATTAAAGAGGAATCCGTATCACGAGAGCACAGTCGGTCC
CCAGAATTAGCCCAAATATCTGCCTTAGTAACGAGGCCACCAAAGCAGAGACATTCTAAA
CATAATCATGTTTCAGAGCCGTTAGAATCACAATCGCCTCCGCCTATACCTGGTTCGCCG
CAAAGAAAGAATCGTCCACCCTCGAGTCATTCTAAATATAAAAGAATCTTGATAAAACCT
TTGTCGAAGAAAGGTAATAACGAAGATTTTGAAGACGAGCCAGCTACGATATCAGATGAA
CCGCAAGCGCCGGCACCAGTTTCTGTAGAAAAACCAACTGGACCACTTCCAACACCATAT
TATGTGGACGAACAAGGAAACAAAATATGGGTATGTCCCGCTTGTGGACGGCCAGACAAT
GGCTCGCCGATGATAGGTTGTGACGGATGCGATGGGTGGTACCATTGGATCTGTGTTGGA
ATCACGGAGGATCCGGGGGCCACGGAAGACTGGTTTTGTAAATCTTGCGTTGCTAAAAGG
GCTGCGATGGTTCTCGCCGGCGTCACTTCCGGCAAAAAGAGGGGGCGGAAACCAAAAGGA
GAAAAAATCAGAGACTGTCATTGA
Protein sequence:
MSEAYAREILRRNVAQVCQTIGWNGINSTPLDILVHVLEKYICTLGTQANRYAEQFNRTE
PNLNDLGLVFRDLHIQLPELGEYTRSVPPVPPPVKTERFPKPKESNLNFLKPGSYEVVTR
PMHVHEHLPPMYPEKERDTPVVAGTVEIRQNGIDNVDANVSCTSPEISVTDSPEKPKDIF
KRPIDPVSLPNSKRPRLRLDEEERTREISSVMMTMSGFLSPAREGKLPEAKPPTIISERH
HDKHKVNSHHSNAIKVPMLDKIDKKSKKSKLINGKIMKSKRKDKSHKGEGSKSKDSSKLE
RYPPGYPMKSKDVHPTHHNHVTMPAPRPLQAPVRPTMPPPPIPLPTPPPVTIQEPIRPVI
KQEPIDPPPPVTTRPSLPAEDSIPVPKKIPISKSPLVSNSLTTHKHQSSTISLIPDVQIK
KEVIDEEEKLASQPDRSKINIFKRISNKSKEEKHTPEVVPEKLFSQPDTTISRLQNSSHE
ISENRVKSAEYVNNNNSPVDLSREIDIRSHEIINIDDDSLDAQPVPHSRNTSPEPKSVGL
SINKSLQVPFPKDIASVSPKLKKEKKHKDKKDKAAKLEAKLKKQHQQLAFEMMPMVEKKK
SKIKSEKSVKSNRLKNDLKLPQMPPGFPFFPNMPPGRGMMPGPGLIPSHGLIPGGDFLAG
LTNNPALRGLQPPNILGNPFAVGAGGPGLIPGSSFLPGGLGPGIPNHLMPMGNFPHSSRS
SPVKIPPMLRRPSLEVIPVENEEDRMMHKSAMTGRDKDRHDKHKSPTIPNILQKQKSKSN
KDHKSSIYKMPPVQPDITIELNPPKEPVRPEPPRENPTPLRIPTPEPQAVVSKPEPLPIP
EPTPVKHSAPEISQDPDNIEKKKDKSHKKEKRDKDGIKIKKKKDKKDKNKDRSEKKKDKE
ERQEIKDRIKKEKKEKKKEKSADGLVPKLTLKLASSNSNSPMPPSSPDVYKLNIKPVVKK
EEEETSPIKEESVSREHSRSPELAQISALVTRPPKQRHSKHNHVSEPLESQSPPPIPGSP
QRKNRPPSSHSKYKRILIKPLSKKGNNEDFEDEPATISDEPQAPAPVSVEKPTGPLPTPY
YVDEQGNKIWVCPACGRPDNGSPMIGCDGCDGWYHWICVGITEDPGATEDWFCKSCVAKR
AAMVLAGVTSGKKRGRKPKGEKIRDCH
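
Consistency check (illustrative): the genomic span, the CDS length and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the nucleotide sequence above ends in TGA). The function names are hypothetical helpers, not part of any database tooling.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # The stop codon is part of the CDS but does not encode a residue.
    return codons - 1 if includes_stop else codons


# Values copied from the record above.
genomic_span = span_length(15889, 19392)
protein_length = expected_protein_length(3504)

print(genomic_span)    # 3504, equal to the listed CDS length
print(protein_length)  # 1167, the number of residues in the protein sequence
```

Under these assumptions the genomic span equals the CDS length, which would be consistent with a model that has no introns within the coding region.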