DPGLEAN08958 in OGS1.0

New model in OGS2.0  DPOGS212037
Genomic Position  scaffold858:- 15889-19392
CDS Length  3504
Paired RNAseq reads  2235
Single RNAseq reads  5421
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010005 (1e-83)
Best Drosophila hit  bip2 (7e-28)
Best Human hit  transcription initiation factor TFIID subunit 3 (2e-19)
Best NR hit (blastp)  PREDICTED: similar to bip2 CG2009-PA [Tribolium castaneum] (9e-51)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL010857 [Aedes aegypti] (2e-46)
GeneOntology terms
GO:0005669 transcription factor TFIID complex
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0016251 general RNA polymerase II transcription factor activity
GO:0008270 zinc ion binding
GO:0005515 protein binding
InterPro families
IPR011011 Zinc finger, FYVE/PHD-type
IPR019786 Zinc finger, PHD-type, conserved site
IPR006565 Bromodomain transcription factor
IPR001965 Zinc finger, PHD-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR019787 Zinc finger, PHD-finger
Orthology group  MCL40480

Nucleotide sequence:

ATGTCAGAGGCGTACGCCCGAGAGATATTACGGAGGAATGTTGCCCAAGTATGCCAAACT
ATAGGATGGAATGGGATAAACTCTACGCCACTCGACATTTTAGTGCATGTTTTGGAAAAG
TACATTTGTACTTTGGGCACTCAAGCTAACCGATACGCCGAACAATTTAATAGGACTGAA
CCAAACTTGAATGACCTAGGGTTAGTGTTTCGTGACCTTCACATCCAATTGCCAGAATTA
GGAGAGTATACTAGATCTGTGCCTCCCGTTCCACCTCCTGTTAAAACAGAAAGATTCCCA
AAACCTAAAGAATCTAATTTAAATTTCCTTAAGCCTGGCAGTTATGAGGTAGTTACAAGA
CCTATGCATGTGCATGAGCACTTGCCTCCTATGTACCCAGAGAAAGAAAGAGATACACCT
GTTGTTGCAGGAACAGTTGAAATTCGGCAAAATGGTATTGATAATGTTGATGCTAATGTG
TCCTGTACAAGTCCTGAAATATCTGTCACAGACAGTCCAGAAAAACCTAAAGATATATTT
AAGAGGCCTATTGATCCAGTTTCATTACCAAATAGTAAAAGACCAAGGTTACGACTGGAT
GAAGAGGAAAGGACAAGGGAAATTAGCAGTGTTATGATGACTATGTCAGGTTTTCTTTCA
CCAGCTAGAGAAGGTAAATTACCAGAAGCTAAGCCTCCTACTATTATTTCTGAAAGACAT
CATGACAAGCATAAAGTGAATTCACACCATTCAAATGCAATTAAAGTACCAATGTTAGAT
AAAATTGATAAGAAATCGAAGAAGAGTAAGTTAATTAATGGAAAAATTATGAAAAGTAAA
AGAAAAGATAAGAGTCATAAAGGTGAGGGTAGTAAATCTAAAGATAGTAGTAAATTGGAG
AGGTATCCTCCGGGATATCCAATGAAAAGTAAAGACGTTCATCCAACGCATCATAATCAC
GTGACAATGCCTGCGCCAAGGCCTTTACAAGCACCTGTAAGACCAACAATGCCACCACCA
CCTATACCTTTACCAACACCGCCACCTGTTACAATACAAGAACCTATCAGGCCAGTTATA
AAACAGGAACCAATAGATCCACCCCCACCTGTCACAACTCGGCCATCTCTACCTGCTGAA
GATTCAATTCCGGTACCTAAAAAAATACCAATTTCAAAATCTCCGCTCGTTTCCAATTCT
TTAACTACTCACAAACATCAATCTTCCACAATTTCTCTTATTCCGGATGTTCAAATCAAA
AAAGAGGTTATAGATGAAGAAGAAAAGTTAGCATCTCAGCCAGATAGATCAAAGATTAAT
ATATTTAAAAGAATATCGAACAAATCTAAAGAAGAAAAGCATACACCAGAAGTTGTTCCT
GAAAAATTATTTTCACAGCCAGATACAACAATTTCAAGGTTGCAGAATTCATCACATGAA
ATAAGTGAAAACCGAGTTAAAAGTGCCGAATATGTGAACAATAATAACAGTCCAGTTGAT
TTGTCGCGAGAAATAGATATTAGGTCTCACGAAATTATCAACATTGATGATGATTCATTG
GATGCTCAACCAGTGCCTCATTCTAGAAATACATCCCCTGAGCCAAAATCAGTTGGCCTT
TCAATAAATAAATCGTTACAAGTACCTTTTCCCAAAGATATTGCTAGTGTCAGTCCAAAA
TTGAAGAAGGAAAAGAAACATAAAGATAAAAAAGATAAAGCAGCGAAATTAGAAGCAAAA
CTCAAAAAACAGCATCAACAGTTGGCCTTTGAAATGATGCCAATGGTAGAAAAGAAAAAG
TCTAAAATTAAAAGTGAAAAGTCAGTTAAGTCTAATAGGCTTAAAAATGATTTAAAATTA
CCACAAATGCCACCTGGTTTTCCATTCTTTCCTAACATGCCACCAGGACGGGGAATGATG
CCAGGTCCGGGATTAATACCTAGTCATGGCTTAATACCTGGTGGGGACTTTTTAGCTGGT
TTGACTAACAACCCTGCACTAAGAGGTTTACAACCCCCAAATATCCTTGGTAATCCTTTT
GCTGTAGGGGCAGGCGGACCAGGGCTTATACCAGGATCAAGTTTTCTACCAGGTGGTCTC
GGCCCCGGTATCCCCAATCATCTTATGCCGATGGGTAACTTTCCTCATTCTTCGCGATCG
TCTCCTGTAAAAATACCGCCAATGCTTAGACGTCCAAGTTTAGAGGTTATACCTGTTGAA
AACGAAGAAGACAGGATGATGCATAAATCAGCAATGACGGGGCGCGACAAAGATCGTCAT
GATAAGCACAAATCTCCAACCATTCCAAATATTTTACAAAAACAGAAATCTAAATCAAAT
AAGGATCATAAGTCAAGTATATATAAAATGCCACCTGTACAGCCGGATATAACGATTGAA
TTGAATCCTCCTAAAGAGCCAGTTAGACCTGAACCACCACGAGAAAATCCAACGCCACTT
CGAATACCCACACCAGAACCACAAGCCGTTGTTTCTAAACCTGAACCGCTCCCAATTCCT
GAACCAACGCCAGTCAAACATTCTGCTCCAGAAATTTCTCAAGACCCGGATAATATAGAG
AAGAAGAAAGACAAGTCTCATAAAAAGGAAAAACGAGATAAGGATGGCATTAAAATAAAA
AAGAAGAAGGATAAAAAAGACAAAAATAAAGATAGGTCTGAAAAGAAAAAGGATAAGGAA
GAGAGACAGGAAATTAAAGATAGAATAAAGAAAGAAAAGAAAGAGAAAAAGAAAGAAAAG
TCGGCAGATGGTCTCGTGCCTAAACTTACCCTTAAACTAGCTTCTTCCAACTCAAATTCA
CCGATGCCACCCAGCTCTCCAGATGTATATAAACTAAATATAAAGCCTGTTGTAAAGAAA
GAAGAGGAAGAGACATCTCCTATTAAAGAGGAATCCGTATCACGAGAGCACAGTCGGTCC
CCAGAATTAGCCCAAATATCTGCCTTAGTAACGAGGCCACCAAAGCAGAGACATTCTAAA
CATAATCATGTTTCAGAGCCGTTAGAATCACAATCGCCTCCGCCTATACCTGGTTCGCCG
CAAAGAAAGAATCGTCCACCCTCGAGTCATTCTAAATATAAAAGAATCTTGATAAAACCT
TTGTCGAAGAAAGGTAATAACGAAGATTTTGAAGACGAGCCAGCTACGATATCAGATGAA
CCGCAAGCGCCGGCACCAGTTTCTGTAGAAAAACCAACTGGACCACTTCCAACACCATAT
TATGTGGACGAACAAGGAAACAAAATATGGGTATGTCCCGCTTGTGGACGGCCAGACAAT
GGCTCGCCGATGATAGGTTGTGACGGATGCGATGGGTGGTACCATTGGATCTGTGTTGGA
ATCACGGAGGATCCGGGGGCCACGGAAGACTGGTTTTGTAAATCTTGCGTTGCTAAAAGG
GCTGCGATGGTTCTCGCCGGCGTCACTTCCGGCAAAAAGAGGGGGCGGAAACCAAAAGGA
GAAAAAATCAGAGACTGTCATTGA

Protein sequence:

MSEAYAREILRRNVAQVCQTIGWNGINSTPLDILVHVLEKYICTLGTQANRYAEQFNRTE
PNLNDLGLVFRDLHIQLPELGEYTRSVPPVPPPVKTERFPKPKESNLNFLKPGSYEVVTR
PMHVHEHLPPMYPEKERDTPVVAGTVEIRQNGIDNVDANVSCTSPEISVTDSPEKPKDIF
KRPIDPVSLPNSKRPRLRLDEEERTREISSVMMTMSGFLSPAREGKLPEAKPPTIISERH
HDKHKVNSHHSNAIKVPMLDKIDKKSKKSKLINGKIMKSKRKDKSHKGEGSKSKDSSKLE
RYPPGYPMKSKDVHPTHHNHVTMPAPRPLQAPVRPTMPPPPIPLPTPPPVTIQEPIRPVI
KQEPIDPPPPVTTRPSLPAEDSIPVPKKIPISKSPLVSNSLTTHKHQSSTISLIPDVQIK
KEVIDEEEKLASQPDRSKINIFKRISNKSKEEKHTPEVVPEKLFSQPDTTISRLQNSSHE
ISENRVKSAEYVNNNNSPVDLSREIDIRSHEIINIDDDSLDAQPVPHSRNTSPEPKSVGL
SINKSLQVPFPKDIASVSPKLKKEKKHKDKKDKAAKLEAKLKKQHQQLAFEMMPMVEKKK
SKIKSEKSVKSNRLKNDLKLPQMPPGFPFFPNMPPGRGMMPGPGLIPSHGLIPGGDFLAG
LTNNPALRGLQPPNILGNPFAVGAGGPGLIPGSSFLPGGLGPGIPNHLMPMGNFPHSSRS
SPVKIPPMLRRPSLEVIPVENEEDRMMHKSAMTGRDKDRHDKHKSPTIPNILQKQKSKSN
KDHKSSIYKMPPVQPDITIELNPPKEPVRPEPPRENPTPLRIPTPEPQAVVSKPEPLPIP
EPTPVKHSAPEISQDPDNIEKKKDKSHKKEKRDKDGIKIKKKKDKKDKNKDRSEKKKDKE
ERQEIKDRIKKEKKEKKKEKSADGLVPKLTLKLASSNSNSPMPPSSPDVYKLNIKPVVKK
EEEETSPIKEESVSREHSRSPELAQISALVTRPPKQRHSKHNHVSEPLESQSPPPIPGSP
QRKNRPPSSHSKYKRILIKPLSKKGNNEDFEDEPATISDEPQAPAPVSVEKPTGPLPTPY
YVDEQGNKIWVCPACGRPDNGSPMIGCDGCDGWYHWICVGITEDPGATEDWFCKSCVAKR
AAMVLAGVTSGKKRGRKPKGEKIRDCH
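
Consistency check (illustrative):

The figures in this record can be cross-checked against one another. The sketch below is a minimal example, not part of the database itself. It assumes the genomic coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein. The function names are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


# Values copied from the record above.
start, end = 15889, 19392
cds_length = 3504

print(span_length(start, end))              # 3504
print(expected_protein_length(cds_length))  # 1167
```

Under these assumptions the genomic span equals the stated CDS length, and the implied protein length of 1167 residues matches the protein sequence listed above (19 full lines of 60 residues plus a final line of 27).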