DPGLEAN06962 in OGS1.0

New model in OGS2.0DPOGS202186 
Genomic Positionscaffold747:+ 5555-6808
See gene structure
CDS Length1254
Paired RNAseq reads  284
Single RNAseq reads  809
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013484 (4e-139)
Best Drosophila hit  Rpb4, isoform E (3e-67)
Best Human hittranscriptional adapter 2-alpha isoform a (3e-58)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC030774 [Tribolium castaneum] (7e-78)
Best NR hit (blastx)  PREDICTED: similar to transcriptional adaptor 2 (ADA2 homolog, yeast)-like [Apis mellifera] (2e-78)
GeneOntology terms

















  
GO:0005665 DNA-directed RNA polymerase II, core complex
GO:0003899 DNA-directed RNA polymerase activity
GO:0006366 transcription from RNA polymerase II promoter
GO:0030528 transcription regulator activity
GO:0016591 DNA-directed RNA polymerase II, holoenzyme
GO:0005515 protein binding
GO:0005634 nucleus
GO:0005700 polytene chromosome
GO:0005671 Ada2/Gcn5/Ada3 transcription activator complex
GO:0004402 histone acetyltransferase activity
GO:0042789 mRNA transcription from RNA polymerase II promoter
GO:0000123 histone acetyltransferase complex
GO:0003677 DNA binding
GO:0008270 zinc ion binding
GO:0000166 nucleotide binding
GO:0043966 histone H3 acetylation
GO:0008283 cell proliferation
GO:0035065 regulation of histone acetylation
GO:0043189 H4/H2A histone acetyltransferase complex
InterPro families




  
IPR001005 SANT domain, DNA binding
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
IPR007526 SWIRM
IPR014778 Myb, DNA-binding
IPR017884 SANT, eukarya
Orthology groupMCL10535

Nucleotide sequence:

ATGGCAAATGATTTATTGCAAGTAAAGTGTGATATTTGCGACGAGATCGCTCACGAGCCC
TATATAGAGTGTTGTGAATGCGACACTGTATTGTGTTGTTCCTGTTTCGCGTCGGGAAAA
GAGAAAGATAATCACAGAAACGATCACAAGTACGCTATAAGAAAAAATGACTTTCCACTA
TTTGAAAACTGCAACTGGTCAGCTAAAGAGGAATGTAAGCTATTGAATGCACTATCTAAT
TATGGTTATGGAAATTGGGAAGAAATAGCTAAAAGTGTGCATACGAGATCGAAACTGGAA
TGCCAAGAGCATTATAAAAAGTATTACATAGAAAACGTGAAGTATGATGAGCTGAAATTA
TTACCGGAAACTAAAGAGTCATTATATCAACCACCTCTAACCCCATACCTGTATAACACA
GATCTTAGTATAAACCCACCAAGAAATAACCAATCCGACCCACTTCTCGCCGGTTACAAT
GCTCATAGATCTGACTTTGAACTCAGCTATGACCATAACGCCGAAAACATATTCAGCACC
GATATAAGCTATTCCGCTGATGATGAAGAGGACGATGAATGTATGGATTCGCTGAAGGTT
AGTTTGGTCAGTGCACTAAACACTAGATTAAGAGAAAGGCAACGGCGTTACAACATCATC
CAGGAACATGGACTCATCATGACCAATAAGTTATTGTCCTGGTTGAAGAGGTTCGATAGT
ACTCTGTCCAGATCCAAAGCAGAAAAACTACTGTCATTCATGCAGTTCATGAGTGGAATG
CAGTTTGATAGCTTAATGGAGTCCCTTAGTTTAGAAGAGGAAATTCTCAATAGAATCGTA
AGGCTGTGTGATTATCGGAGGAATGGGATACAAAACGACAAAGTCTATAAAGAACAGAAA
TATGTCACCAATATGATGATTAAGAAATTTGACAGTCAGTCACAGATGAAGAGTAAGAAC
AGTTTGTTTGGTAACAGTATCGGCAGCAAGAAGATAAAAAGAACACTCATGCCGCTGGAC
ATATTGGACATGCCAGGATACCACCTGCTGTCGGACAGTGAGCGAGACTTGTGCTCCAAT
GTCAGAGTGATCCCGGAGAATTTTCTCGACATCAAAAGAGTCCTCATAGCGGAGAACAAC
AAACTGGGTTTCCTACGCTTACTGGATGCACGGCGAGTCGTCAAGATAGATGTGAATAAG
ACTAGGAAAATATATGATCACCTGTTGTCCGAGGGATTCATTGTAAAACCTTAG

Protein sequence:

MANDLLQVKCDICDEIAHEPYIECCECDTVLCCSCFASGKEKDNHRNDHKYAIRKNDFPL
FENCNWSAKEECKLLNALSNYGYGNWEEIAKSVHTRSKLECQEHYKKYYIENVKYDELKL
LPETKESLYQPPLTPYLYNTDLSINPPRNNQSDPLLAGYNAHRSDFELSYDHNAENIFST
DISYSADDEEDDECMDSLKVSLVSALNTRLRERQRRYNIIQEHGLIMTNKLLSWLKRFDS
TLSRSKAEKLLSFMQFMSGMQFDSLMESLSLEEEILNRIVRLCDYRRNGIQNDKVYKEQK
YVTNMMIKKFDSQSQMKSKNSLFGNSIGSKKIKRTLMPLDILDMPGYHLLSDSERDLCSN
VRVIPENFLDIKRVLIAENNKLGFLRLLDARRVVKIDVNKTRKIYDHLLSEGFIVKP