New model in OGS2.0 | DPOGS202186  |
---|---|
Genomic Position | scaffold747:+ 5555-6808 |
See gene structure | |
CDS Length | 1254 |
Paired RNAseq reads   | 284 |
Single RNAseq reads   | 809 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013484 (4e-139) |
Best Drosophila hit   | Rpb4, isoform E (3e-67) |
Best Human hit | transcriptional adapter 2-alpha isoform a (3e-58) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC030774 [Tribolium castaneum] (7e-78) |
Best NR hit (blastx)   | PREDICTED: similar to transcriptional adaptor 2 (ADA2 homolog, yeast)-like [Apis mellifera] (2e-78) |
GeneOntology terms    | GO:0005665 DNA-directed RNA polymerase II, core complex GO:0003899 DNA-directed RNA polymerase activity GO:0006366 transcription from RNA polymerase II promoter GO:0030528 transcription regulator activity GO:0016591 DNA-directed RNA polymerase II, holoenzyme GO:0005515 protein binding GO:0005634 nucleus GO:0005700 polytene chromosome GO:0005671 Ada2/Gcn5/Ada3 transcription activator complex GO:0004402 histone acetyltransferase activity GO:0042789 mRNA transcription from RNA polymerase II promoter GO:0000123 histone acetyltransferase complex GO:0003677 DNA binding GO:0008270 zinc ion binding GO:0000166 nucleotide binding GO:0043966 histone H3 acetylation GO:0008283 cell proliferation GO:0035065 regulation of histone acetylation GO:0043189 H4/H2A histone acetyltransferase complex |
InterPro families    | IPR001005 SANT domain, DNA binding IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR007526 SWIRM IPR014778 Myb, DNA-binding IPR017884 SANT, eukarya |
Orthology group | MCL10535 |
Nucleotide sequence:
ATGGCAAATGATTTATTGCAAGTAAAGTGTGATATTTGCGACGAGATCGCTCACGAGCCC
TATATAGAGTGTTGTGAATGCGACACTGTATTGTGTTGTTCCTGTTTCGCGTCGGGAAAA
GAGAAAGATAATCACAGAAACGATCACAAGTACGCTATAAGAAAAAATGACTTTCCACTA
TTTGAAAACTGCAACTGGTCAGCTAAAGAGGAATGTAAGCTATTGAATGCACTATCTAAT
TATGGTTATGGAAATTGGGAAGAAATAGCTAAAAGTGTGCATACGAGATCGAAACTGGAA
TGCCAAGAGCATTATAAAAAGTATTACATAGAAAACGTGAAGTATGATGAGCTGAAATTA
TTACCGGAAACTAAAGAGTCATTATATCAACCACCTCTAACCCCATACCTGTATAACACA
GATCTTAGTATAAACCCACCAAGAAATAACCAATCCGACCCACTTCTCGCCGGTTACAAT
GCTCATAGATCTGACTTTGAACTCAGCTATGACCATAACGCCGAAAACATATTCAGCACC
GATATAAGCTATTCCGCTGATGATGAAGAGGACGATGAATGTATGGATTCGCTGAAGGTT
AGTTTGGTCAGTGCACTAAACACTAGATTAAGAGAAAGGCAACGGCGTTACAACATCATC
CAGGAACATGGACTCATCATGACCAATAAGTTATTGTCCTGGTTGAAGAGGTTCGATAGT
ACTCTGTCCAGATCCAAAGCAGAAAAACTACTGTCATTCATGCAGTTCATGAGTGGAATG
CAGTTTGATAGCTTAATGGAGTCCCTTAGTTTAGAAGAGGAAATTCTCAATAGAATCGTA
AGGCTGTGTGATTATCGGAGGAATGGGATACAAAACGACAAAGTCTATAAAGAACAGAAA
TATGTCACCAATATGATGATTAAGAAATTTGACAGTCAGTCACAGATGAAGAGTAAGAAC
AGTTTGTTTGGTAACAGTATCGGCAGCAAGAAGATAAAAAGAACACTCATGCCGCTGGAC
ATATTGGACATGCCAGGATACCACCTGCTGTCGGACAGTGAGCGAGACTTGTGCTCCAAT
GTCAGAGTGATCCCGGAGAATTTTCTCGACATCAAAAGAGTCCTCATAGCGGAGAACAAC
AAACTGGGTTTCCTACGCTTACTGGATGCACGGCGAGTCGTCAAGATAGATGTGAATAAG
ACTAGGAAAATATATGATCACCTGTTGTCCGAGGGATTCATTGTAAAACCTTAG
Protein sequence:
MANDLLQVKCDICDEIAHEPYIECCECDTVLCCSCFASGKEKDNHRNDHKYAIRKNDFPL
FENCNWSAKEECKLLNALSNYGYGNWEEIAKSVHTRSKLECQEHYKKYYIENVKYDELKL
LPETKESLYQPPLTPYLYNTDLSINPPRNNQSDPLLAGYNAHRSDFELSYDHNAENIFST
DISYSADDEEDDECMDSLKVSLVSALNTRLRERQRRYNIIQEHGLIMTNKLLSWLKRFDS
TLSRSKAEKLLSFMQFMSGMQFDSLMESLSLEEEILNRIVRLCDYRRNGIQNDKVYKEQK
YVTNMMIKKFDSQSQMKSKNSLFGNSIGSKKIKRTLMPLDILDMPGYHLLSDSERDLCSN
VRVIPENFLDIKRVLIAENNKLGFLRLLDARRVVKIDVNKTRKIYDHLLSEGFIVKP