New model in OGS2.0 | DPOGS216125  |
---|---|
Genomic Position | scaffold847:+ 38742-47971 |
CDS Length | 2373 |
Paired RNAseq reads   | 966 |
Single RNAseq reads   | 2997 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009251 (0.0) |
Best Drosophila hit   | pcaf (0.0) |
Best Human hit | histone acetyltransferase KAT2A (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to GCN5 [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to GCN5 [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0008080 N-acetyltransferase activity; GO:0030914 STAGA complex; GO:0030901 midbrain development; GO:0001701 in utero embryonic development; GO:0001756 somitogenesis; GO:0044154 histone H3-K14 acetylation; GO:0005634 nucleus; GO:0042826 histone deacetylase binding; GO:0021537 telencephalon development; GO:0008283 cell proliferation; GO:0042981 regulation of apoptosis; GO:0007399 nervous system development; GO:0033276 transcription factor TFTC complex; GO:0008152 metabolic process; GO:0043997 histone acetyltransferase activity (H4-K12 specific); GO:0001843 neural tube closure; GO:0016578 histone deubiquitination; GO:0043983 histone H4-K12 acetylation; GO:0022037 metencephalon development; GO:0035264 multicellular organism growth; GO:0005515 protein binding; GO:0010484 H3 histone acetyltransferase activity; GO:0000123 histone acetyltransferase complex; GO:0003713 transcription coactivator activity; GO:0003682 chromatin binding; GO:0006355 regulation of transcription, DNA-dependent |
InterPro families    | IPR001487 Bromodomain; IPR016181 Acyl-CoA N-acyltransferase; IPR018359 Bromodomain, conserved site; IPR009464 PCAF, N-terminal; IPR000182 GCN5-related N-acetyltransferase (GNAT) domain |
Orthology group | MCL11251 |
Nucleotide sequence:
ATGTCTTCTCAATTGACAGACGATATTAATGACATAGCTATGACAGGCTCAGAAGCTGAT
TCGTCTCAAGCTCACGGAAACGAAGCTACATCAAGTGTGACTGCTGATGATGCATCATCA
ACTGCAACTACACCCAATGAGAGTCAAGCCTCCAGACAATCCAACTTGCAGCGCATCCAG
CAAAGGAAGCAGCAGGTTTACAATTGGTCGCACAACAAGAAGCTTTTAAAACTGGCCATA
TATTCAGCCTGTCAGGAACAAGACTGTAATTGCAACGGTTGGAAGACACCAGTGCAACAG
GCGGCAAAAACAAATGCTAGGGCGAGTGATCAGCCACCAGCAAACTTTACTGATCCATGC
CGGAACTGCAATCATATTTTAGAGTCTCATATAACACAGCTCAAAGGTGTTTCTGTGACT
GAAGTCAATAGATTGTTGGGGGCCGTTGTTGATGTCGAGAATATCTTCATGTCAATGCAC
AGAGAAGACGATCACGATACAAAACGTGTTTATTACTATCTATTTAAGCTTCTCAGGAAC
TGTATACTAACCCGGTCCCAGCCTCGCATCGAAGGCCCTTTAGGGCAGCCTCCCTTTGAG
AGGCCGTCCATAGCGAAAGCCATAACAAATTTTGTGTTATACAAGTTCAACACACTACCA
CAGAGGGAATGGCAGACAATGTATGATCTAGCGAAAATGTTCCTACACTGCTTCAACCAC
TGGAACTTTGAAACGCCTAGTGTCAGGAAAATGCAAGTTTCAAATCCAGACGACATATCA
GCCTATCAAATCAATTACACCAGGTGGTTGGTGTTCTGTCATGTGCCAGCGTTCTGTGAT
TCCCTCCCTCACTACGAGACATCCGTTGTGTTTGGTCGGACACTACTACGTGCTGTTTTC
AAATCAGTTTGCAAACAACTCATGGATAAATGTCATTTGGAAAGAGATAGAATGCCCCCG
GAGAAAAGAGTGCTGGTCCTGAACCACTTTCCAAGATTTTTGGGTCTGTTGGAGGAAGAG
ATCTTCAGTGTGAACTCGCCTATATGGGACCCCGACTATAAACAAATGCCGCCTAACCAC
TTGCAGGCTATATTAGATAATAAAACTCCCGGAAAACGCGGCGAGTTCGAACGCGTCACA
GCCTCCGGCGAGAGCAAAGACGGGTTCACAACGGTGACGCTCTCATCTGGTTCAATAAAG
CAGGAGGCTGTTAAGAGGTCCGAGGGGCGAGCGTCTAGCGAGGTGGCTGCCAAGCGGAAG
CGGCTCGACGATGACGTCAGCGAACGGACTGTAGCGGAAATAGTGGCCACTATCACAGAC
CCCAACTACATGTGTGGACCGGATGCTCTGTTCCAAATGCAAGCTCCGCGAGACGAAGCC
GCCAAGCTGGAGGAGCAGCGGAAGCTGATAGAGTTCCACGTCATAGGAAACTCGCTGACC
GGACCCGTGAACAAACAGACAATGCTGTGGCTCATCGGTCTGCACAACGTGTTTAATTAC
AGGAAACACAAAACACTGGCGCTGATCAAAGAAGGTCGACCGATCGGCGGCATCTGCTTC
AGAACATTCCACTCCCAGGGCTTCAGTGAGATAGTTTTCTGTGCTGTGACCTCGAACGAG
CAAGTCAAAGGTTACGGGACACATCTCATGAACCACTTGAAGGACTACCACATCAGGAAC
AACATATTACATTTTCTGACATTCGCTGATGAATTTGCCATTGGCGAGTATTCGTTAATG
CAGGGTTTCAGTAAAGATATCAAGCTCCCCAGAGCGATGTACTCCGGCTACATAAAGGAC
TATGAGGGAGCTACGCTGATGCACTGTGAACTGAACCCTCGCATCGTGTACACAAAGTTC
ACTTCTGTTATTCGGACGCAGAAAGAGATCGTCAAGAAATTAATAGACATGCGTCAGAAG
GAAGTGCGGAAGGTAAATCCTGGCTTGACGTGCTTCAAAGAAGGCGTCCGCAGTATCCCG
GTGGAGTGCGTCCCGGGCGCGCGTGAGGCGGGCTGGCGGGAGGTGAGGACCCGCCCGCCG
GTGGACGGTGACGATAACCACGCAGCGCTGAGATCAGTGCTCACAGCGGTCAAGAATCAC
GCCTCAGCCTGGCCGTTCTTGAAGCCGGTCGATAAGACAGAAGTGCCAGACTACTACGAC
CACATTAAATATCCGATGGATCTCCGTACGATGGGCGAGCGACTCAAATCCCGTTATTAT
TCATCTCGTCGCCTCTTCGTCGCAGACATGGCGCGGATCTTCTCTAACTGCAGACTCTAC
AACTCACCCGACACCGACTACTATAGATGTGCAAACACTCTCGAAAAATATTTCCAGGCC
AAGATGAAGGAGGCTGGACTGTGGGATAAATGA
Protein sequence:
MSSQLTDDINDIAMTGSEADSSQAHGNEATSSVTADDASSTATTPNESQASRQSNLQRIQ
QRKQQVYNWSHNKKLLKLAIYSACQEQDCNCNGWKTPVQQAAKTNARASDQPPANFTDPC
RNCNHILESHITQLKGVSVTEVNRLLGAVVDVENIFMSMHREDDHDTKRVYYYLFKLLRN
CILTRSQPRIEGPLGQPPFERPSIAKAITNFVLYKFNTLPQREWQTMYDLAKMFLHCFNH
WNFETPSVRKMQVSNPDDISAYQINYTRWLVFCHVPAFCDSLPHYETSVVFGRTLLRAVF
KSVCKQLMDKCHLERDRMPPEKRVLVLNHFPRFLGLLEEEIFSVNSPIWDPDYKQMPPNH
LQAILDNKTPGKRGEFERVTASGESKDGFTTVTLSSGSIKQEAVKRSEGRASSEVAAKRK
RLDDDVSERTVAEIVATITDPNYMCGPDALFQMQAPRDEAAKLEEQRKLIEFHVIGNSLT
GPVNKQTMLWLIGLHNVFNYRKHKTLALIKEGRPIGGICFRTFHSQGFSEIVFCAVTSNE
QVKGYGTHLMNHLKDYHIRNNILHFLTFADEFAIGEYSLMQGFSKDIKLPRAMYSGYIKD
YEGATLMHCELNPRIVYTKFTSVIRTQKEIVKKLIDMRQKEVRKVNPGLTCFKEGVRSIP
VECVPGAREAGWREVRTRPPVDGDDNHAALRSVLTAVKNHASAWPFLKPVDKTEVPDYYD
HIKYPMDLRTMGERLKSRYYSSRRLFVADMARIFSNCRLYNSPDTDYYRCANTLEKYFQA
KMKEAGLWDK
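
Consistency check (illustrative):
The CDS length of 2373 nt corresponds to 791 codons, i.e. 790 residues plus a terminal stop codon, which agrees with the protein listing above. The following is a minimal sketch of how the nucleotide and protein sequences can be cross-checked. It assumes the standard genetic code and that the CDS is pasted in as plain text. The names `translate_cds`, `CODON_TABLE`, `BASES` and `AMINO_ACIDS` are hypothetical helpers introduced here and are not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(cds: str) -> str:
    """Translate an in-frame CDS; a single trailing stop codon is dropped."""
    # Remove line breaks and spaces so a multi-line listing can be pasted as-is.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide listing in this record.
first_line = "ATGTCTTCTCAATTGACAGACGATATTAATGACATAGCTATGACAGGCTCAGAAGCTGAT"
last_line = "AAGATGAAGGAGGCTGGACTGTGGGATAAATGA"

print(translate_cds(first_line))  # MSSQLTDDINDIAMTGSEAD
print(translate_cds(last_line))   # KMKEAGLWDK
```

Both outputs match the start and end of the protein listing. Running the same function on the full nucleotide sequence should reproduce the protein sequence in full.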