New model in OGS2.0 | DPOGS215725  |
---|---|
Genomic Position | scaffold788:- 36031-45341 |
CDS Length | 3279 |
Paired RNAseq reads   | 1615 |
Single RNAseq reads   | 3525 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005759 (0.0) |
Best Drosophila hit   | HDAC6, isoform C (0.0) |
Best Human hit | histone deacetylase 6 (4e-147) |
Best NR hit (blastp)   | histone deacetylase hda2, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | histone deacetylase [Aedes aegypti] (0.0) |
GeneOntology terms    | GO:0004407 histone deacetylase activity; GO:0016575 histone deacetylation; GO:0008270 zinc ion binding; GO:0005737 cytoplasm; GO:0045449 regulation of transcription; GO:0006099 tricarboxylic acid cycle; GO:0022904 respiratory electron transport chain |
InterPro families    | IPR001607 Zinc finger, UBP-type; IPR000286 Histone deacetylase superfamily; IPR013083 Zinc finger, RING/FYVE/PHD-type |
Orthology group | MCL12625 |
Nucleotide sequence:
ATGGCAGCAGCAAAGCCGTCAGCGTCTGTGGTGGCAGCCAAACGGAAGGCTCAGCAGAAG
AAAAAATTTACAATGGACACAGTGCTTAGAGATCCTTATCAAACGGCCATGGAGTCTAAA
TTCAAAGTGAAGGGTGCTACAGGCTTTGTGACTGACCCTCGGATGTCTGAACACCGTTGC
CTATGGGATGATAACTATCCGGAGTGTCCCGAAAGACTGATAAGTGTCATTAACAGATGT
CAGGAGCTGAATCTAATAGAGCAGTGTAAAGTGTATCCTCCCCGGAGCGCCACCCGCGAG
GAGGTGTTGGAACTGCACTCGCCATCCGTCTACAGTATGATGGAGGGAACCCATCAGAAC
CAGGACCTGGAGTATTTGGAGGAACTGTCTGCTGGCTTCGATGCGGTTTATATACACCCT
ACTACTCACGAGCTGGCTTTATCGTCTGCTGGTTGTACCCTGGATATGGTGGAACGCCTG
GTCTCCGACGAGCTGCAGAACGCCGCGTGTATGGTGAGGCCGCCCGGACACCACGCCATG
AGGGCGGAGCCCTGTGGATACTGTATCTACAACAACGCAGCCCTCGCGGCGAATAGGGCT
CTCAAACTTGGGCTACAGAGAATATTAATAGTGGACTGGGACGTGCATCACGGACAAGCC
ACACAACAGATGTTCTACGACGATCCGAGAGTGGTGTACTTCTCCATCCATCGTTACGAG
CACGGGGCGTTCTGGCCCAACCTGCGGCAGTCAGACTTCCCTTACATAGGGAGCGGGCAG
GGCGAGGGTCACAACTTCAACGTGCCCCTCAACAACACCGGCATGACGGACGCGGACTAT
ATCGCTATATGGCATCAGTTGTTACTGCCCATGGCTTTTGAGTACCAGCCGCAGCTGGTG
CTGGTGTCTGCGGGATACGATGCCGCAGTCGGCTGTCCTGAGTTCTCCCCAGAACTGATC
ATAGTGTCAGCTGGTTATGATGCCGCACTCGGTGATGAGAAGGGTGAGATGGAAGTGACC
CCCGCGTGCTATGCCTCCCTCCTGCACATGCTGCAGGGAGTGTGCTCCCGCGTGTTGGTG
CTGCTAGAGGGCGGGTACTGTCTGCGCTCGCTGGCCGAGGGCGCCGCCCTGACGCTGCGG
ACGCTACTGGGACACGCGCCGCCCGCCCTGCCGCCGTTGCAAGAACCGTGCGAATCGATA
AGAGATTCGATCCTGAACTGTATATACTCGCACAAGAAACACTGGAGGTGCTTCAACAAC
CAGCCGAGCTACAGCATTGACCCCTCCGTGTTGAACACGGGCGAGCGAGGGGTAGGTCAG
CATACAGTGGTCATGAAGTGGGAAGGGGATGAGACCAGAGCCGACAGGTTCGCCACCAGG
AACTGCTATCCATTACAGACCGACGACACCAGGAAGAGGATCCAGGACAGACTGAACCAT
TTACAATTGGCCACGGACGTCAGCTGCTCCGAGCACCCCGTGTGTTACATCTACGACCCG
GCCATGTTGAAGCATCATAATGTTTGTGAACCTGGCCACGTGGAGTGTCCAGAACGTATA
ATGCGTATACATGAGCGGCACAGGGATTTCGGTCTGTTGGAACGCGTCCACCGCCTGCCG
CCCAGGAGCGCCGCCGACGACGAGATACTCGCTGTGCATACTGAGAAGTATCTAGATAGC
CTGAAGGAGTTGTCGAGTACGAAGCTTCGAGATCTGAACGCACAGAGGAAGTCCTTCGAC
TCTGTATATTTCCACCCTGACTCCTTACAAAGCGCCGCAGCCGCTGCCGGCGCCGTTATA
CAGATGGTAGACGCCGTTCTAAGACACGGCAGTGGCGGAGTGTGCGTGGTCCGGCCGCCG
GGACATCACGCGGACGAGGACGTGCCGAGCGGGTTCTGTCTCCTCAACAACGTGGCGGTG
GGCGCCAGGCACGCTCGCGCCGCCCACGGACTGACCAGGATCATGATACTGGACTGGGAC
GTTCATCACGGGAACGGCACGCAGAGGATCACATATGAAGATAAAGAGATACTGTACATA
TCGATCCACCGCTATGACAACGGCTCGTTTTTCCCCAACTCCCCCGCGGCCGACCACACC
GCCGTGGGCCAGGGTCGGGGGGAGGGGTACAACATTAACATACCGTGGAATAAACGAGGT
ATGGGAGACGCGGAGTATCTGTCAGCGATGTGTTCCGTGGTGTTGCCGGTCGCCTACGAG
TACGGGCCGCAGCTCGTGCTGGTGTCGGCCGGCTTCGACGCGGCCGTCGGAGATCCCCTC
GGAGGTTGCAAAGTAACGCCGGAGTGCTACGGGCGGATGACGCACATGTTGCGGGGTCTG
GCCGGGGGGCGGGTCATTGTGTGTCTAGAGGGCGGCTACAACGTCACCAGCATATCGTAC
GCCATGACCATGTGCACTAAGGCCCTCCTCGGAGACCCGCTGCAACATCAGTACGACCCC
AAACAAACGGTCAACCCGTCCGCCGTGGAGAGCATCAATAACGTGATCAGAACACACCAG
AAGTATTGGAAATCTCTAAAGTTTCAACTAGCCCTGCCCATGGAGGACGTCATTGGCCCG
CTCCCCAAATCCAGAGGCTTACCTGACTCCGAGCCGGCGCCGGACCACGCTAAGATCGTG
AGCGACAAACTGAAAAAATATGCACACGATAATAATCTAGCAAAAGAAATATCAAACTTG
AGCATAAGGAACGACTGCACCGACGGCATCCACTGCGGCACCGACGACGAAAACGATGAC
GGCGTTCATAAAACTAAGACAAACGACGGCGCCGGCGCCAGCCACGGGACTGGGAGTGCG
GGGCCCAGCGGCAATAAACAGAACCGCACGGGAAGCGAACCCACGACCCTAGTGGACTAT
CTCTCGGAGAACATGCAGGCCATAGTCAACGAGGAGATGTTCGCTGTGATACCTTTAACC
TGGTGCCCTCATCTCGACATGCTGCACGCCGTCCCCGAGGGCGTGCACTTCCAACAAGGA
GTCCAATGTGTCAGCTGTGATCACGTAGAAGAGAATTGGGTCTGTCTGCACTGTTACATT
ACCGCCTGCGGCAGACATGTGAACGGTCATATGCAGGACCACTTCAAGGCAGCGCAGCAT
CCCTTATCGCTGTCTCTCTCCGACCTCTCGGTGTGGTGCAGTGTGTGCGACGCCTACGTC
GACAACCATCTCCTCTACGACGCCAAAAACAACGCCCACAGATGTAAATTCGGCGAAGAC
ATGCCCTGGTGCTATAACAATACAATACAAATGCACTGA
Protein sequence:
MAAAKPSASVVAAKRKAQQKKKFTMDTVLRDPYQTAMESKFKVKGATGFVTDPRMSEHRC
LWDDNYPECPERLISVINRCQELNLIEQCKVYPPRSATREEVLELHSPSVYSMMEGTHQN
QDLEYLEELSAGFDAVYIHPTTHELALSSAGCTLDMVERLVSDELQNAACMVRPPGHHAM
RAEPCGYCIYNNAALAANRALKLGLQRILIVDWDVHHGQATQQMFYDDPRVVYFSIHRYE
HGAFWPNLRQSDFPYIGSGQGEGHNFNVPLNNTGMTDADYIAIWHQLLLPMAFEYQPQLV
LVSAGYDAAVGCPEFSPELIIVSAGYDAALGDEKGEMEVTPACYASLLHMLQGVCSRVLV
LLEGGYCLRSLAEGAALTLRTLLGHAPPALPPLQEPCESIRDSILNCIYSHKKHWRCFNN
QPSYSIDPSVLNTGERGVGQHTVVMKWEGDETRADRFATRNCYPLQTDDTRKRIQDRLNH
LQLATDVSCSEHPVCYIYDPAMLKHHNVCEPGHVECPERIMRIHERHRDFGLLERVHRLP
PRSAADDEILAVHTEKYLDSLKELSSTKLRDLNAQRKSFDSVYFHPDSLQSAAAAAGAVI
QMVDAVLRHGSGGVCVVRPPGHHADEDVPSGFCLLNNVAVGARHARAAHGLTRIMILDWD
VHHGNGTQRITYEDKEILYISIHRYDNGSFFPNSPAADHTAVGQGRGEGYNINIPWNKRG
MGDAEYLSAMCSVVLPVAYEYGPQLVLVSAGFDAAVGDPLGGCKVTPECYGRMTHMLRGL
AGGRVIVCLEGGYNVTSISYAMTMCTKALLGDPLQHQYDPKQTVNPSAVESINNVIRTHQ
KYWKSLKFQLALPMEDVIGPLPKSRGLPDSEPAPDHAKIVSDKLKKYAHDNNLAKEISNL
SIRNDCTDGIHCGTDDENDDGVHKTKTNDGAGASHGTGSAGPSGNKQNRTGSEPTTLVDY
LSENMQAIVNEEMFAVIPLTWCPHLDMLHAVPEGVHFQQGVQCVSCDHVEENWVCLHCYI
TACGRHVNGHMQDHFKAAQHPLSLSLSDLSVWCSVCDAYVDNHLLYDAKNNAHRCKFGED
MPWCYNNTIQMH
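
The record lists a CDS length alongside the nucleotide and protein sequences, so the three can be cross-checked against one another. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1) applies and that the CDS ends in a stop codon. The function names `translate` and `check_record` are illustrative and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the sequence dump
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Ambiguous bases such as N are not handled here and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def check_record(cds: str, protein: str, cds_length: int) -> bool:
    """Return True if the stated length, CDS, and protein agree with each other."""
    cds = "".join(cds.split())
    protein = "".join(protein.split())
    translated = translate(cds)
    return (
        len(cds) == cds_length
        and translated.endswith("*")        # assumes a terminal stop codon
        and translated[:-1] == protein
    )


# First six codons and the final line of the nucleotide sequence above.
print(translate("ATGGCAGCAGCAAAGCCG"))
print(translate("ATGCCCTGGTGCTATAACAATACAATACAAATGCACTGA"))
```

To check the whole record, pass the full nucleotide block, the full protein block, and the stated CDS length (3279) to `check_record`. A CDS of that length corresponds to 1093 codons, that is, 1092 residues plus one stop codon.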