DPGLEAN00991 in OGS1.0

New model in OGS2.0DPOGS215725 
Genomic Positionscaffold788:- 36031-45341
See gene structure
CDS Length3279
Paired RNAseq reads  1615
Single RNAseq reads  3525
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005759 (0.0)
Best Drosophila hit  HDAC6, isoform C (0.0)
Best Human hithistone deacetylase 6 (4e-147)
Best NR hit (blastp)  histone deacetylase hda2, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx)  histone deacetylase [Aedes aegypti] (0.0)
GeneOntology terms





  
GO:0004407 histone deacetylase activity
GO:0016575 histone deacetylation
GO:0008270 zinc ion binding
GO:0005737 cytoplasm
GO:0045449 regulation of transcription
GO:0006099 tricarboxylic acid cycle
GO:0022904 respiratory electron transport chain
InterPro families

  
IPR001607 Zinc finger, UBP-type
IPR000286 Histone deacetylase superfamily
IPR013083 Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL12625

Nucleotide sequence:

ATGGCAGCAGCAAAGCCGTCAGCGTCTGTGGTGGCAGCCAAACGGAAGGCTCAGCAGAAG
AAAAAATTTACAATGGACACAGTGCTTAGAGATCCTTATCAAACGGCCATGGAGTCTAAA
TTCAAAGTGAAGGGTGCTACAGGCTTTGTGACTGACCCTCGGATGTCTGAACACCGTTGC
CTATGGGATGATAACTATCCGGAGTGTCCCGAAAGACTGATAAGTGTCATTAACAGATGT
CAGGAGCTGAATCTAATAGAGCAGTGTAAAGTGTATCCTCCCCGGAGCGCCACCCGCGAG
GAGGTGTTGGAACTGCACTCGCCATCCGTCTACAGTATGATGGAGGGAACCCATCAGAAC
CAGGACCTGGAGTATTTGGAGGAACTGTCTGCTGGCTTCGATGCGGTTTATATACACCCT
ACTACTCACGAGCTGGCTTTATCGTCTGCTGGTTGTACCCTGGATATGGTGGAACGCCTG
GTCTCCGACGAGCTGCAGAACGCCGCGTGTATGGTGAGGCCGCCCGGACACCACGCCATG
AGGGCGGAGCCCTGTGGATACTGTATCTACAACAACGCAGCCCTCGCGGCGAATAGGGCT
CTCAAACTTGGGCTACAGAGAATATTAATAGTGGACTGGGACGTGCATCACGGACAAGCC
ACACAACAGATGTTCTACGACGATCCGAGAGTGGTGTACTTCTCCATCCATCGTTACGAG
CACGGGGCGTTCTGGCCCAACCTGCGGCAGTCAGACTTCCCTTACATAGGGAGCGGGCAG
GGCGAGGGTCACAACTTCAACGTGCCCCTCAACAACACCGGCATGACGGACGCGGACTAT
ATCGCTATATGGCATCAGTTGTTACTGCCCATGGCTTTTGAGTACCAGCCGCAGCTGGTG
CTGGTGTCTGCGGGATACGATGCCGCAGTCGGCTGTCCTGAGTTCTCCCCAGAACTGATC
ATAGTGTCAGCTGGTTATGATGCCGCACTCGGTGATGAGAAGGGTGAGATGGAAGTGACC
CCCGCGTGCTATGCCTCCCTCCTGCACATGCTGCAGGGAGTGTGCTCCCGCGTGTTGGTG
CTGCTAGAGGGCGGGTACTGTCTGCGCTCGCTGGCCGAGGGCGCCGCCCTGACGCTGCGG
ACGCTACTGGGACACGCGCCGCCCGCCCTGCCGCCGTTGCAAGAACCGTGCGAATCGATA
AGAGATTCGATCCTGAACTGTATATACTCGCACAAGAAACACTGGAGGTGCTTCAACAAC
CAGCCGAGCTACAGCATTGACCCCTCCGTGTTGAACACGGGCGAGCGAGGGGTAGGTCAG
CATACAGTGGTCATGAAGTGGGAAGGGGATGAGACCAGAGCCGACAGGTTCGCCACCAGG
AACTGCTATCCATTACAGACCGACGACACCAGGAAGAGGATCCAGGACAGACTGAACCAT
TTACAATTGGCCACGGACGTCAGCTGCTCCGAGCACCCCGTGTGTTACATCTACGACCCG
GCCATGTTGAAGCATCATAATGTTTGTGAACCTGGCCACGTGGAGTGTCCAGAACGTATA
ATGCGTATACATGAGCGGCACAGGGATTTCGGTCTGTTGGAACGCGTCCACCGCCTGCCG
CCCAGGAGCGCCGCCGACGACGAGATACTCGCTGTGCATACTGAGAAGTATCTAGATAGC
CTGAAGGAGTTGTCGAGTACGAAGCTTCGAGATCTGAACGCACAGAGGAAGTCCTTCGAC
TCTGTATATTTCCACCCTGACTCCTTACAAAGCGCCGCAGCCGCTGCCGGCGCCGTTATA
CAGATGGTAGACGCCGTTCTAAGACACGGCAGTGGCGGAGTGTGCGTGGTCCGGCCGCCG
GGACATCACGCGGACGAGGACGTGCCGAGCGGGTTCTGTCTCCTCAACAACGTGGCGGTG
GGCGCCAGGCACGCTCGCGCCGCCCACGGACTGACCAGGATCATGATACTGGACTGGGAC
GTTCATCACGGGAACGGCACGCAGAGGATCACATATGAAGATAAAGAGATACTGTACATA
TCGATCCACCGCTATGACAACGGCTCGTTTTTCCCCAACTCCCCCGCGGCCGACCACACC
GCCGTGGGCCAGGGTCGGGGGGAGGGGTACAACATTAACATACCGTGGAATAAACGAGGT
ATGGGAGACGCGGAGTATCTGTCAGCGATGTGTTCCGTGGTGTTGCCGGTCGCCTACGAG
TACGGGCCGCAGCTCGTGCTGGTGTCGGCCGGCTTCGACGCGGCCGTCGGAGATCCCCTC
GGAGGTTGCAAAGTAACGCCGGAGTGCTACGGGCGGATGACGCACATGTTGCGGGGTCTG
GCCGGGGGGCGGGTCATTGTGTGTCTAGAGGGCGGCTACAACGTCACCAGCATATCGTAC
GCCATGACCATGTGCACTAAGGCCCTCCTCGGAGACCCGCTGCAACATCAGTACGACCCC
AAACAAACGGTCAACCCGTCCGCCGTGGAGAGCATCAATAACGTGATCAGAACACACCAG
AAGTATTGGAAATCTCTAAAGTTTCAACTAGCCCTGCCCATGGAGGACGTCATTGGCCCG
CTCCCCAAATCCAGAGGCTTACCTGACTCCGAGCCGGCGCCGGACCACGCTAAGATCGTG
AGCGACAAACTGAAAAAATATGCACACGATAATAATCTAGCAAAAGAAATATCAAACTTG
AGCATAAGGAACGACTGCACCGACGGCATCCACTGCGGCACCGACGACGAAAACGATGAC
GGCGTTCATAAAACTAAGACAAACGACGGCGCCGGCGCCAGCCACGGGACTGGGAGTGCG
GGGCCCAGCGGCAATAAACAGAACCGCACGGGAAGCGAACCCACGACCCTAGTGGACTAT
CTCTCGGAGAACATGCAGGCCATAGTCAACGAGGAGATGTTCGCTGTGATACCTTTAACC
TGGTGCCCTCATCTCGACATGCTGCACGCCGTCCCCGAGGGCGTGCACTTCCAACAAGGA
GTCCAATGTGTCAGCTGTGATCACGTAGAAGAGAATTGGGTCTGTCTGCACTGTTACATT
ACCGCCTGCGGCAGACATGTGAACGGTCATATGCAGGACCACTTCAAGGCAGCGCAGCAT
CCCTTATCGCTGTCTCTCTCCGACCTCTCGGTGTGGTGCAGTGTGTGCGACGCCTACGTC
GACAACCATCTCCTCTACGACGCCAAAAACAACGCCCACAGATGTAAATTCGGCGAAGAC
ATGCCCTGGTGCTATAACAATACAATACAAATGCACTGA

Protein sequence:

MAAAKPSASVVAAKRKAQQKKKFTMDTVLRDPYQTAMESKFKVKGATGFVTDPRMSEHRC
LWDDNYPECPERLISVINRCQELNLIEQCKVYPPRSATREEVLELHSPSVYSMMEGTHQN
QDLEYLEELSAGFDAVYIHPTTHELALSSAGCTLDMVERLVSDELQNAACMVRPPGHHAM
RAEPCGYCIYNNAALAANRALKLGLQRILIVDWDVHHGQATQQMFYDDPRVVYFSIHRYE
HGAFWPNLRQSDFPYIGSGQGEGHNFNVPLNNTGMTDADYIAIWHQLLLPMAFEYQPQLV
LVSAGYDAAVGCPEFSPELIIVSAGYDAALGDEKGEMEVTPACYASLLHMLQGVCSRVLV
LLEGGYCLRSLAEGAALTLRTLLGHAPPALPPLQEPCESIRDSILNCIYSHKKHWRCFNN
QPSYSIDPSVLNTGERGVGQHTVVMKWEGDETRADRFATRNCYPLQTDDTRKRIQDRLNH
LQLATDVSCSEHPVCYIYDPAMLKHHNVCEPGHVECPERIMRIHERHRDFGLLERVHRLP
PRSAADDEILAVHTEKYLDSLKELSSTKLRDLNAQRKSFDSVYFHPDSLQSAAAAAGAVI
QMVDAVLRHGSGGVCVVRPPGHHADEDVPSGFCLLNNVAVGARHARAAHGLTRIMILDWD
VHHGNGTQRITYEDKEILYISIHRYDNGSFFPNSPAADHTAVGQGRGEGYNINIPWNKRG
MGDAEYLSAMCSVVLPVAYEYGPQLVLVSAGFDAAVGDPLGGCKVTPECYGRMTHMLRGL
AGGRVIVCLEGGYNVTSISYAMTMCTKALLGDPLQHQYDPKQTVNPSAVESINNVIRTHQ
KYWKSLKFQLALPMEDVIGPLPKSRGLPDSEPAPDHAKIVSDKLKKYAHDNNLAKEISNL
SIRNDCTDGIHCGTDDENDDGVHKTKTNDGAGASHGTGSAGPSGNKQNRTGSEPTTLVDY
LSENMQAIVNEEMFAVIPLTWCPHLDMLHAVPEGVHFQQGVQCVSCDHVEENWVCLHCYI
TACGRHVNGHMQDHFKAAQHPLSLSLSDLSVWCSVCDAYVDNHLLYDAKNNAHRCKFGED
MPWCYNNTIQMH