DPGLEAN04354 in OGS1.0

New model in OGS2.0DPOGS213269 
Genomic Positionscaffold1029:- 43882-46555
See gene structure
CDS Length1194
Paired RNAseq reads  390
Single RNAseq reads  1054
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001185 (1e-170)
Best Drosophila hit  CG17065, isoform A (4e-121)
Best Human hitputative N-acetylglucosamine-6-phosphate deacetylase isoform 1 (3e-103)
Best NR hit (blastp)  n-acetylglucosamine-6-phosphate deacetylase [Aedes aegypti] (2e-140)
Best NR hit (blastx)  n-acetylglucosamine-6-phosphate deacetylase [Aedes aegypti] (7e-138)
GeneOntology terms

  
GO:0008448 N-acetylglucosamine-6-phosphate deacetylase activity
GO:0005575 cellular_component
GO:0006044 N-acetylglucosamine metabolic process
InterPro families

  
IPR011059 Metal-dependent hydrolase, composite domain
IPR006680 Amidohydrolase 1
IPR003764 N-acetylglucosamine-6-phosphate deacetylase
Orthology groupMCL12915

Nucleotide sequence:

ATGAAGGCAAAATCGGGGTTAACAAGATTTTCTAATTGTTATATTTTGCGTGACGGAAAC
ATTATAAAAGAAGATTTATGGATACGCAATGGAAAAATTGTAAACCCTGAACAGGTATTT
TATGTAGAACAAGAAGAAGCTGACATAACGGTAAACAGCGAAGACTCTCTCATAGTGCCA
GGATTTATAGACATTCAAATAAATGGTGGATGGGGTGTAGATTTTTCCTATGATTCAGAA
AATGTTGAAGAAGGGGTAAATAAAGTATCAAAACAGCTATTGGCTCATGGAGTGACCTCA
TTCTGTCCAACTATGGTTACATCTGAGAAAGATAAATATTATAAGATTTTACCCAAAATA
CAAAAAAGGCAAGGAGGAGAACATGGAGCTACGGTTCTTGGAGTGCATCTTGAAGGGCCA
TTTATTAGTTTAGCAAAAAAGGGTGCACACAAAGATGAATATATTTTAAATCCTGAAAAG
GGGCTCGAATCAATTAAAGAGGTGTATGGATCTTTAGACAATGTAATTTTAGTTACAATA
GCCCCAGAATTGCCTGGGGCTTTGGATGCTATAAGAGGGTTATCAAACATGGGCATCAAA
GTGGCCCTAGGACATTCATCTGCTAGCCTTGCTCAAGGTGAAGAAGGCATTAAAAAGGGA
GCAAACTTAATAACACACTTATTCAATGCTATGCTTCCATTTCATCATCGGGACCCTGGT
TTGGTGGGCTTACTTGCTTCGAAGACTGATAGACAAGTTTATTATGGGATAATATCAGAT
GGCATTCACACTCATCCTGCAGCTTTGAGAATTGCTTGTCGAACTAATCAAGAAGGTTTG
ATATTAGTGAGTGATGCCGTAGCGGCTCAAGGCCTACCAGATGGTGCATACCGCATCGGA
CCTCAAGCTGTAAATGTCAATGAAGGCCGCGCATATGTCGCTGGGACCAAAACTCTCTGT
GGCAGTACTACTGCCCTGGACCAGTCAATTAAAACATTCAAAGAAGCCACAGAATGTTCA
CTGGAATATGCTATAGAAGCAGCAACTCTGCATCCAGCCAAGGCTTTGGGAATAGATGAT
AGGAAGGGCAAATTAAATTTTGGCTTCGACGCAGATTTCGTCATCTTACATCCCAAATCT
CTGAATGTTTTGTCCACTTGGATTGCTGGCGAATGCGTCTATCGATCTTCTTAG

Protein sequence:

MKAKSGLTRFSNCYILRDGNIIKEDLWIRNGKIVNPEQVFYVEQEEADITVNSEDSLIVP
GFIDIQINGGWGVDFSYDSENVEEGVNKVSKQLLAHGVTSFCPTMVTSEKDKYYKILPKI
QKRQGGEHGATVLGVHLEGPFISLAKKGAHKDEYILNPEKGLESIKEVYGSLDNVILVTI
APELPGALDAIRGLSNMGIKVALGHSSASLAQGEEGIKKGANLITHLFNAMLPFHHRDPG
LVGLLASKTDRQVYYGIISDGIHTHPAALRIACRTNQEGLILVSDAVAAQGLPDGAYRIG
PQAVNVNEGRAYVAGTKTLCGSTTALDQSIKTFKEATECSLEYAIEAATLHPAKALGIDD
RKGKLNFGFDADFVILHPKSLNVLSTWIAGECVYRSS