DPGLEAN17726 in OGS1.0

New model in OGS2.0  DPOGS213576
Genomic Position  scaffold398:+ 41653-49854
CDS Length  1824
Paired RNAseq reads  1005
Single RNAseq reads  2532
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011646 (0.0)
Best Drosophila hit  hexosaminidase 2 (4e-126)
Best Human hit  beta-hexosaminidase subunit alpha preproprotein (2e-40)
Best NR hit (blastp)  beta-N-acetylglucosaminidase 1 [Bombyx mori] (0.0)
Best NR hit (blastx)  beta-N-acetylglucosaminidase 1 [Bombyx mori] (0.0)
Gene Ontology terms
GO:0004563 beta-N-acetylhexosaminidase activity
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
GO:0016231 beta-N-acetylglucosaminidase activity
GO:0005886 plasma membrane
InterPro families
IPR017853 Glycoside hydrolase, superfamily
IPR001540 Glycoside hydrolase, family 20
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR015883 Glycoside hydrolase, family 20, catalytic core
IPR015882 Acetylhexosaminidase, subunit a/b
Orthology group  MCL17254
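
The genomic position and CDS length listed in this record can be compared with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_position` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts.

    Hypothetical helper; the layout is inferred from this record only.
    """
    match = re.fullmatch(r"([^:\s]+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold398:+ 41653-49854")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# CDS length as listed in the record.
cds_length = 1824

# Bases inside the genomic span that are not part of the CDS.
non_cds_bases = genomic_span - cds_length

print(scaffold, strand, genomic_span, non_cds_bases)
```

Under that assumption the model spans 8202 bp on the plus strand of scaffold398, of which 1824 bp are coding. The remaining 6378 bp would be introns, and possibly untranslated regions if the listed span includes them.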

Nucleotide sequence:

ATGTGTCGTCTGCTGGTGCTGGCGATAGTATTGGTGCATTTAGTCAACTGTCAGGACAAA
GAAAATGACATAGAGGAATCAACACCGCTTGTCTATGAACCATCCTGGACATATGAATGT
GTGCCGGATGAGGGCTGTCAACGCTCCGATTTTCCGCAGCCGACCCTGAACAGGAGTATG
ATCTTCGATAGCATCGACCTATGCAGGACCGTCTGCGGCCGCTTCGGAGGAATCTGGCCC
AAACCGGTCACGGCGGCCTTGAGCATGCAGACTATTAAGATACATCCGAATTACTTGAGA
TTCGATCTTAGCAACGCGCCGGCAGAAACCAGAAAGATATTGGCCGAGATGTCTCAAGTG
GCCACTCAGAATATCATAAGTGAATGCGAGGGTAATGTCACGGAGGTCGTTGAGATGCCT
GTCATTGTACACATCACGGTCAAAACAGACAACATGAACCTAACCTGGCAGACTGACGAA
CAATACCGCTTGGATGTCCAGAGCAAAGATACCAGCGTCGTGGTCCAAGTTATCGCGGAA
ACGGTTTTCGGAGCACGCCATGGTTTAGAGACTTTGACGCACCTGATTTCGGCTGATAAG
CCAGATTTATCGGAACAATCTAAATGTGGGCTCCGTATGGTAGCGGGTGCCAAAATATGG
GATAAACCTGTGTACCCACATAGAGGATTTCTTCTGGACACGTCAAGAAACTTCATCCCT
ATGGACGATATCAAAAGAATGATCGATGGTTTGGCTACACTCAAGATGAACGTCTTCCAC
TGGCACGTAACAGATTCACACAGCTTCCCGTTGGAGTCAAGACGAGTGCCGCAGTTTACT
AAGTACGGTGCTTATTCCGCCTCAGAGATCTACAGTTCGGAGGAAGTACGTGGTCTGGTG
GAATACGCTCTCGTTCGCGGAGTCCGTATCTTAATTGAAATAGACTCACCAGCGCACGCC
GGCAATGGCTGGCAGTGGGGGAATGAGTACGGTTTGGGTGATCTGGCGGTTTGTGTGAAC
GAGAAACCCTGGCGGCAGCTTTGTATTCAGCCACCCTGCGGGCAACTGAACCCCGCCAAC
CCGGCCGTATATAGAGTATTGAGAGACTTGTACAGAGACATCGCTGAAACTCTCACCAAA
CCTCCTCTATTTCACATCGGCGGTGATGAGGTATTCTTTGAATGCTGGAACTCGTCAAAT
ACGATCCTAGAATACATGCAGACTAAAGGCTACAGCAGGAATGTTGAAGGTTTCATCAAT
TTATGGTCGGAATTTCATGAGAAGGCTCTCAATATTTGGGATGAAGAACTAGCAGCGATC
GGAGAAACGGAGAAGCAGCCAGTCTTAATCTGGTCTTCCGAACTCACACAAGCCCACAGA
ATACAAAAACATTTGGACAAAAAAAGATACACAATTGAAGTTTGGGAACCTCTGTCCAGT
CCTTTGTTAATTCAACTGATACGTCTGGGCTACAACGTTATATCAGTCCCCAAAGACGTG
TGGTATCTCGATCATGGGTTTTGGGGTCAGACGAAATACTCTAACTGGAGAAGAATGTAC
GCACACACACTACCAAGAGATCCAAATGTTTTGGGGGGTGAAGTCGCTATGTGGACTGAA
TATGTGGATAAAGAGGCTTTGGATCCGAGAGTGTTCCCTCGAGTGGCTAGCGTGGCGGAA
CGTCTTTGGTCGGATCCCACGACGGGAGCGAGTGGCGCTCAGCCTCGCCTGCAGCGTGTG
AGGACGAGACTCGTCCAACGAGGACTGAGAGCGGACGTGCTAGCTCCAGGCTGGTGCGCT
CAACACGACACGCGCTGTTTATAG

Protein sequence:

MCRLLVLAIVLVHLVNCQDKENDIEESTPLVYEPSWTYECVPDEGCQRSDFPQPTLNRSM
IFDSIDLCRTVCGRFGGIWPKPVTAALSMQTIKIHPNYLRFDLSNAPAETRKILAEMSQV
ATQNIISECEGNVTEVVEMPVIVHITVKTDNMNLTWQTDEQYRLDVQSKDTSVVVQVIAE
TVFGARHGLETLTHLISADKPDLSEQSKCGLRMVAGAKIWDKPVYPHRGFLLDTSRNFIP
MDDIKRMIDGLATLKMNVFHWHVTDSHSFPLESRRVPQFTKYGAYSASEIYSSEEVRGLV
EYALVRGVRILIEIDSPAHAGNGWQWGNEYGLGDLAVCVNEKPWRQLCIQPPCGQLNPAN
PAVYRVLRDLYRDIAETLTKPPLFHIGGDEVFFECWNSSNTILEYMQTKGYSRNVEGFIN
LWSEFHEKALNIWDEELAAIGETEKQPVLIWSSELTQAHRIQKHLDKKRYTIEVWEPLSS
PLLIQLIRLGYNVISVPKDVWYLDHGFWGQTKYSNWRRMYAHTLPRDPNVLGGEVAMWTE
YVDKEALDPRVFPRVASVAERLWSDPTTGASGAQPRLQRVRTRLVQRGLRADVLAPGWCA
QHDTRCL
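
The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming NCBI translation table 1 applies to this nuclear gene. The function name `translate` and the variable names are illustrative, and only the first and last lines of the nucleotide sequence are translated to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codon bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
first_block = "ATGTGTCGTCTGCTGGTGCTGGCGATAGTATTGGTGCATTTAGTCAACTGTCAGGACAAA"
last_block = "CAACACGACACGCGCTGTTTATAG"

n_terminus = translate(first_block)
c_terminus = translate(last_block)

# The listed CDS length implies the codon count and protein length.
cds_length = 1824
codon_count = cds_length // 3      # includes the terminal stop codon
residue_count = codon_count - 1    # amino acids in the protein

print(n_terminus)
print(c_terminus)
print(codon_count, residue_count)
```

The first block translates to the opening 20 residues of the protein sequence, and the last block ends in a TAG stop codon after QHDTRCL. A CDS of 1824 bases corresponds to 608 codons, that is 607 residues plus the stop, which matches the length of the protein sequence shown.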