DPGLEAN16336 in OGS1.0

New model in OGS2.0  DPOGS214332
Genomic Position  scaffold29:- 280593-282953
CDS Length  1905
Paired RNAseq reads  969
Single RNAseq reads  2520
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003990 (0.0)
Best Drosophila hit  fused lobes, isoform B (4e-135)
Best Human hit  beta-hexosaminidase subunit beta preproprotein (2e-54)
Best NR hit (blastp)  beta-N-acetylglucosaminidase [Spodoptera frugiperda] (0.0)
Best NR hit (blastx)  beta-N-acetylglucosaminidase [Spodoptera frugiperda] (0.0)
GeneOntology terms
GO:0004563 beta-N-acetylhexosaminidase activity
GO:0007420 brain development
GO:0043169 cation binding
GO:0006491 N-glycan processing
GO:0005770 late endosome
GO:0032428 beta-N-acetylgalactosaminidase activity
GO:0005794 Golgi apparatus
GO:0016231 beta-N-acetylglucosaminidase activity
GO:0006032 chitin catabolic process
GO:0005886 plasma membrane
GO:0005783 endoplasmic reticulum
InterPro families
IPR001540 Glycoside hydrolase, family 20
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR017853 Glycoside hydrolase, superfamily
IPR015883 Glycoside hydrolase, family 20, catalytic core
IPR015882 Acetylhexosaminidase, subunit a/b
Orthology group  MCL16823

Nucleotide sequence:

ATGAAGTCGTGGGGGGAAACGTTATGGCGGAGTGCCTCCTCGCACTTGTACCGAGTCGGC
AGGTTGCGGCGAGCTCTGCTGCTCTTGGCTGCCGCTGCCTGTACTGCTGCGGGCCTTCTC
TATTGGAGACAACAGACGGACGACTCTGCTAGACGACCGCTACACTTGTACGCTGGTGTA
GAGCCACAGTGGTCTTGGGTATGTCGGAACGATCGTTGTGAGCGACTCCTAGCATCAGAG
ACCTCTATACTTCAGTCACTTCCTACGTGCAACATGCTCTGCGCGTCCACTCAATTATGG
CCGCAGCCAACCGGCCCCGTCAGTCTCGCTACGGCCTCAGTTCATGTCAGATCCAGCGGC
TTCTCACTACAAGTCATATCCTCTCCATCAAGAGAAGTGACAGAAAACCTCAACGATGCC
TTCCAATTAATGCGCGACGACTTGAAAATTCTGGAGAAAAACGCGGGCGTAGAGAACAGG
AGATCAGATAGTGGAACTCCCCGTGAAGTTGTTGTAAGGGTCGCTGTGAACGGCAGCGCT
GATCCACGCATGCGACAAGACACCGATGAAACCTACAAGCTCTCTCTCAGACCGTCGGGG
AAGTCCCTCGTCGCTGATATAACAGCGCATTCCTTCTGTGGAGCTCGGCACGGCTTTGAA
ACTCTGTCCCAACTAGTGTGGTTGGATCCTTACGCTGAATCTCTCTTAATACTCGAAGCT
GCCACCGTGGACGACGGCCCTCGGTTTAGATATCGTGGTTTGTTATTGGATACAGCCAGG
AATTTCTTCCCCGTAACTGACATATTGCGTACAATCGATGCTATGGGAGCGTGCAAGCTG
AACACGTTCCATTGGCATGTGAGTGACTCGCAGTCCTTTCCTTTGAGACTGAACAGCGCT
CCTCAACTAGCTCAGCACGGAGCTTATGGGCCTGGTGCTATATACACGACTGACGATGTA
AGGGCTATAGTACGCCGAGCTAGATTGAGAGGAATACGTGTCTTGATAGAAGTAGATGCG
CCGGCGCATGTTGGACGAGCGTGGTCGTGGGGCCCTCCTGCTGGGTTAGGACACTTAGCG
CATTGTGTTGAAGTAGAACCTTGGAGTACTTATTGTGGTGAACCGCCTTGTGGGCAATTA
AACCCACGAAATCCACACGTTTACTCACTTCTTGAACAGATTTATGCCGAAATCATTCAA
CTGACCGAAGTGGACGATATCTTCCATTTAGGCGGGGACGAGGTCTCGGAGCGGTGTTGG
GCTCAACACTTTAACGACACGGATCCCATGGAGTTATGGTTTGAGTTCACTCGTCGCGCC
ATGTCCTCCCTCGAACGTGCCAATGGCGGTAAACTGCCAGATCTAACGTTACTGTGGTCT
TCTCGGCTAACTCATACACCGTACCTGGAACGTTTAGATAAGAAGAGACACGGCGTGCAG
GTGTGGGGCTCGTCCCGGTGGCCGGAATCTCGCGCGGTATTGGACGCGGGCTACAGAACG
ATCATATCTCACGTAGACGCTTGGTACTTAGACTGCGGCTTCGGGTCCTGGCGAGATAGT
TCCGACGGTCACTGTGGACCTTACCGGTCTTGGCAGCAAATTTACGAGCACAGACCCTGG
ATAGAGGAAATGCCGGCCATGTCTACTGGAGTCGAACCATGGCAAGTGGAAGGCGGCGCG
ACGTGTCAGTGGACGGAACAGCTGGGTTCCGGAGGTTTGGATGCTAGAGTGTGGCCGAGG
ACTGCGGCGGTCGCGGAGCGTCTCTGGTCGGACCGCGCCGAGGGCGCCACCGCCGACGTC
TACCTGCGACTCGACACACAACGATCACGACTCCTAGATAAAGGGATCCAAGCCGCTCCT
CTCTGGCCGCGGTGGTGCTCTCACAACCCTCACGCCTGCCTTTAG

Protein sequence:

MKSWGETLWRSASSHLYRVGRLRRALLLLAAAACTAAGLLYWRQQTDDSARRPLHLYAGV
EPQWSWVCRNDRCERLLASETSILQSLPTCNMLCASTQLWPQPTGPVSLATASVHVRSSG
FSLQVISSPSREVTENLNDAFQLMRDDLKILEKNAGVENRRSDSGTPREVVVRVAVNGSA
DPRMRQDTDETYKLSLRPSGKSLVADITAHSFCGARHGFETLSQLVWLDPYAESLLILEA
ATVDDGPRFRYRGLLLDTARNFFPVTDILRTIDAMGACKLNTFHWHVSDSQSFPLRLNSA
PQLAQHGAYGPGAIYTTDDVRAIVRRARLRGIRVLIEVDAPAHVGRAWSWGPPAGLGHLA
HCVEVEPWSTYCGEPPCGQLNPRNPHVYSLLEQIYAEIIQLTEVDDIFHLGGDEVSERCW
AQHFNDTDPMELWFEFTRRAMSSLERANGGKLPDLTLLWSSRLTHTPYLERLDKKRHGVQ
VWGSSRWPESRAVLDAGYRTIISHVDAWYLDCGFGSWRDSSDGHCGPYRSWQQIYEHRPW
IEEMPAMSTGVEPWQVEGGATCQWTEQLGSGGLDARVWPRTAAVAERLWSDRAEGATADV
YLRLDTQRSRLLDKGIQAAPLWPRWCSHNPHACL
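
Consistency check (illustrative sketch):

The CDS length reported above (1905 nt) and the protein sequence (634 residues) fit the usual relation: number of codons = number of residues + 1 stop codon, so 1905 = 3 x (634 + 1). The short Python sketch below shows one way to run that check on a CDS/protein pair. The function name `check_cds` and the returned keys are hypothetical and not part of the database or any library, and the stop codon set assumes the standard genetic code.

```python
# Hypothetical helper: sanity-check a CDS against its predicted protein.
# Assumes the standard genetic code for stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> dict:
    """Return simple consistency flags for a CDS/protein pair."""
    # Strip whitespace and line breaks so wrapped sequence blocks can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    # Split the CDS into consecutive triplets (the last may be short if malformed).
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]

    return {
        "length_multiple_of_3": len(cds) % 3 == 0,
        "starts_with_ATG": cds.startswith("ATG"),
        "ends_with_stop": bool(codons) and codons[-1] in STOP_CODONS,
        # One codon per residue, plus the terminal stop codon.
        "codons_match_protein": len(codons) == len(protein) + 1,
    }


# Toy input: the first three codons of the sequence above followed by its stop codon.
result = check_cds("ATGAAGTCGTAG", "MKS")
```

Pasting the full nucleotide and protein blocks from this record into `check_cds` should set all four flags to true if the record is internally consistent. The sketch only compares lengths and start/stop codons; it does not translate the sequence.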