DPGLEAN21138 in OGS1.0

New model in OGS2.0  DPOGS212532
Genomic Position  scaffold1463:+ 37233-41368
CDS Length  2295
Paired RNAseq reads  25
Single RNAseq reads  57
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008188 (2e-147)
Best Drosophila hit  fused lobes, isoform C (3e-56)
Best Human hit  beta-hexosaminidase subunit beta preproprotein (2e-43)
Best NR hit (blastp)  hexosaminidase [Ostrinia furnacalis] (5e-105)
Best NR hit (blastx)  hexosaminidase [Ostrinia furnacalis] (2e-93)
Gene Ontology terms
GO:0004563 beta-N-acetylhexosaminidase activity
GO:0007420 brain development
GO:0043169 cation binding
GO:0006491 N-glycan processing
GO:0005770 late endosome
GO:0032428 beta-N-acetylgalactosaminidase activity
GO:0005794 Golgi apparatus
GO:0016231 beta-N-acetylglucosaminidase activity
GO:0006032 chitin catabolic process
GO:0005886 plasma membrane
GO:0005783 endoplasmic reticulum
InterPro families
IPR001540 Glycoside hydrolase, family 20
IPR015883 Glycoside hydrolase, family 20, catalytic core
IPR015882 Acetylhexosaminidase, subunit a/b
IPR017853 Glycoside hydrolase, superfamily
IPR013781 Glycoside hydrolase, subgroup, catalytic core
Orthology group  MCL40239

Nucleotide sequence:

ATGAATTCGAGAACCGGCGCAGTGGACGTCGACATCTGTAAGAATTTGGATAAAAACGAG
CAAATAGCAATATTAAAATTTTGCGAAACATTATCAGAGATGGAAGCAACTATTTATAAT
GTTGAGTGTGCTATTAAAGCTGCCTGGAGCTATCAAAACGAAAACCCAGACAATCAAGGC
ACGGAGACATCATCAGATTCTTCAAAGCAATTCTCATCGTTTATTTCGGATCCCCAAGAG
GTGAAGCGAACACCGATCTTGCAGTTTAAATCATCAATAACAGTTCTTCCGTCGGTTGTG
ACCGAAGCATCGCTGTCGACGTTTGATGATTGCTCGGAGAGTTTTGACTTCAAGAACTCG
TTTGGACGAGAGCCTACCATCTCCTCATACTCCAAAGATATCAAAATGGAAGCCCATTAT
AAACAAAACGAGGATTCCATTGAATCCAACCAAATATATTCCATCACAACGGAACCAGTG
ACTAAGCAAGCGGGCCAGGATCCTAAGGAATCAGATCACAGCTGTCAACCAGACTGCTCG
TGCACCAAATGCACTAACCAAGGCCCGTCTTGTGGAGATCCTGGATGTACCAAGGAACTC
TGTGACGATAAATCTGATGATTCCAAAAAAAAAGAAAAAGACGAAACGTTGCCCGAATGG
GCTTGGCAATGTCGCAAAAATTTTTGCCATAAAGTTTTCCGTCCTGCAACACTCCTACGC
CACTATACATCTTTCAGTAGGTGTACTTTGTTATGCATGGGTCCACAGCTCTGGCCCTAC
CCTATTGGCTACACACATTTTAGTAAAACTATCGTATCAGTATCCACAAAAAACTTGGAA
TATAAATTTCAATCCGTACCATCAGACAGCGTTCATTTATACCTAGCAGAGGCTTTTAAG
CTGTTCATAAAAGATTTGGCAAGATTAGAAAAACTACAAACAAAACCGACTAATCATAGT
AAGGCCACGGTTAAAAAAATGGTAATACTTATAGATGTAGAAAGTGATTCTGACCCACGG
CTTCGTATAAATACCGACGAGGGATATATGTTGAAGGTAGAAACTAAAAATAATCAAGTT
ATTATAAAAGTAAGTGGACTATCATTTTGTGGAGCTCGTCACGGGTTTGAAACCCTAAGT
CAGTTGATCTTGTTAGATCAAAGCACGGGTTACCTTATCATGCTATCCAGTGCGATAATA
AAGGATGCCCCAACTTACAAATACAGAGGATTGATGGTTGACACAGGAAGAAATTATATA
CCTGTTGTGGACTTATTAAGAACTGTTGACGCAATGTCTACCTGTAAATTGAACACATTT
CACTGGAGAATATCTGACGCTACAAGCTTTCCAATGAGCTTGTCAAAAATACCAGAATTA
GAGGAATATGGACCCTATGATAGATCAATGGTGTATACAAAAAAAGATATCAGGATGATT
GTGAATAGAGCCGGTATTCGTGGGATAAGAGTTTTAATAGAAATAGCTGCACCAGGTCCA
GTTGGCAGACCCTTTTCCTGGTTGTCCTCCACGACTTGTTCCCGAAAAAATAACAGCCTT
ACTTGCGACAATGATCTTTGTAGGCGTCTGACAATGCACGACTCTACATTTGATGTGCTT
CAAAAAATATATTCTGAAATCCTTGAAATGACGAACGTCGATGACGTCTTCCATTTGAGT
GATAGCGTTTTCTCGATGACCAATTGTTATTATTTATTCGATGATCGCGAGGGATTTTTA
GATAAAGCTCTATTCCGTCTGAAGATGGCTAATAAAGGATTTCTGCCACAACTGCCTATT
ATTTGGTACACGTCACATTTAATGAAACATTTTGAAGCTAAAACTTGGGAGAGATTAGGA
GTGCAAATCGATGAATGGGATGCAAACCCTTATGAGTCATATTTAAATAAATTTAGGGTT
ATACATTCTACCAAGTGGGATTTGTCTTGCGAAATGAGAAAGCAGAGATGCATAAGATAC
AGAACCTGGCAACAAATGTATTTATGGAAATCTTGGAGAAATGTTAATGTGTTTACTACC
GAAGGAGGAGAATCTATTTTATGGACAGACTTGGTTGATTCAAGTAACCTTGATTACCAT
CTATGGCCTCGTGCTGCAGTTGTTGCAGAACGTTTGTGGTCAGATGTGGTGGCCAATGGA
AGTGCCAACAAATACGTTTACATGAGGCTAGATACCCATAGATGGAGAATGATGCAACGT
GGCATCCAGGTGCAACCGATTTGGCCACCTTGGTGCAGTTTCAGCCCTAGCTCATGCCTT
GAGAGAGTGCATTAA
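
The record's CDS length (2295) and genomic position (scaffold1463:+ 37233-41368) can be cross-checked against the sequence itself. The following is a minimal sketch, not part of the original record: `check_cds` and `parse_position` are hypothetical helper names, the coordinates are assumed to be 1-based and inclusive, and the standard stop codons are assumed. Under those assumptions the genomic span is 4136 bp, which is longer than the 2295 bp CDS and would be consistent with a multi-exon gene model.

```python
import re

# Stop codons of the standard genetic code (an assumption for this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(seq: str) -> dict:
    """Run basic structural checks on a coding sequence.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted directly from the 60-column block above.
    """
    seq = re.sub(r"\s+", "", seq).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


def parse_position(text: str):
    """Parse a position string such as 'scaffold1463:+ 37233-41368'.

    Returns (scaffold, strand, start, end, span), where span assumes
    1-based inclusive coordinates.
    """
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised position: {text!r}")
    start, end = int(m.group(3)), int(m.group(4))
    return m.group(1), m.group(2), start, end, end - start + 1


if __name__ == "__main__":
    print(parse_position("scaffold1463:+ 37233-41368"))
    # Last line of the nucleotide block above, used as a short example.
    print(check_cds("GAGAGAGTGCATTAA"))
```

Passing the full nucleotide block to `check_cds` should report a length equal to the CDS Length field if the record is internally consistent.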

Protein sequence:

MNSRTGAVDVDICKNLDKNEQIAILKFCETLSEMEATIYNVECAIKAAWSYQNENPDNQG
TETSSDSSKQFSSFISDPQEVKRTPILQFKSSITVLPSVVTEASLSTFDDCSESFDFKNS
FGREPTISSYSKDIKMEAHYKQNEDSIESNQIYSITTEPVTKQAGQDPKESDHSCQPDCS
CTKCTNQGPSCGDPGCTKELCDDKSDDSKKKEKDETLPEWAWQCRKNFCHKVFRPATLLR
HYTSFSRCTLLCMGPQLWPYPIGYTHFSKTIVSVSTKNLEYKFQSVPSDSVHLYLAEAFK
LFIKDLARLEKLQTKPTNHSKATVKKMVILIDVESDSDPRLRINTDEGYMLKVETKNNQV
IIKVSGLSFCGARHGFETLSQLILLDQSTGYLIMLSSAIIKDAPTYKYRGLMVDTGRNYI
PVVDLLRTVDAMSTCKLNTFHWRISDATSFPMSLSKIPELEEYGPYDRSMVYTKKDIRMI
VNRAGIRGIRVLIEIAAPGPVGRPFSWLSSTTCSRKNNSLTCDNDLCRRLTMHDSTFDVL
QKIYSEILEMTNVDDVFHLSDSVFSMTNCYYLFDDREGFLDKALFRLKMANKGFLPQLPI
IWYTSHLMKHFEAKTWERLGVQIDEWDANPYESYLNKFRVIHSTKWDLSCEMRKQRCIRY
RTWQQMYLWKSWRNVNVFTTEGGESILWTDLVDSSNLDYHLWPRAAVVAERLWSDVVANG
SANKYVYMRLDTHRWRMMQRGIQVQPIWPPWCSFSPSSCLERVH
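
The protein sequence above can be reproduced from the nucleotide sequence by conceptual translation. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses a hypothetical `translate` helper written for illustration; it is not the pipeline that produced this record. The two short inputs are the first 30 and last 15 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace is ignored; codons containing ambiguous bases become 'X'.
    With to_stop=True, translation ends at the first stop codon and the
    stop itself is not included in the output.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAATTCGAGAACCGGCGCAGTGGACGTC"))  # start of the CDS
    print(translate("GAGAGAGTGCATTAA"))                 # end of the CDS
```

A 2295-base CDS corresponds to 765 codons, that is 764 residues plus the terminal stop codon, which matches the length of the protein sequence listed above.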