DPGLEAN20684 in OGS1.0

New model in OGS2.0  DPOGS205116
Genomic Position  scaffold1001:+ 782-5177
CDS Length  1194
Paired RNAseq reads  2969
Single RNAseq reads  8619
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005899 (5e-169)
Best Drosophila hit  hexosaminidase 1, isoform E (5e-125)
Best Human hit  beta-hexosaminidase subunit beta preproprotein (3e-42)
Best NR hit (blastp)  beta-N-acetylglucosaminidase [Choristoneura fumiferana] (0.0)
Best NR hit (blastx)  beta-N-acetylglucosaminidase [Choristoneura fumiferana] (0.0)
GeneOntology terms
GO:0004563 beta-N-acetylhexosaminidase activity
GO:0005575 cellular_component
GO:0006032 chitin catabolic process
InterPro families
IPR017853 Glycoside hydrolase, superfamily
IPR001540 Glycoside hydrolase, family 20
IPR015883 Glycoside hydrolase, family 20, catalytic core
IPR013781 Glycoside hydrolase, subgroup, catalytic core
Orthology group  MCL16250

Nucleotide sequence:

ATGGTCCGTGACGTCACAATCGATGATAAACCCGTGTACCCTTACAGAGGAGTGTTGTTG
GACACCGCGAGGAACTATTTCTCTATCGACTCCATCAAAGAGACTATCGAGGCCATGAGT
AGCGTGAAACTGAACACTTTCCACTGGCACATCACAGACAGCCAAAGTTTCCCCTTCGTA
TCCAAGAGACGGCCAGAACTCACTAAATACGGAGCTTACAGTCCCAGTAAAATCTACACT
GAAGAGATGATCCGTGATGTGGTGGAGTTCGCTCGTGTCCGCGGAGTCCGAGTGCTGCCC
GAGTTTGACGCTCCAGCACACGTGGGCGAGGGCTGGCAGGAGACAGACCTCACTGTTTGC
TTCAAGGCTGAACCTTGGGCGTCGTACTGCGTGGAACCTCCGTGCGGTCAATTGAACCCT
ACCAAGGAGGAGCTGTACGATGTTCTACAAGACATCTACACGGATATGGCCGATGTTTTC
CCGTCGGACCTCTTCCACATGGGTGGAGACGAGGTGTCGGAGCGCTGCTGGAACTCGTCG
CGCCAGGTGCAGCAGTTTATGGAGGAGAACCGCTGGGGACTGGACAAGGCCAGCTATTTA
CAACTGTGGAACTACTTCCAGAATAAAGCCCAAGATAGGGTGTACAAGGCATTTGGTAAA
AGGATCCCACTGATTCTATGGACCAGCACGCTAACTGATTACAGTCACGTCGACAAGTTC
TTAAACAAAGACGATTACATTATTCAAGTGTGGACTACTGGCGAAGACCCTCAAATATCA
GGTCTCCTGCAGAAGGGTTATCGTCTCATCATGTCCAACTACGACGCCCTGTATTTCGAC
TGTGGTTTCGGTGCTTGGGTTGGAACTGGCAACAACTGGTGCTCTCCGTACATCGGATGG
CAGAAAGTTTATGAAAATAGTCCTAAACAGATGGCGAGAGACCACCAAGATCAAATCCTA
GGTGGTGAAGCAGCGCTGTGGTCTGAGCAGTCTGACTCAGCGACCCTGGACAGTCGCCTG
TGGCCGCGGGCCGCCGCCCTCGCTGAGAGGTTGTGGGCGGAGCCCGCGACCAGCTGGAGG
GAGGCCGAGCGGCGGATGTTGAACGTACGCGAGCGTCTCGTCCGTAAAGGCATCAAAGCG
GAGTCCCTGGAGCCCGAGTGGTGCTATCAGAACGACGGCTACTGCTACGCCTGA

Protein sequence:

MVRDVTIDDKPVYPYRGVLLDTARNYFSIDSIKETIEAMSSVKLNTFHWHITDSQSFPFV
SKRRPELTKYGAYSPSKIYTEEMIRDVVEFARVRGVRVLPEFDAPAHVGEGWQETDLTVC
FKAEPWASYCVEPPCGQLNPTKEELYDVLQDIYTDMADVFPSDLFHMGGDEVSERCWNSS
RQVQQFMEENRWGLDKASYLQLWNYFQNKAQDRVYKAFGKRIPLILWTSTLTDYSHVDKF
LNKDDYIIQVWTTGEDPQISGLLQKGYRLIMSNYDALYFDCGFGAWVGTGNNWCSPYIGW
QKVYENSPKQMARDHQDQILGGEAALWSEQSDSATLDSRLWPRAAALAERLWAEPATSWR
EAERRMLNVRERLVRKGIKAESLEPEWCYQNDGYCYA
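
The nucleotide and protein sequences above can be cross-checked by translating the CDS in frame. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the codon-table construction are illustrative and not part of the record. The example input is the first 60-nt line of the nucleotide sequence.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumption: the standard (NCBI table 1) code applies; `translate` is a
# hypothetical helper name, not something provided by the database.

BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the nucleotide sequence in this record.
first_line = "ATGGTCCGTGACGTCACAATCGATGATAAACCCGTGTACCCTTACAGAGGAGTGTTGTTG"
print(translate(first_line))  # MVRDVTIDDKPVYPYRGVLL
```

Applied to the full sequence, a CDS length of 1194 nt corresponds to 398 codons, that is 397 residues plus the terminal TGA stop, which agrees with the length of the protein sequence listed here.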