DPGLEAN19369 in OGS1.0

New model in OGS2.0DPOGS210840 
Genomic Positionscaffold77:- 56016-60931
See gene structure
CDS Length1215
Paired RNAseq reads  383
Single RNAseq reads  1022
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003916 (1e-107)
Best Drosophila hit  CG7985 (3e-44)
Best Human hithexosaminidase D (2e-45)
Best NR hit (blastp)  PREDICTED: similar to hexosaminidase (glycosyl hydrolase family 20, catalytic domain) containing [Nasonia vitripennis] (7e-90)
Best NR hit (blastx)  PREDICTED: similar to hexosaminidase (glycosyl hydrolase family 20, catalytic domain) containing [Nasonia vitripennis] (4e-86)
GeneOntology terms




  
GO:0043169 cation binding
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004563 beta-N-acetylhexosaminidase activity
GO:0005975 carbohydrate metabolic process
GO:0005634 nucleus
GO:0005737 cytoplasm
InterPro families

  
IPR017853 Glycoside hydrolase, superfamily
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR015883 Glycoside hydrolase, family 20, catalytic core
Orthology groupMCL18467

Nucleotide sequence:

ATGTATTCTATGGATGAAGTGCGACAAATACTACAGTTAGCTAGGAACTGTGGACTTGAA
GTCATTCAACTAATTCAAACTATTGGACACATGGAGTTTGTTCTGAAGCACCCTTTGTTC
CAAGATCTCAGGGAATTGCCATATTCTCCGGCTGTTTTGTGTCCATCACAGCACCGTTCT
CAATTACTAGTGAGAGAGATGTTGAGGCAGGTTTTGGAGGTACAGCCGGATGCTAGATAT
ATACACATTGGGGCAGATGAGGTTTGGCACAGAGGGGAATGTGAACTTTGTAAATATAAA
GCATCAACGAACGAACACAAATTACACTCAATTTATTTAGAACACATACGAGATTTAGCC
TTATTTATAAAGCAGTTGAGACCGGATTTGATTGTTCTCATGTGGGATGACATGCTGCGG
TCTATAAGTGTAGATGTATTGAAAAATTACAGCCTGGGTGAGTTAGTTCAGCCAGTGGTG
TGGAACTACAGTCCGCTGCATTTGTTCCATGTTGAAGTGCAATTATGGACATGTTACAGT
CAGGTGTTCCCAAGTGTTTGGGCTGCTTCAGCTTACAAAGGAGCCAGCGGAAGTTGTGAG
ATCTGGCCGGTGGTATCCCGTTACGCCAGCAACCAACAAGCCTGGTTGAAGACAGTCAAA
GAGTATTCCTCGGCTGTTAACTTTGTTGGAGTCGTCCTTACTGGTTGGTCGAGGTTCGAT
CATTACGCCACTCTATGTGAACTGTTGCCGCCATCTTTGCCAAGTTTGTCTATCTGTCTG
AAGATGTGGATGACTATGGACGAATGTTTTGTTTCAGACAACTCGGAGTCGTTGCCGCTG
GAGGAGTGGCCGGGAGTAGAACTCGCACTCAGCATACGAAACTTCGCTTCGTTGAGGGAA
CGCGCGCATAACGTCATGTACAGAGAGCTCGTTCCCACGTGGCTGAACCCCTGGCAGCTG
CAGCACGCGTACACCAGCCCCATACAACTACGTGGCATCGTGGCTACTATGACGCAAATA
ATAGCGGATATAAAGGCGATACATAGCGAACTTCTAACGCAATTTCCTTTATATACGGGG
GAGAGGAGTGCTCAGGAGTGGCTCGGCTCTCTGGTGACGCCTTTGTTGAGGAAGGTTACG
GAGGTACACGACGTAGCTGCTATAAGGACGGACATGCAGGCCGGGGTCACACCGGGGATG
ACAGCCACTCGTTAA

Protein sequence:

MYSMDEVRQILQLARNCGLEVIQLIQTIGHMEFVLKHPLFQDLRELPYSPAVLCPSQHRS
QLLVREMLRQVLEVQPDARYIHIGADEVWHRGECELCKYKASTNEHKLHSIYLEHIRDLA
LFIKQLRPDLIVLMWDDMLRSISVDVLKNYSLGELVQPVVWNYSPLHLFHVEVQLWTCYS
QVFPSVWAASAYKGASGSCEIWPVVSRYASNQQAWLKTVKEYSSAVNFVGVVLTGWSRFD
HYATLCELLPPSLPSLSICLKMWMTMDECFVSDNSESLPLEEWPGVELALSIRNFASLRE
RAHNVMYRELVPTWLNPWQLQHAYTSPIQLRGIVATMTQIIADIKAIHSELLTQFPLYTG
ERSAQEWLGSLVTPLLRKVTEVHDVAAIRTDMQAGVTPGMTATR