New model in OGS2.0 | DPOGS213576  |
---|---|
Genomic Position | scaffold398:+ 41653-49854 |
See gene structure | |
CDS Length | 1824 |
Paired RNAseq reads   | 1005 |
Single RNAseq reads   | 2532 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011646 (0.0) |
Best Drosophila hit   | hexosaminidase 2 (4e-126) |
Best Human hit | beta-hexosaminidase subunit alpha preproprotein (2e-40) |
Best NR hit (blastp)   | beta-N-acetylglucosaminidase 1 [Bombyx mori] (0.0) |
Best NR hit (blastx)   | beta-N-acetylglucosaminidase 1 [Bombyx mori] (0.0) |
GeneOntology terms    | GO:0004563 beta-N-acetylhexosaminidase activity GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0016231 beta-N-acetylglucosaminidase activity GO:0005886 plasma membrane |
InterPro families    | IPR017853 Glycoside hydrolase, superfamily IPR001540 Glycoside hydrolase, family 20 IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR015883 Glycoside hydrolase, family 20, catalytic core IPR015882 Acetylhexosaminidase, subunit a/b |
Orthology group | MCL17254 |
Nucleotide sequence:
ATGTGTCGTCTGCTGGTGCTGGCGATAGTATTGGTGCATTTAGTCAACTGTCAGGACAAA
GAAAATGACATAGAGGAATCAACACCGCTTGTCTATGAACCATCCTGGACATATGAATGT
GTGCCGGATGAGGGCTGTCAACGCTCCGATTTTCCGCAGCCGACCCTGAACAGGAGTATG
ATCTTCGATAGCATCGACCTATGCAGGACCGTCTGCGGCCGCTTCGGAGGAATCTGGCCC
AAACCGGTCACGGCGGCCTTGAGCATGCAGACTATTAAGATACATCCGAATTACTTGAGA
TTCGATCTTAGCAACGCGCCGGCAGAAACCAGAAAGATATTGGCCGAGATGTCTCAAGTG
GCCACTCAGAATATCATAAGTGAATGCGAGGGTAATGTCACGGAGGTCGTTGAGATGCCT
GTCATTGTACACATCACGGTCAAAACAGACAACATGAACCTAACCTGGCAGACTGACGAA
CAATACCGCTTGGATGTCCAGAGCAAAGATACCAGCGTCGTGGTCCAAGTTATCGCGGAA
ACGGTTTTCGGAGCACGCCATGGTTTAGAGACTTTGACGCACCTGATTTCGGCTGATAAG
CCAGATTTATCGGAACAATCTAAATGTGGGCTCCGTATGGTAGCGGGTGCCAAAATATGG
GATAAACCTGTGTACCCACATAGAGGATTTCTTCTGGACACGTCAAGAAACTTCATCCCT
ATGGACGATATCAAAAGAATGATCGATGGTTTGGCTACACTCAAGATGAACGTCTTCCAC
TGGCACGTAACAGATTCACACAGCTTCCCGTTGGAGTCAAGACGAGTGCCGCAGTTTACT
AAGTACGGTGCTTATTCCGCCTCAGAGATCTACAGTTCGGAGGAAGTACGTGGTCTGGTG
GAATACGCTCTCGTTCGCGGAGTCCGTATCTTAATTGAAATAGACTCACCAGCGCACGCC
GGCAATGGCTGGCAGTGGGGGAATGAGTACGGTTTGGGTGATCTGGCGGTTTGTGTGAAC
GAGAAACCCTGGCGGCAGCTTTGTATTCAGCCACCCTGCGGGCAACTGAACCCCGCCAAC
CCGGCCGTATATAGAGTATTGAGAGACTTGTACAGAGACATCGCTGAAACTCTCACCAAA
CCTCCTCTATTTCACATCGGCGGTGATGAGGTATTCTTTGAATGCTGGAACTCGTCAAAT
ACGATCCTAGAATACATGCAGACTAAAGGCTACAGCAGGAATGTTGAAGGTTTCATCAAT
TTATGGTCGGAATTTCATGAGAAGGCTCTCAATATTTGGGATGAAGAACTAGCAGCGATC
GGAGAAACGGAGAAGCAGCCAGTCTTAATCTGGTCTTCCGAACTCACACAAGCCCACAGA
ATACAAAAACATTTGGACAAAAAAAGATACACAATTGAAGTTTGGGAACCTCTGTCCAGT
CCTTTGTTAATTCAACTGATACGTCTGGGCTACAACGTTATATCAGTCCCCAAAGACGTG
TGGTATCTCGATCATGGGTTTTGGGGTCAGACGAAATACTCTAACTGGAGAAGAATGTAC
GCACACACACTACCAAGAGATCCAAATGTTTTGGGGGGTGAAGTCGCTATGTGGACTGAA
TATGTGGATAAAGAGGCTTTGGATCCGAGAGTGTTCCCTCGAGTGGCTAGCGTGGCGGAA
CGTCTTTGGTCGGATCCCACGACGGGAGCGAGTGGCGCTCAGCCTCGCCTGCAGCGTGTG
AGGACGAGACTCGTCCAACGAGGACTGAGAGCGGACGTGCTAGCTCCAGGCTGGTGCGCT
CAACACGACACGCGCTGTTTATAG
Protein sequence:
MCRLLVLAIVLVHLVNCQDKENDIEESTPLVYEPSWTYECVPDEGCQRSDFPQPTLNRSM
IFDSIDLCRTVCGRFGGIWPKPVTAALSMQTIKIHPNYLRFDLSNAPAETRKILAEMSQV
ATQNIISECEGNVTEVVEMPVIVHITVKTDNMNLTWQTDEQYRLDVQSKDTSVVVQVIAE
TVFGARHGLETLTHLISADKPDLSEQSKCGLRMVAGAKIWDKPVYPHRGFLLDTSRNFIP
MDDIKRMIDGLATLKMNVFHWHVTDSHSFPLESRRVPQFTKYGAYSASEIYSSEEVRGLV
EYALVRGVRILIEIDSPAHAGNGWQWGNEYGLGDLAVCVNEKPWRQLCIQPPCGQLNPAN
PAVYRVLRDLYRDIAETLTKPPLFHIGGDEVFFECWNSSNTILEYMQTKGYSRNVEGFIN
LWSEFHEKALNIWDEELAAIGETEKQPVLIWSSELTQAHRIQKHLDKKRYTIEVWEPLSS
PLLIQLIRLGYNVISVPKDVWYLDHGFWGQTKYSNWRRMYAHTLPRDPNVLGGEVAMWTE
YVDKEALDPRVFPRVASVAERLWSDPTTGASGAQPRLQRVRTRLVQRGLRADVLAPGWCA
QHDTRCL