New model in OGS2.0 | DPOGS214332  |
---|---|
Genomic Position | scaffold29:- 280593-282953 |
See gene structure | |
CDS Length | 1905 |
Paired RNAseq reads   | 969 |
Single RNAseq reads   | 2520 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003990 (0.0) |
Best Drosophila hit   | fused lobes, isoform B (4e-135) |
Best Human hit | beta-hexosaminidase subunit beta preproprotein (2e-54) |
Best NR hit (blastp)   | beta-N-acetylglucosaminidase [Spodoptera frugiperda] (0.0) |
Best NR hit (blastx)   | beta-N-acetylglucosaminidase [Spodoptera frugiperda] (0.0) |
GeneOntology terms    | GO:0004563 beta-N-acetylhexosaminidase activity GO:0007420 brain development GO:0043169 cation binding GO:0006491 N-glycan processing GO:0005770 late endosome GO:0032428 beta-N-acetylgalactosaminidase activity GO:0005794 Golgi apparatus GO:0016231 beta-N-acetylglucosaminidase activity GO:0006032 chitin catabolic process GO:0005886 plasma membrane GO:0005783 endoplasmic reticulum |
InterPro families    | IPR001540 Glycoside hydrolase, family 20 IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR017853 Glycoside hydrolase, superfamily IPR015883 Glycoside hydrolase, family 20, catalytic core IPR015882 Acetylhexosaminidase, subunit a/b |
Orthology group | MCL16823 |
Nucleotide sequence:
ATGAAGTCGTGGGGGGAAACGTTATGGCGGAGTGCCTCCTCGCACTTGTACCGAGTCGGC
AGGTTGCGGCGAGCTCTGCTGCTCTTGGCTGCCGCTGCCTGTACTGCTGCGGGCCTTCTC
TATTGGAGACAACAGACGGACGACTCTGCTAGACGACCGCTACACTTGTACGCTGGTGTA
GAGCCACAGTGGTCTTGGGTATGTCGGAACGATCGTTGTGAGCGACTCCTAGCATCAGAG
ACCTCTATACTTCAGTCACTTCCTACGTGCAACATGCTCTGCGCGTCCACTCAATTATGG
CCGCAGCCAACCGGCCCCGTCAGTCTCGCTACGGCCTCAGTTCATGTCAGATCCAGCGGC
TTCTCACTACAAGTCATATCCTCTCCATCAAGAGAAGTGACAGAAAACCTCAACGATGCC
TTCCAATTAATGCGCGACGACTTGAAAATTCTGGAGAAAAACGCGGGCGTAGAGAACAGG
AGATCAGATAGTGGAACTCCCCGTGAAGTTGTTGTAAGGGTCGCTGTGAACGGCAGCGCT
GATCCACGCATGCGACAAGACACCGATGAAACCTACAAGCTCTCTCTCAGACCGTCGGGG
AAGTCCCTCGTCGCTGATATAACAGCGCATTCCTTCTGTGGAGCTCGGCACGGCTTTGAA
ACTCTGTCCCAACTAGTGTGGTTGGATCCTTACGCTGAATCTCTCTTAATACTCGAAGCT
GCCACCGTGGACGACGGCCCTCGGTTTAGATATCGTGGTTTGTTATTGGATACAGCCAGG
AATTTCTTCCCCGTAACTGACATATTGCGTACAATCGATGCTATGGGAGCGTGCAAGCTG
AACACGTTCCATTGGCATGTGAGTGACTCGCAGTCCTTTCCTTTGAGACTGAACAGCGCT
CCTCAACTAGCTCAGCACGGAGCTTATGGGCCTGGTGCTATATACACGACTGACGATGTA
AGGGCTATAGTACGCCGAGCTAGATTGAGAGGAATACGTGTCTTGATAGAAGTAGATGCG
CCGGCGCATGTTGGACGAGCGTGGTCGTGGGGCCCTCCTGCTGGGTTAGGACACTTAGCG
CATTGTGTTGAAGTAGAACCTTGGAGTACTTATTGTGGTGAACCGCCTTGTGGGCAATTA
AACCCACGAAATCCACACGTTTACTCACTTCTTGAACAGATTTATGCCGAAATCATTCAA
CTGACCGAAGTGGACGATATCTTCCATTTAGGCGGGGACGAGGTCTCGGAGCGGTGTTGG
GCTCAACACTTTAACGACACGGATCCCATGGAGTTATGGTTTGAGTTCACTCGTCGCGCC
ATGTCCTCCCTCGAACGTGCCAATGGCGGTAAACTGCCAGATCTAACGTTACTGTGGTCT
TCTCGGCTAACTCATACACCGTACCTGGAACGTTTAGATAAGAAGAGACACGGCGTGCAG
GTGTGGGGCTCGTCCCGGTGGCCGGAATCTCGCGCGGTATTGGACGCGGGCTACAGAACG
ATCATATCTCACGTAGACGCTTGGTACTTAGACTGCGGCTTCGGGTCCTGGCGAGATAGT
TCCGACGGTCACTGTGGACCTTACCGGTCTTGGCAGCAAATTTACGAGCACAGACCCTGG
ATAGAGGAAATGCCGGCCATGTCTACTGGAGTCGAACCATGGCAAGTGGAAGGCGGCGCG
ACGTGTCAGTGGACGGAACAGCTGGGTTCCGGAGGTTTGGATGCTAGAGTGTGGCCGAGG
ACTGCGGCGGTCGCGGAGCGTCTCTGGTCGGACCGCGCCGAGGGCGCCACCGCCGACGTC
TACCTGCGACTCGACACACAACGATCACGACTCCTAGATAAAGGGATCCAAGCCGCTCCT
CTCTGGCCGCGGTGGTGCTCTCACAACCCTCACGCCTGCCTTTAG
Protein sequence:
MKSWGETLWRSASSHLYRVGRLRRALLLLAAAACTAAGLLYWRQQTDDSARRPLHLYAGV
EPQWSWVCRNDRCERLLASETSILQSLPTCNMLCASTQLWPQPTGPVSLATASVHVRSSG
FSLQVISSPSREVTENLNDAFQLMRDDLKILEKNAGVENRRSDSGTPREVVVRVAVNGSA
DPRMRQDTDETYKLSLRPSGKSLVADITAHSFCGARHGFETLSQLVWLDPYAESLLILEA
ATVDDGPRFRYRGLLLDTARNFFPVTDILRTIDAMGACKLNTFHWHVSDSQSFPLRLNSA
PQLAQHGAYGPGAIYTTDDVRAIVRRARLRGIRVLIEVDAPAHVGRAWSWGPPAGLGHLA
HCVEVEPWSTYCGEPPCGQLNPRNPHVYSLLEQIYAEIIQLTEVDDIFHLGGDEVSERCW
AQHFNDTDPMELWFEFTRRAMSSLERANGGKLPDLTLLWSSRLTHTPYLERLDKKRHGVQ
VWGSSRWPESRAVLDAGYRTIISHVDAWYLDCGFGSWRDSSDGHCGPYRSWQQIYEHRPW
IEEMPAMSTGVEPWQVEGGATCQWTEQLGSGGLDARVWPRTAAVAERLWSDRAEGATADV
YLRLDTQRSRLLDKGIQAAPLWPRWCSHNPHACL