New model in OGS2.0 | DPOGS210840  |
---|---|
Genomic Position | scaffold77:- 56016-60931 |
CDS Length | 1215 |
Paired RNAseq reads   | 383 |
Single RNAseq reads   | 1022 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003916 (1e-107) |
Best Drosophila hit   | CG7985 (3e-44) |
Best Human hit | hexosaminidase D (2e-45) |
Best NR hit (blastp)   | PREDICTED: similar to hexosaminidase (glycosyl hydrolase family 20, catalytic domain) containing [Nasonia vitripennis] (7e-90) |
Best NR hit (blastx)   | PREDICTED: similar to hexosaminidase (glycosyl hydrolase family 20, catalytic domain) containing [Nasonia vitripennis] (4e-86) |
GeneOntology terms    | GO:0043169 cation binding; GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; GO:0004563 beta-N-acetylhexosaminidase activity; GO:0005975 carbohydrate metabolic process; GO:0005634 nucleus; GO:0005737 cytoplasm |
InterPro families    | IPR017853 Glycoside hydrolase, superfamily; IPR013781 Glycoside hydrolase, subgroup, catalytic core; IPR015883 Glycoside hydrolase, family 20, catalytic core |
Orthology group | MCL18467 |
Nucleotide sequence:
ATGTATTCTATGGATGAAGTGCGACAAATACTACAGTTAGCTAGGAACTGTGGACTTGAA
GTCATTCAACTAATTCAAACTATTGGACACATGGAGTTTGTTCTGAAGCACCCTTTGTTC
CAAGATCTCAGGGAATTGCCATATTCTCCGGCTGTTTTGTGTCCATCACAGCACCGTTCT
CAATTACTAGTGAGAGAGATGTTGAGGCAGGTTTTGGAGGTACAGCCGGATGCTAGATAT
ATACACATTGGGGCAGATGAGGTTTGGCACAGAGGGGAATGTGAACTTTGTAAATATAAA
GCATCAACGAACGAACACAAATTACACTCAATTTATTTAGAACACATACGAGATTTAGCC
TTATTTATAAAGCAGTTGAGACCGGATTTGATTGTTCTCATGTGGGATGACATGCTGCGG
TCTATAAGTGTAGATGTATTGAAAAATTACAGCCTGGGTGAGTTAGTTCAGCCAGTGGTG
TGGAACTACAGTCCGCTGCATTTGTTCCATGTTGAAGTGCAATTATGGACATGTTACAGT
CAGGTGTTCCCAAGTGTTTGGGCTGCTTCAGCTTACAAAGGAGCCAGCGGAAGTTGTGAG
ATCTGGCCGGTGGTATCCCGTTACGCCAGCAACCAACAAGCCTGGTTGAAGACAGTCAAA
GAGTATTCCTCGGCTGTTAACTTTGTTGGAGTCGTCCTTACTGGTTGGTCGAGGTTCGAT
CATTACGCCACTCTATGTGAACTGTTGCCGCCATCTTTGCCAAGTTTGTCTATCTGTCTG
AAGATGTGGATGACTATGGACGAATGTTTTGTTTCAGACAACTCGGAGTCGTTGCCGCTG
GAGGAGTGGCCGGGAGTAGAACTCGCACTCAGCATACGAAACTTCGCTTCGTTGAGGGAA
CGCGCGCATAACGTCATGTACAGAGAGCTCGTTCCCACGTGGCTGAACCCCTGGCAGCTG
CAGCACGCGTACACCAGCCCCATACAACTACGTGGCATCGTGGCTACTATGACGCAAATA
ATAGCGGATATAAAGGCGATACATAGCGAACTTCTAACGCAATTTCCTTTATATACGGGG
GAGAGGAGTGCTCAGGAGTGGCTCGGCTCTCTGGTGACGCCTTTGTTGAGGAAGGTTACG
GAGGTACACGACGTAGCTGCTATAAGGACGGACATGCAGGCCGGGGTCACACCGGGGATG
ACAGCCACTCGTTAA
Protein sequence:
MYSMDEVRQILQLARNCGLEVIQLIQTIGHMEFVLKHPLFQDLRELPYSPAVLCPSQHRS
QLLVREMLRQVLEVQPDARYIHIGADEVWHRGECELCKYKASTNEHKLHSIYLEHIRDLA
LFIKQLRPDLIVLMWDDMLRSISVDVLKNYSLGELVQPVVWNYSPLHLFHVEVQLWTCYS
QVFPSVWAASAYKGASGSCEIWPVVSRYASNQQAWLKTVKEYSSAVNFVGVVLTGWSRFD
HYATLCELLPPSLPSLSICLKMWMTMDECFVSDNSESLPLEEWPGVELALSIRNFASLRE
RAHNVMYRELVPTWLNPWQLQHAYTSPIQLRGIVATMTQIIADIKAIHSELLTQFPLYTG
ERSAQEWLGSLVTPLLRKVTEVHDVAAIRTDMQAGVTPGMTATR
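
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the CDS uses the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the record or of any database tooling.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in the conventional T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids; '*' marks a stop codon.

    Whitespace and line breaks are ignored, so the sequence block can be
    pasted in as shown in the record. Assumes only A, C, G, T are present.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTATTCTATGGATGAAGTGCGACAAATACTACAGTTAGCTAGGAACTGTGGACTTGAA"
last_line = "ACAGCCACTCGTTAA"

print(translate(first_line))  # MYSMDEVRQILQLARNCGLE
print(translate(last_line))   # TATR*
```

The listed CDS length of 1215 corresponds to 405 codons, that is, 404 residues plus the terminal stop codon, which agrees with the length of the protein sequence shown.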