New model in OGS2.0 | DPOGS212760  |
---|---|
Genomic Position | scaffold3:+ 417651-421425 |
CDS Length | 1617 |
Paired RNAseq reads   | 565 |
Single RNAseq reads   | 1576 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013129 (1e-84) |
Best Drosophila hit   | CG31414 (1e-70) |
Best Human hit | glucosylceramidase isoform 1 precursor (5e-89) |
Best NR hit (blastp)   | PREDICTED: similar to glucocerebrosidase precursor isoform 1 [Apis mellifera] (1e-102) |
Best NR hit (blastx)   | PREDICTED: similar to glucocerebrosidase precursor [Apis mellifera] (3e-101) |
GeneOntology terms    | GO:0005765 lysosomal membrane; GO:0016020 membrane; GO:0007040 lysosome organization; GO:0005975 carbohydrate metabolic process; GO:0006629 lipid metabolic process; GO:0016798 hydrolase activity, acting on glycosyl bonds; GO:0006665 sphingolipid metabolic process; GO:0005515 protein binding; GO:0043169 cation binding; GO:0004348 glucosylceramidase activity |
InterPro families    | IPR017853 Glycoside hydrolase, superfamily; IPR001139 Glycoside hydrolase, family 30; IPR013781 Glycoside hydrolase, subgroup, catalytic core; IPR013780 Glycosyl hydrolase, family 13, all-beta |
Orthology group | MCL10212 |
Nucleotide sequence:
ATGAGAACGCTAATCTTGCTTGGAGGCCTACTTACTGTTGGAGTTTACGCGGATGTATCA
TGCGCACCGAGATCCCTGAACAACTCAGTGGTGTGTGTCTGTAACGCCACTTACTGTGAC
ACCGTGACGAGGAGGGTGCCTGAAGCTGGCACCTACATAGCTTACACTTCCTCCAAGTCA
GGATTACGATTCAGCATCACTCAGGGAGACATTGAAGACGCAGATACATCTTATAGCAGT
GCTGATTATGGAAAGGTTTTCGACTTACAACCAAGCAAAGTTTATCAAGCTATTGAAGGA
TTCGGAGGAGCAGCGACTGACGCAGCGGGAATAAATTGGAAGAAAATGAAGCCGGCTATT
CAGGACACTCTAGTGAAGTCCTACTTTAGTGAAGACGGCTTAGAATATAATATAATCAGA
CTCCCGATTGGTTCTACTGATTTCTCGACACGTTTTTATGCTTATAACCAATATCCGAAA
GACGACACAGCTCTCAGCAACTTTACTTTCGCTCCAGAAGATATAAAGTATAAGGTTCCT
TTAGTGAAGTCCTGTTTGAGTGCTGCAAATAATGAAGTAAAAATAGTGTCTGCGACATGG
TCACCACCAAAATGGATGAAAGTAAAGGAGCCACAAAGCGGTATTAGTTTTATTAAAGAG
GAATTTTACCAAGTTTACTCCGATTACCACTGCAAATTTGCAGAGCTGTTTGAAGAAGAG
GGTATTCATATTTGGGGAATATCCACTGGAAATGAGCCTTTAGTGAATATGTTCGCTGGT
GTGAGAAAAGACGAAACAGCCTGGAATGCACCCAGTTTTGCGAAATTCATAAGGGAATAC
TTCGGCCCGACCATAAAAAACTGTTCAGTAAAGGACATGAAAATTCTAGCTATAGAAGAT
CAACGTTACGCTTTGCCCCTTTTCTTTACAAAATTACAATCCGATACTGAAGCTATGTCT
TATGTAGACGGAATATCCTTACATTTTTATGGTGACAAGAATACTCCGGCTTCGACAATT
CCAAGAGTTCTCAAGGAATTTCCCGACAAATTCGTTTTATACACGGAAGCCTGTAATGGT
CCCCAAAGTCCTAAAGATGAGAAAGTCGTTTTGGGTTCCTGGGACAGAGCCAAGACTTAT
TTTACGAACATACTTGAGAATCTTAATTACAATGTGGTTGGATGGCTCGACTGGAATTTG
TTCTTAGACACAGAAGGTGGCCCGACTTGGACAAAAAATTTTGTTGACTCTTCGATAATT
GTCGATTACGATAAACAAGAATTCTACAAACAACCTACATACTATGCAATAGGTCATTTT
TCAAAATTTGTTCCCAGAGGCTCTCAAAGGATTAAGGTTAAAACTATTTTACCTGTAACC
AACTATGGTTTGGATATAATTGACTTTACGACAGTGGAGGCAAGCTTTTTCGACAACGTA
GCCTTTATCACTCCTAAAGGCACTATCGTCGTAATCATACACAACGAGGGGGCAGAACAA
AACTGTGCAATACAATTAGGTGATTCGCAAGCCACTGTACTCTTAGAAGCTGAATCCATA
ACTACGGTCGAGATACCATACGACGGTAAATCACTCGGAACACCGTGCAGCCAATGA
Protein sequence:
MRTLILLGGLLTVGVYADVSCAPRSLNNSVVCVCNATYCDTVTRRVPEAGTYIAYTSSKS
GLRFSITQGDIEDADTSYSSADYGKVFDLQPSKVYQAIEGFGGAATDAAGINWKKMKPAI
QDTLVKSYFSEDGLEYNIIRLPIGSTDFSTRFYAYNQYPKDDTALSNFTFAPEDIKYKVP
LVKSCLSAANNEVKIVSATWSPPKWMKVKEPQSGISFIKEEFYQVYSDYHCKFAELFEEE
GIHIWGISTGNEPLVNMFAGVRKDETAWNAPSFAKFIREYFGPTIKNCSVKDMKILAIED
QRYALPLFFTKLQSDTEAMSYVDGISLHFYGDKNTPASTIPRVLKEFPDKFVLYTEACNG
PQSPKDEKVVLGSWDRAKTYFTNILENLNYNVVGWLDWNLFLDTEGGPTWTKNFVDSSII
VDYDKQEFYKQPTYYAIGHFSKFVPRGSQRIKVKTILPVTNYGLDIIDFTTVEASFFDNV
AFITPKGTIVVIIHNEGAEQNCAIQLGDSQATVLLEAESITTVEIPYDGKSLGTPCSQ
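
Consistency check (illustrative sketch, not part of the original record): the listed CDS length of 1617 nt corresponds to 538 codons plus a stop codon, which matches the 538-residue protein above. The Python below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced for this example. It translates only the first and last lines of the nucleotide sequence, so it stays short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAGAACGCTAATCTTGCTTGGAGGCCTACTTACTGTTGGAGTTTACGCGGATGTATCA"
last_line = "ACTACGGTCGAGATACCATACGACGGTAAATCACTCGGAACACCGTGCAGCCAATGA"

print(translate(first_line))  # MRTLILLGGLLTVGVYADVS
print(translate(last_line))   # TTVEIPYDGKSLGTPCSQ*

# Length bookkeeping: 538 residues plus one stop codon, three bases each.
print(3 * (538 + 1))          # 1617
```

The two translated fragments agree with the start and end of the protein sequence, and the trailing `*` corresponds to the terminal TGA codon. Translating the full 1617-nt sequence the same way would be the complete check.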