DPGLEAN08668 in OGS1.0

New model in OGS2.0DPOGS212760 
Genomic Positionscaffold3:+ 417651-421425
See gene structure
CDS Length1617
Paired RNAseq reads  565
Single RNAseq reads  1576
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013129 (1e-84)
Best Drosophila hit  CG31414 (1e-70)
Best Human hitglucosylceramidase isoform 1 precursor (5e-89)
Best NR hit (blastp)  PREDICTED: similar to glucocerebrosidase precursor isoform 1 [Apis mellifera] (1e-102)
Best NR hit (blastx)  PREDICTED: similar to glucocerebrosidase precursor [Apis mellifera] (3e-101)
GeneOntology terms








  
GO:0005765 lysosomal membrane
GO:0016020 membrane
GO:0007040 lysosome organization
GO:0005975 carbohydrate metabolic process
GO:0006629 lipid metabolic process
GO:0016798 hydrolase activity, acting on glycosyl bonds
GO:0006665 sphingolipid metabolic process
GO:0005515 protein binding
GO:0043169 cation binding
GO:0004348 glucosylceramidase activity
InterPro families


  
IPR017853 Glycoside hydrolase, superfamily
IPR001139 Glycoside hydrolase, family 30
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR013780 Glycosyl hydrolase, family 13, all-beta
Orthology groupMCL10212

Nucleotide sequence:

ATGAGAACGCTAATCTTGCTTGGAGGCCTACTTACTGTTGGAGTTTACGCGGATGTATCA
TGCGCACCGAGATCCCTGAACAACTCAGTGGTGTGTGTCTGTAACGCCACTTACTGTGAC
ACCGTGACGAGGAGGGTGCCTGAAGCTGGCACCTACATAGCTTACACTTCCTCCAAGTCA
GGATTACGATTCAGCATCACTCAGGGAGACATTGAAGACGCAGATACATCTTATAGCAGT
GCTGATTATGGAAAGGTTTTCGACTTACAACCAAGCAAAGTTTATCAAGCTATTGAAGGA
TTCGGAGGAGCAGCGACTGACGCAGCGGGAATAAATTGGAAGAAAATGAAGCCGGCTATT
CAGGACACTCTAGTGAAGTCCTACTTTAGTGAAGACGGCTTAGAATATAATATAATCAGA
CTCCCGATTGGTTCTACTGATTTCTCGACACGTTTTTATGCTTATAACCAATATCCGAAA
GACGACACAGCTCTCAGCAACTTTACTTTCGCTCCAGAAGATATAAAGTATAAGGTTCCT
TTAGTGAAGTCCTGTTTGAGTGCTGCAAATAATGAAGTAAAAATAGTGTCTGCGACATGG
TCACCACCAAAATGGATGAAAGTAAAGGAGCCACAAAGCGGTATTAGTTTTATTAAAGAG
GAATTTTACCAAGTTTACTCCGATTACCACTGCAAATTTGCAGAGCTGTTTGAAGAAGAG
GGTATTCATATTTGGGGAATATCCACTGGAAATGAGCCTTTAGTGAATATGTTCGCTGGT
GTGAGAAAAGACGAAACAGCCTGGAATGCACCCAGTTTTGCGAAATTCATAAGGGAATAC
TTCGGCCCGACCATAAAAAACTGTTCAGTAAAGGACATGAAAATTCTAGCTATAGAAGAT
CAACGTTACGCTTTGCCCCTTTTCTTTACAAAATTACAATCCGATACTGAAGCTATGTCT
TATGTAGACGGAATATCCTTACATTTTTATGGTGACAAGAATACTCCGGCTTCGACAATT
CCAAGAGTTCTCAAGGAATTTCCCGACAAATTCGTTTTATACACGGAAGCCTGTAATGGT
CCCCAAAGTCCTAAAGATGAGAAAGTCGTTTTGGGTTCCTGGGACAGAGCCAAGACTTAT
TTTACGAACATACTTGAGAATCTTAATTACAATGTGGTTGGATGGCTCGACTGGAATTTG
TTCTTAGACACAGAAGGTGGCCCGACTTGGACAAAAAATTTTGTTGACTCTTCGATAATT
GTCGATTACGATAAACAAGAATTCTACAAACAACCTACATACTATGCAATAGGTCATTTT
TCAAAATTTGTTCCCAGAGGCTCTCAAAGGATTAAGGTTAAAACTATTTTACCTGTAACC
AACTATGGTTTGGATATAATTGACTTTACGACAGTGGAGGCAAGCTTTTTCGACAACGTA
GCCTTTATCACTCCTAAAGGCACTATCGTCGTAATCATACACAACGAGGGGGCAGAACAA
AACTGTGCAATACAATTAGGTGATTCGCAAGCCACTGTACTCTTAGAAGCTGAATCCATA
ACTACGGTCGAGATACCATACGACGGTAAATCACTCGGAACACCGTGCAGCCAATGA

Protein sequence:

MRTLILLGGLLTVGVYADVSCAPRSLNNSVVCVCNATYCDTVTRRVPEAGTYIAYTSSKS
GLRFSITQGDIEDADTSYSSADYGKVFDLQPSKVYQAIEGFGGAATDAAGINWKKMKPAI
QDTLVKSYFSEDGLEYNIIRLPIGSTDFSTRFYAYNQYPKDDTALSNFTFAPEDIKYKVP
LVKSCLSAANNEVKIVSATWSPPKWMKVKEPQSGISFIKEEFYQVYSDYHCKFAELFEEE
GIHIWGISTGNEPLVNMFAGVRKDETAWNAPSFAKFIREYFGPTIKNCSVKDMKILAIED
QRYALPLFFTKLQSDTEAMSYVDGISLHFYGDKNTPASTIPRVLKEFPDKFVLYTEACNG
PQSPKDEKVVLGSWDRAKTYFTNILENLNYNVVGWLDWNLFLDTEGGPTWTKNFVDSSII
VDYDKQEFYKQPTYYAIGHFSKFVPRGSQRIKVKTILPVTNYGLDIIDFTTVEASFFDNV
AFITPKGTIVVIIHNEGAEQNCAIQLGDSQATVLLEAESITTVEIPYDGKSLGTPCSQ