New model in OGS2.0 | DPOGS212721  |
---|---|
Genomic Position | scaffold3:30141-41004 (- strand) |
CDS Length | 5142 |
Paired RNAseq reads   | 106 |
Single RNAseq reads   | 359 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013129 (1e-59) |
Best Drosophila hit   | CG31414 (1e-72) |
Best Human hit | glucosylceramidase isoform 1 precursor (2e-64) |
Best NR hit (blastp)   | PREDICTED: similar to putative lysosomal glucocerebrosidase [Tribolium castaneum] (2e-88) |
Best NR hit (blastx)   | PREDICTED: similar to putative lysosomal glucocerebrosidase [Tribolium castaneum] (6e-87) |
GeneOntology terms | GO:0004348 glucosylceramidase activity; GO:0005975 carbohydrate metabolic process; GO:0005764 lysosome; GO:0006665 sphingolipid metabolic process; GO:0007040 lysosome organization; GO:0043169 cation binding |
InterPro families | IPR017853 Glycoside hydrolase, superfamily; IPR001139 Glycoside hydrolase, family 30; IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | ND |
Nucleotide sequence:
ATGATGTGGTTCCTTTTGGTAGCGTGTGTGTTGCAGATTCATGGCATATCCTCACAGCAT
TTAAAATCTGACGATTGCTGTCCTTGCTCTCCAGAAAATAGTTTCTCCAGTAGTAACGTT
TTGAACAAAGATCATGTCTCTGGGAATTGTCCTTGTAGAGATGCGGATTCTGAACCCGCT
TCATCTATATTTAGACCAAGTTTTAAAACAGCTACCAATGCCAGGCAAGTCGATGGAAAG
GAGGAGCCAAAGCCACTTTTAAATCCGGAAGTCGGCCTCGCGTCTTCAGTTTTAGAAACT
TTACGTGAAGTTTCTGACCAAGAATACCAAGATGCTCTGGCAAGAGATGTTGCGAGGAGT
GCTTTAAGATCAATGAACACAAATGCCGAGGCAGGTGAAACTGGCGATGAAAGCAAAAAT
GTTGTTAAGATTATATTAGGTCCTAGACATAACACAGATATGGACACCATATCTGAACCG
TCTCACAACCACGAAACCCGCTGCATACACCCATCGCCAGGACAAGAAAACGTATTTGAA
CAATACCAAGAACAGCCACAAAGAAAATTATATAAGCACTTCATCCTGAGTAAGCCAGTT
GTAGGTTTACAAAAAAATATTTTTGACATGGACAATAGCCCCAGCGACATTCAAGAAAGC
GTAGTGACAGTTAGACCCAGTGTATTAGATAATATAAAAATGAAGACAGATCTATTGAAA
TCAAAGTTATTTTTAAACAATGAGGGATTAACACAAAGAAACGAACCTAAAATATCTTTT
GATAAGGTATTTTATAAGCCAGTCATAAAATTGCCCGACTTCAATTTGGCTTCTCCCTTG
GAGCATTTTAAGAAATCAAAGTTATTTGATTCATACAAGACCCCCTCACCTGCCGTGATG
ACAAAAGACAATTTTTTATATAAAAGCGTCAACATAAATGGCAACAGTGATCTTATATCA
CCGCAGCAACAATCGCAAAGTAGAAAATTGGATGGCAATGAAAAATTATTACCACAATTT
AATTCACTTGCATCCAAAGAAGCTCGCAAATCTATAAAAAGTACGGATTCTTTTATTAGT
GATGATTTTAATACGGATAATACAATATTGAAACCAGTATTAGCATTGCGTGATGACTGT
GATCCATCACGAACCTGGTTACAGATACAAACTACTACTGAAAAAAGTGAATGTGTTGAG
GTCAATAGTTTGCCATCAAGTGTTCCAGAGGAAAAAAGTACAGATATTTTATCAACAGTT
CCAAGCCCTTCGGAATTAATTGGAGATGAAAACAACGCTATGAAAGATGGTGACAATAAA
CTTAGTGAAAATTGTGAACCAAATTTAAACAGTAATCCAAATCCAGAACGCGACGTTGTC
GATGAAGATAATGAAACTGATTTAAGAGCTAGTGCTGATCAAACGACTGATGACATGACG
AACGAGCTACTAATTAATGAGGACAGAGACGAAAGAAAAGAAGAGGATTCCGAAAATGAA
TGCATCGTATCAGAAAACACTAAGGATAAGGAAAATTCTATTGGTGTGGAAGTTTTAAAT
GACTCGATGGAGGAAAATAAAGAAAATAAGGAATTGATGGATGATTCTGAATCCGCTGCT
AATACAGATAGGAAAATGAACGCGATATCTAGTATCCCAGAGTTAAAAGAAGGCATAAGA
AATAATCTAAATGAATTAAGAAAAGCAAACGAGAAGTGGCAAGAAAAGATGAAACTTAAT
GTTCATGATACTTTGAACCGAATTAATAAAAAAAATAATATTGCTAAATCTCCAAATATT
CAGTCAGAATCCCTGAAGGATAATGATTCACAAAATGCATCTGAAATATTGCAGGAATTC
GACGGCGACACTGAAGCACACGCTTCAAATAATTGTAAAACACCTGCTGAAGATTTTGAT
GAATTTAAGGATCATCCAGTAACTGATAATATAAAAGAAAAATATAATACCGAATATGTA
GGAAGTGAACCACAAAATTTAAACGAACATATTCCTAAAAATGTTGATGAAACTAATGAA
TGTAAAATTCACGAAGAAAAGACTAGCAATCTAGATACAAAATTAGACTCAGTAAACAAC
GAAGAAAGCGAAATGGAAGTTCAGGCGAGTGAATCAAAGCAAAGGAATGCTAAAATAGAT
CTAGATAATTTAAGGTATGTGAAACAAGACATTAATGACGAAGCCAATAATTTAAATAAT
GTTGGAAGTTCGTTTTTACAAGCTAATAACTTTTTTGATGACGTCAAGTCAAAACTTACT
AATGTGTTTGGGAAAACAAACGAAAATATATTTAGTGGATTTGTATCCAACGCAGGAAAC
GTGGATAGAACTGATTCAAGAGGATTTATAAAAGATCGTGTTATAGACTTCACTAAACCT
CTTCTGAATTTAGATAAAAGTACCGATTTATTTAAGAATGCACCGGATGATACAGAAACG
AAAAGACATAGTATTAAGTCATTTAATTCGAATCTACAACCTTTCAAAGCAAGAGTTGCA
ATCCCAGAGTCCCTTGATATAACACCAAGTTTAACACGAGCTGGAAATACAAAAGAAATG
AGCCAAAAATTGTATAATTCACAAAGAATACTTCCGAAAACATACGCTTCTCGGATATAT
TCGCAAAATTCGGATGAAAAGTCGGGAGGAGATGAACGACTATTTTACAAAGAATCAGAC
ATCGGTAGCCTTAATTCAGAATCAACAATTTTATCACCTAAGACATTTAAAAATACCTTA
CAATTCCCGACATCCAAAAATCTTTTGTCGAGTAACTTAAAAGGCCGACCTGATGGAGCT
TTAAAGAATATACATGACACGTTGGGGAGTTCTCTGAGATCAATGCCATTCAATCATGAT
GACTTACTTGAAACGATGGCCAGTAAAAGAAACGAACTCAACGAAAGACTCAAATCTTTA
AACTATGATTTAAACGATAGACTGTCCTCGCTTTTGCCCTCATTTAGAGTCCAACCCTTA
ACGCAGAACAGAATAACTTCCAAAAACAGGTCCGATGTTAGAAATAAACAAGCTTTGGAT
AATTTAAGAAATTTACCGTTCACCAATAACGTCTCATCAAGATCTAAACTGAATACATTA
TCAAGAAAAAGTAATGCTGATTCGGTCGCATTTAAACCTTTAAAAAAATCAGTTAATGTG
AGAGATCGCCTCGCCGACCTCAAAATACTCAGACTACGTGATGAACCAAGTTTAAAAAGG
CCCACATCTGCTAAGTCTGTATTTACATCACCAGTGGAATCTAAATCTTCACGCACTGCG
AGCAGAACTATGAAACCTGCATTTCGTACGGATAGCAAAATTATTTCCTTGACTCCACCT
ACTTCTCCCAAAACGTCGCTTCCAAGGTTAAAAACTGTTAAATCTCCTGAAAAACAACCT
TCTGCTCTATATAATATGGAGAAAAGACCAATTATTTCAAAACTTAAGTCTGGTGAACCA
TTTACCTTGGCTAGCGAAAATACAATGTCCAGAGCACCCTTTCACCGTACGGCAGTTGCA
GACAAATCAGCCTTAGATACTTCATCAAGTGACTTGGATGCACGATCTCTTCACAAAACA
TCGAAACAAGACGTGGGAAAAAATATAGCCATTTCAAAATTGAAAGAAGTATTAAAAGCG
AGAAGTTTTAATAACGATTTACAAAAACCAAAAGTTATTGATGACGTCATCCCAGCAGCA
ACTGAAGTTGGGGATCCGAAGACAGGAATCTTGAGACAAAATGTACAGTATAAATGTAAG
ATGAGGGAACTCACATCCCTTCGGAGGCTTAGGGGTTCTAGATCGTTAGTGAGCATTGGA
TCGTTGACAATACGAACTGCCCGACGTTGTATGGAGTCAAGTAGAAGAAGTTGGTACTGC
GGGGCATCGGACCAGATATGGGAACAATACTCCATGTGGGGACGAACCTGTGCTTTGTAT
AATAAAAGCAATTGTCCCGTGTTAGACAGACCGTGTGCCGTCAGAACTACTCCTAGGGAG
TCTGTGGTCTGTGTGTGCAACGCAACTTACTGTGATGAGATCCGGAGAGAGATTCCCACC
AAAGACAACGTCATCATATACACATCCTCAAAGAGTGGCTTCCGCTTTACAAAATCTTAT
GGAAAATTGCAAGCTCTAGATTCACTACATTCATCGAGATTTCTGGAAATCGATCCGACG
ACAAAATATCAGACGATCGAGGGTTTCGGTGGAGCCGTCACGGATGCCGCTTCAATGAAC
TGGAAAAGTGTGTCCGATAATTTACAGGACAAGCTCATAAATTCTTACTTCAGTGAAGAC
GGCCTCCAATATAACATGCTTCGAGTACCAATCGGAGGATCAGATTTCTCCACCCACGCG
TATGCTTACAACGACTTACCAGAGAACGACGCCTTTCTTACTAACTTCACTCTCGCTCCC
GAAGACATCATGTATAAAATTCCAATGATCAAAAGAATTATGGCAGTATCCCGAACAGCA
GTGCACATCGTGGCCACGACCTGGTCTCCGCCGCCTTGGATGAAGACAGGACGCAGTTTC
GCTGGCTTTAGCAGATTAAAACAGGAATATTTTCAGACCTACGCTGATTATCACTTGAAA
TTTCTTCAAAAGTACAATGAATCTGGGATACCGATCTGGGGTCTCACAACCACCAACGAA
CCCATCAATGGAGTATTTGATTTATGCAACTTCAACTCTCTCGGCTGGACCGTCACTAAA
ATGGCAGATTGGATCGTCAATAACTTTGGTCCAACAATCCGGAACTCCTCTTTTAAAGAT
GTTAAAATAATGGTTGGAGACGATCAGCGGTTCACCATTCCTTATTGGTTTATCGGGATG
GTGGCATATCGTCCTGAATCTCTGAAGTACGTGGACGGTGTCGCTGTACACTACTATACA
GACCAGTTTATTTCCCCGATTGCGTTTGAAGCTGTGACTAAGGCTCATCCAGAGAAATTT
GTTCTTTCCACCGAAGCCTGTGAAGGCACTTTACCCTGGCAGAAAAATAAGGTCCTGTTG
GGATCATGGCAACGTGCGAAGACATATGTTTTAGATATATTGGAGGGCGGATCATCCCCG
ATCGAGCCAGCACATGTTCGAGCTACAGTTAATTTACCTTAG
Protein sequence:
MMWFLLVACVLQIHGISSQHLKSDDCCPCSPENSFSSSNVLNKDHVSGNCPCRDADSEPA
SSIFRPSFKTATNARQVDGKEEPKPLLNPEVGLASSVLETLREVSDQEYQDALARDVARS
ALRSMNTNAEAGETGDESKNVVKIILGPRHNTDMDTISEPSHNHETRCIHPSPGQENVFE
QYQEQPQRKLYKHFILSKPVVGLQKNIFDMDNSPSDIQESVVTVRPSVLDNIKMKTDLLK
SKLFLNNEGLTQRNEPKISFDKVFYKPVIKLPDFNLASPLEHFKKSKLFDSYKTPSPAVM
TKDNFLYKSVNINGNSDLISPQQQSQSRKLDGNEKLLPQFNSLASKEARKSIKSTDSFIS
DDFNTDNTILKPVLALRDDCDPSRTWLQIQTTTEKSECVEVNSLPSSVPEEKSTDILSTV
PSPSELIGDENNAMKDGDNKLSENCEPNLNSNPNPERDVVDEDNETDLRASADQTTDDMT
NELLINEDRDERKEEDSENECIVSENTKDKENSIGVEVLNDSMEENKENKELMDDSESAA
NTDRKMNAISSIPELKEGIRNNLNELRKANEKWQEKMKLNVHDTLNRINKKNNIAKSPNI
QSESLKDNDSQNASEILQEFDGDTEAHASNNCKTPAEDFDEFKDHPVTDNIKEKYNTEYV
GSEPQNLNEHIPKNVDETNECKIHEEKTSNLDTKLDSVNNEESEMEVQASESKQRNAKID
LDNLRYVKQDINDEANNLNNVGSSFLQANNFFDDVKSKLTNVFGKTNENIFSGFVSNAGN
VDRTDSRGFIKDRVIDFTKPLLNLDKSTDLFKNAPDDTETKRHSIKSFNSNLQPFKARVA
IPESLDITPSLTRAGNTKEMSQKLYNSQRILPKTYASRIYSQNSDEKSGGDERLFYKESD
IGSLNSESTILSPKTFKNTLQFPTSKNLLSSNLKGRPDGALKNIHDTLGSSLRSMPFNHD
DLLETMASKRNELNERLKSLNYDLNDRLSSLLPSFRVQPLTQNRITSKNRSDVRNKQALD
NLRNLPFTNNVSSRSKLNTLSRKSNADSVAFKPLKKSVNVRDRLADLKILRLRDEPSLKR
PTSAKSVFTSPVESKSSRTASRTMKPAFRTDSKIISLTPPTSPKTSLPRLKTVKSPEKQP
SALYNMEKRPIISKLKSGEPFTLASENTMSRAPFHRTAVADKSALDTSSSDLDARSLHKT
SKQDVGKNIAISKLKEVLKARSFNNDLQKPKVIDDVIPAATEVGDPKTGILRQNVQYKCK
MRELTSLRRLRGSRSLVSIGSLTIRTARRCMESSRRSWYCGASDQIWEQYSMWGRTCALY
NKSNCPVLDRPCAVRTTPRESVVCVCNATYCDEIRREIPTKDNVIIYTSSKSGFRFTKSY
GKLQALDSLHSSRFLEIDPTTKYQTIEGFGGAVTDAASMNWKSVSDNLQDKLINSYFSED
GLQYNMLRVPIGGSDFSTHAYAYNDLPENDAFLTNFTLAPEDIMYKIPMIKRIMAVSRTA
VHIVATTWSPPPWMKTGRSFAGFSRLKQEYFQTYADYHLKFLQKYNESGIPIWGLTTTNE
PINGVFDLCNFNSLGWTVTKMADWIVNNFGPTIRNSSFKDVKIMVGDDQRFTIPYWFIGM
VAYRPESLKYVDGVAVHYYTDQFISPIAFEAVTKAHPEKFVLSTEACEGTLPWQKNKVLL
GSWQRAKTYVLDILEGGSSPIEPAHVRATVNLP
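
The listed CDS length (5142 nt) is consistent with the protein above: 1713 residues plus one stop codon gives 1714 codons, and 1714 × 3 = 5142. A minimal sketch for checking this kind of record by translating the nucleotide sequence is shown below. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene model; the helper name `translate_cds` is hypothetical and is not part of any tool used to build the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: the standard nuclear code is appropriate for this gene model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into protein, dropping a single terminal stop codon.

    Whitespace and line breaks are ignored, so the 60-column sequence
    lines from the record can be pasted in directly.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last nucleotide lines of the record, copied verbatim.
first_line = "ATGATGTGGTTCCTTTTGGTAGCGTGTGTGTTGCAGATTCATGGCATATCCTCACAGCAT"
last_line = "ATCGAGCCAGCACATGTTCGAGCTACAGTTAATTTACCTTAG"

print(translate_cds(first_line))  # MMWFLLVACVLQIHGISSQH
print(translate_cds(last_line))   # IEPAHVRATVNLP
```

The two printed fragments match the first 20 and last 13 residues of the protein sequence listed above. Translating the full 5142-nt sequence the same way should reproduce the complete 1713-residue protein.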