Monarch geneset OGS2.0

DPOGS212694
Transcript: DPOGS212694-TA; 1623 bp
Protein: DPOGS212694-PA; 540 aa
Genomic position: DPSCF300012; 903804-907704
RNAseq coverage: 29x (Rank: top 76%)
Annotation
Heliconius: HMEL007346; 2e-122; 43.25%
Bombyx: BGIBMGA013129-TA; 5e-121; 47.87%
Drosophila: CG31414-PB; 2e-79; 33.52%
EBI UniRef50: UniRef50_E7EZM1; 2e-102; 39.96%; Si:dkey-19n13.4 n=76 Tax=Euteleostomi RepID=E7EZM1_DANRE
NCBI RefSeq: XP_975652.1; 7e-108; 41.80%; PREDICTED: similar to putative lysosomal glucocerebrosidase [Tribolium castaneum]
NCBI nr blastp: gi|198435580; 4e-110; 43.02%; PREDICTED: similar to glucosidase, beta, acid [Ciona intestinalis]
NCBI nr blastx: gi|198435580; 4e-108; 43.02%; PREDICTED: similar to glucosidase, beta, acid [Ciona intestinalis]
Group
Gene Ontology:
  GO:0006665; 4e-190; sphingolipid metabolic process
  GO:0005764; 4e-190; lysosome
  GO:0004348; 4e-190; glucosylceramidase activity
  GO:0007040; 4e-190; lysosome organization
  GO:0043169; 4.2e-124; cation binding
  GO:0005975; 4.2e-124; carbohydrate metabolic process
  GO:0003824; 4.2e-124; catalytic activity
KEGG pathway: cin:100185182; 7e-111
  K01201 (E3.2.1.45, GBA, srfJ) maps to:
    Lysosome
    Sphingolipid metabolism
    Other glycan degradation
InterPro domain:
  [8-529]; IPR001139; 4e-190; Glycoside hydrolase, family 30
  [90-447]; IPR013781; 4.2e-124; Glycoside hydrolase, subgroup, catalytic core
  [105-454]; IPR017853; 3.1e-87; Glycoside hydrolase, superfamily
  [448-500]; IPR013780; 6e-06; Glycosyl hydrolase, family 13, all-beta
Orthology group: MCL10162; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212694-TA
ATGGTGTCAGTTAAATTATTTGTAGTTTCGACGCTGTCGTTTTTACTGCATCAGATCTCTGCTGAATGTTCAGACGTGCCATGTGCTGCCAAATTTTATAATGAATCGTCAGTGTGTGTTTGCAACTCGACGTACTGCGACACTATCACCAGGGTTACCAGTTTTGAATCTGGTACCTTCGCCACGTATACTAGCAGTAAAGAAGGCAAAAGATTTTATAAACAGATTAATGATATCCAGAGCCGGGACCCGTCAGTGAGCGACGATGAAGGAAATGTTTTCCTACTTGATCCGACGATTCGTTATCAGAATATCGAGGGTTTTGGCGGCGCTGTCACGGACTCGGCTGGTATAAACTTGAAGAGCTTACCTTTAGCAGCGCAACGGAAATTAATTAATTCCTATTTCAGTGATGTAGGTATAGAATACAACATGCTCCGTCTGCCGATCGCCTCAACGGATTTTTCGACACGTATCTACTCTTACGATGATTATCGTAACGATACAGATCTAGACAATTTTAAATTAGCTAAAGAAGACTATGAATATAAGATTCCTTTCATTCAACATGCTAAAGATGTCGCCACTGACGATATTCATATAGTCGCGGCATCATGGTCACCACCTAAATGGATGAAGACTAAAGACAGCATGGTCGCAGGAGGTTCCGTAAAACATCGGTTCATGCAGGACTACGCTGAATATCATTGCAAATTTGCAGAGGCTTACAAGAAGAATGGCATTGATATCTGGGCTATGTCAACATCAAACGAACCAACGTCCCCTCTGCTGAGGACTCCTTTCCAAAGCACGTTGTGGTATGTGCCGGATATGGGTGCCTTCATTTCGAAGTATCTTGGACCAACATTAAGGAATTGTTCCGCGGAAGGAATTAAGTTACTCACTATTGACGACCAACGTGGCTTAATTCCACTGTTCTCTGCGTTGTTTTCAATTATAGCGCCGGAAGCTATTGAATACGTAGACGGTTTCGCTCTGCATTCATATTTTAATAAAATAACCCCACCCTCTATGTCAACATTCCTCCTGAAACAGTTTCCAGATAAATTTGCTCTGGTGACTGAATATTGTGCAGGATCTTCGCCGTCCGACAAACCGAAAGTAGATTTGGGTTCCTGGGCGAGAGCTAAAGACTACGTCCAAGATATTCTAGAGAATCTCAACAGCAATTTTGTCGGTTGGGTGGATTGGAACTTATGTTTGAACAGACAAGGCGGTCCGACCTGGGTCGGGAACTTCGTAGACTCCCCGATCATAGTTGACGCTAAAAAGCAAGAGTTCTACAAACAGCCCATGTTCTATGCCATGGGACATTTTTCTAAATTCCTACCGAGAGGATCACAAAGAATACAAATGTCTGCTGGTGCCGATAACCAATGTGACAAGTGCGTTAAGGATCCGTCCAAAGAATACATTGCCTTCATGACGCCCGAAGACACAATTGTGGTCATAATATATAATGACGGGGAAGCACGCGAGGCGACATTACGAGTGGGCACTTGTGACGTCATCACCCAACTCGATGCAAATTCCATAACAACCATCGAAATACCGACCACACAAGACGATGGAAACATGGACTGTAAAATCGTAGTGGACTAA

Protein sequence:

>DPOGS212694-PA
MVSVKLFVVSTLSFLLHQISAECSDVPCAAKFYNESSVCVCNSTYCDTITRVTSFESGTFATYTSSKEGKRFYKQINDIQSRDPSVSDDEGNVFLLDPTIRYQNIEGFGGAVTDSAGINLKSLPLAAQRKLINSYFSDVGIEYNMLRLPIASTDFSTRIYSYDDYRNDTDLDNFKLAKEDYEYKIPFIQHAKDVATDDIHIVAASWSPPKWMKTKDSMVAGGSVKHRFMQDYAEYHCKFAEAYKKNGIDIWAMSTSNEPTSPLLRTPFQSTLWYVPDMGAFISKYLGPTLRNCSAEGIKLLTIDDQRGLIPLFSALFSIIAPEAIEYVDGFALHSYFNKITPPSMSTFLLKQFPDKFALVTEYCAGSSPSDKPKVDLGSWARAKDYVQDILENLNSNFVGWVDWNLCLNRQGGPTWVGNFVDSPIIVDAKKQEFYKQPMFYAMGHFSKFLPRGSQRIQMSAGADNQCDKCVKDPSKEYIAFMTPEDTIVVIIYNDGEAREATLRVGTCDVITQLDANSITTIEIPTTQDDGNMDCKIVVD-
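
The transcript and protein lengths above are consistent with each other: 1623 bp is 541 codons, that is 540 amino acids plus one stop codon. A minimal sketch of how the two sequences could be cross-checked follows. It assumes the standard nuclear genetic code, and the function name `translate` and its parameters are illustrative, not part of the database. The stop codon is rendered as "-" to match the convention used in the protein record.

```python
# Sketch: translate a nucleotide sequence with the standard genetic code.
# `translate` is a hypothetical helper written for this illustration.
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering; "*" marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str, stop: str = "-") -> str:
    """Translate `nt` in frame 0; unknown codons become 'X'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore an incomplete trailing codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 18 nucleotides of DPOGS212694-TA, copied from the record.
first_30 = "ATGGTGTCAGTTAAATTATTTGTAGTTTCG"
last_18 = "AAAATCGTAGTGGACTAA"

print(translate(first_30))  # start of the protein record
print(translate(last_18))   # end of the protein record, including the stop
print(1623 // 3 - 1)        # codons minus the stop codon
```

Translating the full DPOGS212694-TA sequence the same way and comparing the result with DPOGS212694-PA would confirm that the two records agree over their whole length.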