Monarch geneset OGS2.0

DPOGS212893
TranscriptDPOGS212893-TA702 bp
ProteinDPOGS212893-PA233 aa
Genomic positionDPSCF300909 - 3547-5314
RNAseq coverage115x (Rank: top 59%)
Annotation
HeliconiusHMEL0083169e-7066.85% 
BombyxBGIBMGA013129-TA5e-7761.40% 
DrosophilaCG31414-PB3e-4542.08% 
EBI UniRef50UniRef50_B7PNW76e-4847.06%Beta-glucocerebrosidase, putative n=3 Tax=Ixodidae RepID=B7PNW7_IXOSC
NCBI RefSeqXP_002435459.11e-4847.06%beta-glucocerebrosidase, putative [Ixodes scapularis]
NCBI nr blastpgi|3464676116e-5050.49%hypothetical protein [Amblyomma maculatum]
NCBI nr blastxgi|3464676111e-4850.49%hypothetical protein [Amblyomma maculatum]
Group
Gene OntologyGO:00066654.2e-82sphingolipid metabolic process
GO:00057644.2e-82lysosome
GO:00043484.2e-82glucosylceramidase activity
GO:00070404.2e-82lysosome organization
GO:00431694.7e-45cation binding
GO:00059754.7e-45carbohydrate metabolic process
GO:00038244.7e-45catalytic activity
KEGG pathwaycin:1001851823e-49 
 K01201 (E3.2.1.45, GBA, srfJ)maps-> Lysosome
    Sphingolipid metabolism
    Other glycan degradation
InterPro domain[1-223] IPR0011394.2e-82Glycoside hydrolase, family 30
[1-147] IPR0137814.7e-45Glycoside hydrolase, subgroup, catalytic core
[1-154] IPR0178538e-33Glycoside hydrolase, superfamily
[148-203] IPR0137809e-11Glycosyl hydrolase, family 13, all-beta
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212893-TA
ATGGTTGGAGACGATCAGCGACTCACCATACCTTATTGGTTTAACGAGATGGTTTCGTATCGTCCCGAATCTCTAAAATACGTGGACGGTGTCGCTGTACACTACTATACAGACATGTTTGTCTCCCCGGTTAAGCTTGAAGCAGTTACTAAGACCCATCCAGAAAAATTTATTCTGGCCACTGAAGCCTGTGAAGGCGTTAATGCCGGACAGAAAAACTTTGTGCTGTTGGGCTCCTGGGAAAGAGCGAGGTCATACATATTAGACATACTGGAAGATTTAAATTATAACTTAGTTGGATGGATTGATTGGAATCTGTGTCTCGACCCTCGCGGAGGTCCCAACTGGGCGAGTAACTTCGCTGATGCTGCTATAATTGTTGACAAAACAAACGACGAATTCATAAAACAGCCCATGTTCTATGCGATGGGACATTTTTCAAAGTTTATTCCTCGTGGTTCCAGAAGAATAAAAGTCAAAGAACATAAGTCAATATTTGAACTTTCTTTGAGACACGTCGCCTTTATCACGCCGAGAGGTACGATCGTGGCCGTCATTTACAACGACGGCCGTTCGCAGACGATATCCATAACTATTAAAAACAGACAGTTGAAGGTAAAGCTCGAAGGGGATTCAGTGTCGACGATAGAGTTCCAATCTGATGAAGTCATGACTTTACAAACAAAGGGAATTGACAAATAA

Protein sequence:

>DPOGS212893-PA
MVGDDQRLTIPYWFNEMVSYRPESLKYVDGVAVHYYTDMFVSPVKLEAVTKTHPEKFILATEACEGVNAGQKNFVLLGSWERARSYILDILEDLNYNLVGWIDWNLCLDPRGGPNWASNFADAAIIVDKTNDEFIKQPMFYAMGHFSKFIPRGSRRIKVKEHKSIFELSLRHVAFITPRGTIVAVIYNDGRSQTISITIKNRQLKVKLEGDSVSTIEFQSDEVMTLQTKGIDK-