Monarch geneset OGS2.0

DPOGS205617
Transcript: DPOGS205617-TA | 1629 bp
Protein: DPOGS205617-PA | 542 aa
Genomic position: DPSCF300023 - 987743-992082
RNAseq coverage: 10x (Rank: top 84%)
Annotation
Heliconius: HMEL007346 | 0.0 | 68.42%
Bombyx: BGIBMGA013129-TA | 1e-139 | 55.58%
Drosophila: CG31414-PB | 6e-100 | 37.18%
EBI UniRef50: UniRef50_UPI0002246E16 | 1e-108 | 38.48% | UPI0002246E16 related cluster n=1 Tax=unknown RepID=UPI0002246E16
NCBI RefSeq: XP_001606590.1 | 2e-109 | 39.47% | PREDICTED: similar to glucocerebrosidase [Nasonia vitripennis]
NCBI nr blastp: gi|345485723 | 5e-108 | 38.48% | PREDICTED: glucosylceramidase-like [Nasonia vitripennis]
NCBI nr blastx: gi|345485723 | 7e-109 | 38.48% | PREDICTED: glucosylceramidase-like [Nasonia vitripennis]
Group
Gene Ontology:
GO:0006665 | 1.5e-191 | sphingolipid metabolic process
GO:0005764 | 1.5e-191 | lysosome
GO:0004348 | 1.5e-191 | glucosylceramidase activity
GO:0007040 | 1.5e-191 | lysosome organization
GO:0043169 | 5.5e-126 | cation binding
GO:0005975 | 5.5e-126 | carbohydrate metabolic process
GO:0003824 | 5.5e-126 | catalytic activity
KEGG pathway: aga:AgaP_AGAP001649 | 5e-108
K01201 (E3.2.1.45, GBA, srfJ) maps to:
    Lysosome
    Sphingolipid metabolism
    Other glycan degradation
InterPro domain:
[9-543] IPR001139 | 1.5e-191 | Glycoside hydrolase, family 30
[112-469] IPR013781 | 5.5e-126 | Glycoside hydrolase, subgroup, catalytic core
[126-476] IPR017853 | 2e-88 | Glycoside hydrolase, superfamily
[470-524] IPR013780 | 9.8e-12 | Glycosyl hydrolase, family 13, all-beta
Orthology group: MCL10162 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205617-TA
ATGGGCTTCCAGAAGATGTATACGATCTTTGTCATCGCAATATGGCTATGCCTCGCCTTGGAAGCTACGGCGGACATGCCTTGTAATGCTCGAGACGTTGGGATACAGGGAAGGTCTGTGTTATGTGTCTGTAATGCTACCTACTGCGACACAATCACTAGAGACGCTCCCGCAAGAGGCAGGTTCGTCAGTTATACTAGCTCCAAAGCCGGAAAGCGCTTCCAGAAGGGTGGTGGTCCAATTCAAATAGACTTTTCAAGATCTCAAGAAAGTGCAAACAAAGTTCACGTAAATGAGGATTTTCTTGACAAAATACAACCCAAGTTTCTAGACTCTGCATCATACACACTAATCTCGGAAATTCAGTACCAGAGTATCGAAGGCTTTGGAGGTTCGGTTACAGATGCCGCTTCCATCAACTGGCAGAAACTCTCACCAGGAGCGCAAGATCACTTTGTTAATTCCTACTTCAGTAAAAACGGTCTGGAGTACAATTTGATTCGAACGCCCATAGGTGGCGCTGACTTCTCTACCTATCCTTACACATACAACGAATACCCCATTAATGATTACAATTTGTCCAACTTCTCGCTGAGCGAAGAAGATTATAAACTCAAGCTCCCATTAATCAAACGAGGACAAGCCGTCTCCACATCCGAAATCAAAGTGACAGCGAGCACATGGTCGCCACCAGTATGGATGAAGACCAACAACGCCATCACCGGATTTGCTCAAGTCAAACCGGAGTTCTACCAGTCATACGCTGACTATCACTTACGTTTCATTGAAGAATACGATAAAGAGAACGTCACAGTTTGGGCCATAACAACAACTAACGAACCTATCAACGGAATGATCCCCTTCGTTGATTTCAATTCGCTCGGATGGTTCCCGTGGGATTTGGGTCGCTGGGTCGCGAACAACCTCGGCCCAACTATCAAGAGCTCGAAATACAACAAAACATTGATTCTGGCCGTTGACGAACAACGCTACCTCTTGGATCTTTATTTGGAAGGGATGCTAGCGGCTGCGCCGAAGGCAATCGACTACATAGATGGTATCGCCGTCCATTACTATGGCAACTTTTTCCCTGCCCAAGTTTTAACGAACCTACAGGAGAGGTATCCTGGTAAAATTATTCTTGCCACCGAAGCCTGTGAAGGTCCGATGCCGTGGGATGTGATGAGAGTGAAGATTGGCTCTTGGGAACGAGCAGATAGATACACCAAACACATTATTGATGACCTAAATAACTTCGTCGTCGGATGGTTGGATTGGAATCTGTGTCTGGATGAAGACGGTGGCCCGAACTGGGCACACAACTACGTCGACTCGCCCATTTTAGTCAACGGAGAAAAAGATGAATTTTATAAACAGCCCATGTTCTACGCTATGGGACACTTTTCCAAATTCATTCCACGAGGTTCCAAAAGAATACAAGTCGTGAGAACCAGTATTGGACATGTCGAAAACGTGGCGTTCATCACGCCGGAGAGGAATGTCGTGATGGTCTTACATAATCCGAACAATTCAGTGAGGAGGGTTCGAATAACAGTAGCTTGGAGGAGATACATTGACGTCACCCTTGACCCAGAATCGATACAAACTGTTGAAGTTAACCTCAACTAA

Protein sequence:

>DPOGS205617-PA
MGFQKMYTIFVIAIWLCLALEATADMPCNARDVGIQGRSVLCVCNATYCDTITRDAPARGRFVSYTSSKAGKRFQKGGGPIQIDFSRSQESANKVHVNEDFLDKIQPKFLDSASYTLISEIQYQSIEGFGGSVTDAASINWQKLSPGAQDHFVNSYFSKNGLEYNLIRTPIGGADFSTYPYTYNEYPINDYNLSNFSLSEEDYKLKLPLIKRGQAVSTSEIKVTASTWSPPVWMKTNNAITGFAQVKPEFYQSYADYHLRFIEEYDKENVTVWAITTTNEPINGMIPFVDFNSLGWFPWDLGRWVANNLGPTIKSSKYNKTLILAVDEQRYLLDLYLEGMLAAAPKAIDYIDGIAVHYYGNFFPAQVLTNLQERYPGKIILATEACEGPMPWDVMRVKIGSWERADRYTKHIIDDLNNFVVGWLDWNLCLDEDGGPNWAHNYVDSPILVNGEKDEFYKQPMFYAMGHFSKFIPRGSKRIQVVRTSIGHVENVAFITPERNVVMVLHNPNNSVRRVRITVAWRRYIDVTLDPESIQTVEVNLN-
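
The transcript and protein records above can be checked against each other. The following is a minimal Python sketch, assuming the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code (the record itself does not state this). The function name `translate` and the table construction are illustrative and not part of the database. Only the first 30 and last 21 nucleotides of DPOGS205617-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 21 nucleotides of DPOGS205617-TA, copied from the record.
start = translate("ATGGGCTTCCAGAAGATGTATACGATCTTT")
end = translate("GTTGAAGTTAACCTCAACTAA")

# Length bookkeeping: 1629 bp = 543 codons = 542 residues + 1 stop codon.
n_residues = 1629 // 3 - 1

print(start, end, n_residues)
```

The translated fragments agree with the start (MGFQKMYTIF) and end (VEVNLN) of DPOGS205617-PA. The trailing "-" in the protein record corresponds to the TAA stop codon, which this sketch writes as "*".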