Monarch geneset OGS2.0

DPOGS212721
TranscriptDPOGS212721-TA6147 bp
ProteinDPOGS212721-PA2048 aa
Genomic positionDPSCF300012 - 392297-411797
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0073462e-13254.52% 
BombyxBGIBMGA013129-TA2e-15261.56% 
DrosophilaCG31148-PA3e-9242.40% 
EBI UniRef50UniRef50_Q0IG102e-9740.36%Glucosylceramidase n=5 Tax=Culicidae RepID=Q0IG10_AEDAE
NCBI RefSeqXP_975653.11e-10244.92%PREDICTED: similar to O-Glycosyl hydrolase family 30 protein [Tribolium castaneum]
NCBI nr blastpgi|910873872e-10144.92%PREDICTED: similar to O-Glycosyl hydrolase family 30 protein [Tribolium castaneum]
NCBI nr blastxgi|910873872e-10044.92%PREDICTED: similar to O-Glycosyl hydrolase family 30 protein [Tribolium castaneum]
Group
Gene OntologyGO:00066651.4e-179sphingolipid metabolic process
GO:00057641.4e-179lysosome
GO:00043481.4e-179glucosylceramidase activity
GO:00070401.4e-179lysosome organization
GO:00431692.5e-131cation binding
GO:00059752.5e-131carbohydrate metabolic process
GO:00038242.5e-131catalytic activity
KEGG pathwaytca:6645643e-102 
 K01201 (E3.2.1.45, GBA, srfJ)maps-> Lysosome
    Sphingolipid metabolism
    Other glycan degradation
InterPro domain[1627-2045] IPR0011391.4e-179Glycoside hydrolase, family 30
[1645-2004] IPR0137812.5e-131Glycoside hydrolase, subgroup, catalytic core
[1660-2011] IPR0178537.9e-87Glycoside hydrolase, superfamily
[2005-2046] IPR0137801.6e-06Glycosyl hydrolase, family 13, all-beta
Orthology groupMCL10162 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212721-TA
ATGATGTGGTTCCTTTTGGTAGCGTGTGTGTTGCAGATTCATGGCATATCCTCACAGCATTTAAAATCTGACGATTGCTGTCCTTGCTCTCCAGAAAATAGTTTCTCCAGTAGTAACGTTTTGAACAAAGATCATGTCTCTGGGAATTGTCCTTGTAGAGATGCGGATTCTGAACCCGCTTCATCTATATTTAGACCAAGTTTTAAAACAGCTACCAATGCCAGGCAAGTCGATGGAAAGGAGGAGCCAAAGCCACTTTTAAATCCGGAAGTCGGCCTCGCGTCTTCAGTTTTAGAAACTTTACGTGAAGTTTCTGACCAAGAATACCAAGATGCTCTGGCAAGAGATGTTGCGAGGAGTGCTTTAAGATCAATGAACACAAATGCCGAGGCAGGTGAAACTGGCGATGAAAGCAAAAATGTTGTTAAGATTATATTAGGTCCTAGACATAACACAGATATGGACACCATATCTGAACCGTCTCACAACCACGAAACCCGCTGCATACACCCATCGCCAGGACAAGAAAACGTATTTGAACAATACCAAGAACAGCCACAAAGAAAATTATATAAGCACTTCATCCTGAGTAAGCCAGTTGTAGGTTTACAAAAAAATATTTTTGACATGGACAATAGCCCCAGCGACATTCAAGAAAGCGTAGTGACAGTTAGACCCAGTGTATTAGATAATATAAAAATGAAGACAGATCTATTGAAATCAAAGTTATTTTTAAACAATGAGGGATTAACACAAAGAAACGAACCTAAAATATCTTTTGATAAGGTATTTTATAAGCCAGTCATAAAATTGCCCGACTTCAATTTGGCTTCTCCCTTGGAGCATTTTAAGAAATCAAAGTTATTTGATTCATACAAGACCCCCTCACCTGCCGTGATGACAAAAGACAATTTTTTATATAAAAGCGTCAACATAAATGGCAACAGTGATCTTATATCACCGCAGCAACAATCGCAAAGTAGAAAATTGGATGGCAATGAAAAATTATTACCACAATTTAATTCACTTGCATCCAAAGAAGCTCGCAAATCTATAAAAAGTACGGATTCTTTTATTAGTGATGATTTTAATACGGATAATACAATATTGAAACCAGTATTAGCATTGCGTGATGACTGTGATCCATCACGAACCTGGTTACAGATACAAACTACTACTGAAAAAAGTGAATGTGTTGAGGTCAATAGTTTGCCATCAAGTGTTCCAGAGGAAAAAAGTACAGATATTTTATCAACAGTTCCAAGCCCTTCGGAATTAATTGGAGATGAAAACAACGCTATGAAAGATGGTGACAATAAACTTAGTGAAAATTGTGAACCAAATTTAAACAGTAATCCAAATCCAGAACGCGACGTTGTCGATGAAGATAATGAAACTGATTTAAGAGCTAGTGCTGATCAAACGACTGATGACATGACGAACGAGCTACTAATTAATGAGGACAGAGACGAAAGAAAAGAAGAGGATTCCGAAAATGAATGCATCGTATCAGAAAACACTAAGGATAAGGAAAATTCTATTGGTGTGGAAGTTTTAAATGACTCGATGGAGGAAAATAAAGAAAATAAGGAATTGATGGATGATTCTGAATCCGCTGCTAATACAGATAGGAAAATGAACGCGATATCTAGTATCCCAGAGTTAAAAGAAGGCATAAGAAATAATCTAAATGAATTAAGAAAAGCAAACGAGAAGTGGCAAGAAAAGATGAAACTTAATGTTCATGATACTTTGAACCGAATTAATAAAAAAAATAATATTGCTAAATCTCCAAATATTCAGTCAGAATCCCTGAAGGATAATGATTCACAAAATGCATCTGAAATATTGCAGGAATTCGACGGCGACACTGAAGCACACGCTTCAAATAATTGTAAAACACCTGCTGAAGATTTTGATGAATTTAAGGATCATCCAGTAACTGATAATATAAAAGAAAAATATAATACCGAATATGTAGGAAGTGAACCACAAAATTTAAACGAACATATTCCTAAAAATGTTGATGAAACTAATGAATGTAAAATTCACGAAGAAAAGACTAGCAATCTAGATACAAAATTAGACTCAGTAAACAACGAAGAAAGCGAAATGGAAGTTCAGGCGAGTGAATCAAAGCAAAGGAATGCTAAAATAGATCTAGATAATTTAAGGTATGTGAAACAAGACATTAATGACGAAGCCAATAATTTAAATAATGTTGGAAGTTCGTTTTTACAAGCTAATAACTTTTTTGATGACGTCAAGTCAAAACTTACTAATGTGTTTGGGAAAACAAACGAAAATATATTTAGTGGATTTGTATCCAACGCAGGAAACGTGGATAGAACTGATTCAAGAGGATTTATAAAAGATCGTGTTATAGACTTCACTAAACCTCTTCTGAATTTAGATAAAAGTACCGATTTATTTAAGAATGCACCGGATGATACAGAAACGAAAAGACATAGTATTAAGTCATTTAATTCGAATCTACAACCTTTCAAAGCAAGAGTTGCAATCCCAGAGTCCCTTGATATAACACCAAGTTTAACACGAGCTGGAAATACAAAAGAAATGAGCCAAAAATTGTATAATTCACAAAGAATACTTCCGAAAACATACGCTTCTCGGATATATTCGCAAAATTCGGATGAAAAGTCGGGAGGAGATGAACGACTATTTTACAAAGAATCAGACATCGGTAGCCTTAATTCAGAATCAACAATTTTATCACCTAAGACATTTAAAAATACCTTACAATTCCCGACATCCAAAAATCTTTTGTCGAGTAACTTAAAAGGCCGACCTGATGGAGCTTTAAAGAATATACATGACACGTTGGGGAGTTCTCTGAGATCAATGCCATTCAATCATGATGACTTACTTGAAACGATGGCCAGTAAAAGAAACGAACTCAACGAAAGACTCAAATCTTTAAACTATGATTTAAACGATAGACTGTCCTCGCTTTTGCCCTCATTTAGAGTCCAACCCTTAACGCAGAACAGAATAACTTCCAAAAACAGGTCCGATGTTAGAAATAAACAAGCTTTGGATAATTTAAGAAATTTACCGTTCACCAATAACGTCTCATCAAGATCTAAACTGAATACATTATCAAGAAAAAGTAATGCTGATTCGGTCGCATTTAAACCTTTAAAAAAATCAGTTAATGTGAGAGATCGCCTCGCCGACCTCAAAATACTCAGACTACGTGATGAACCAAGTTTAAAAAGGCCCACATCTGCTAAGTCTGTATTTACATCACCAGTGGAATCTAAATCTTCACGCACTGCGAGCAGAACTATGAAACCTGCATTTCGTACGGATAGCAAAATTATTTCCTTGACTCCACCTACTTCTCCCAAAACGTCGCTTCCAAGGTTAAAAACTGTTAAATCTCCTGAAAAACAACCTTCTGCTCTATATAATATGGAGAAAAGACCAATTATTTCAAAACTTAAGTCTGGTGAACCATTTACCTTGGCTAGCGAAAATACAATGTCCAGAGCACCCTTTCACCGTACGGCAGTTGCAGACAAATCAGCCTTAGATACTTCATCAAGTGACTTGGATGCACGATCTCTTCACAAAACATCGAAACAAGACGTGGGAAAAAATATAGCCATTTCAAAATTGAAAGAAGTATTAAAAGCGAGAAGTTTTAATAACGATTTACAAAAACCAAAAGTTATTGATGACGTCATCCCAGCAGCAACTGAAGTTGGGGATCCGAAGACAGGAATCTTGAGACAAAATGTACAGTATAAATTGTTAGACAGACCGTGTGCCGTCAGAACTACTCCTAGGGAGTCTGTGGTCTGTGTGTGCAACGCAACTTACTGTGATGAGATCCGGAGAGAGATTCCCACCAAAGACAACGTCATCATATACACATCCTCAAAGAGTGGCTTCCGCTTTACAAAATCTTATGGAAAATTGCAAGCTCTAGATTCACTACATTCATCGAGATTTCTGGAAATCGATCCGACGACAAAATATCAGACGATCGAGGGTTTCGGTGGAGCCGTCACGGATGCCGCTTCAATGAACTGGAAAAGTGTGTCCGATAATTTACAGGACAAGCTCATAAATTCTTACTTCAGTGAAGACGGCCTCCAATATAACATGCTTCGAGTACCAATCGGAGGATCAGATTTCTCCACCCACGCGTATGCTTACAACGACTTACCAGAGAACGACGCCTTTCTTACTAACTTCACTCTCGCTCCCGAAGACATCATGTATAAAATTCCAATGATCAAAAGAATTATGGCAGTATCCCGAACAGCAGTGCACATCGTGGCCACGACCTGGTCTCCGCCGCCTTGGATGAAGACAGGACGCAGTTTCGCTGGCTTTAGCAGATTAAAACAGGAATATTTTCAGACCTACGCTGATTATCACTTGAAATTTCTTCAAAAGTACAATGAATCTGGGATACCGATCTGGGGTCTCACAACCACCAACGAACCCATCAATGGAGTATTTGATTTATGCAACTTCAACTCTCTCGGCTGGACCGTCACTAAAATGGCAGATTGGATCGTCAATAACTTTGGTCCAACAATCCGGAACTCCTCTTTTAAAGATGTTAAAATAATGGTTGGAGACGATCAGCGGTTCACCATTCCTTATTGGTTTATCGGGATGGTGGCATATCGTCCTGAATCTCTGAAGTACGTGGACGGTGTCGCTGTACACTACTATACAGACCAGTTTATTTCCCCGATTGCGTTTGAAGCTGTGACTAAGGCTCATCCAGAGAAATTTGTTCTTTCCACCGAAGCCTGTGAAGGCACTTTACCCTGGCAGAAAAATAAGGTCCTGTTGGGATCATGGCAACGTGCGAAGACATATGTTTTAGATATATTGGAGAGTGGTCTACGGTTTCATAAATCTTATGGAAAGTGGATAAACACTGATTCACATTATCCAACACTGGAAATTGACCCAACAGTAAAATATCAGACAATCGAGGGTTTTGGGGGATCGGTGACAGACGCTGCTGCTATCAACTGGAGGAGCTTATCAGAAAAGTCGCAAAGAAAACTTATAAAATCGTATTTCAGTAAAAATGGTTTGGAATACAATATGGTTCGGACGCCTATAGGAGCGTCAGACTTCTCAACCCACGCGTACGCTTACAACGACTTACCCGAGAATGACACTCTTCTCACTAATTTCACCCTCACTCCTGAAGACATTATGTACAAAATTCCAATGATAAAAGCTATAATGAAGTTGTCCAATACACCTGTGCATATCATGGCTGCGACCTGGTCTCCGCCGATGTGGATGAAGACGCGCTACAGCTTCGGTGGTATAAATCGATTAAAAAGAAGCTATTACCAAACATACGCTGATTATCATTTAAAATTTCTCCAGAGGTATAAGGAAGCAGGTGTTCCGATATGGGCTCTCACAACTACCAACGAACCCATCAACGGTGTCTTCAATCTTATAGGATTCAATTGTCTCGGCTGGGATATTAACGATATGGGAGACTGGATTGTAAATTATTTCGGACCGACCATCCGTAATTCGGAATTCAAAGATATCAAAATAATCGGTGGGGACGACCAGAGGTTCACGATACCTTACTGGATTGACATGATGCAGCACACTCATCCTAAGGCTTTGGAATACGTTGACGGAATCGGAGTACACTTCTATACTGATGAGATTATACCAGCCAACGTCTTCGAACCGGTCTCTCGTCAATATCCAGATAAATTTATTATATCAACTGAGGCTTGCGAAGGGAGTCAGCATAGACAAAGAAACAAAGTGTTATTAGGTTCCTGGCATCGAGCCAAGTCATACATAACAGATATTATAGAAGATTTAAATTATAACTTAGTTGGCTGGATTGATTGGAATCTGTGTCTCGACCCTCGCGGAGGTCCCAACTGGGCGAGTAACTTCGCTGATGCTGCTATAATTGTTGACAAAACAAACGACGAATTCATAAAACAGCCCATGTTCTATGCGATGGGACATTTTTCAAAGTTTATTCCTCGTGGTTCCAGAAGAATAAAAACTGTTGAAAACGAATCAACGTCTAAATTGCCTTTAAACCATGTTGCCTTCATTACACCGAGAGAAACAATCGTTATTGTACTTCATAACGATGTAGACTAA

Protein sequence:

>DPOGS212721-PA
MMWFLLVACVLQIHGISSQHLKSDDCCPCSPENSFSSSNVLNKDHVSGNCPCRDADSEPASSIFRPSFKTATNARQVDGKEEPKPLLNPEVGLASSVLETLREVSDQEYQDALARDVARSALRSMNTNAEAGETGDESKNVVKIILGPRHNTDMDTISEPSHNHETRCIHPSPGQENVFEQYQEQPQRKLYKHFILSKPVVGLQKNIFDMDNSPSDIQESVVTVRPSVLDNIKMKTDLLKSKLFLNNEGLTQRNEPKISFDKVFYKPVIKLPDFNLASPLEHFKKSKLFDSYKTPSPAVMTKDNFLYKSVNINGNSDLISPQQQSQSRKLDGNEKLLPQFNSLASKEARKSIKSTDSFISDDFNTDNTILKPVLALRDDCDPSRTWLQIQTTTEKSECVEVNSLPSSVPEEKSTDILSTVPSPSELIGDENNAMKDGDNKLSENCEPNLNSNPNPERDVVDEDNETDLRASADQTTDDMTNELLINEDRDERKEEDSENECIVSENTKDKENSIGVEVLNDSMEENKENKELMDDSESAANTDRKMNAISSIPELKEGIRNNLNELRKANEKWQEKMKLNVHDTLNRINKKNNIAKSPNIQSESLKDNDSQNASEILQEFDGDTEAHASNNCKTPAEDFDEFKDHPVTDNIKEKYNTEYVGSEPQNLNEHIPKNVDETNECKIHEEKTSNLDTKLDSVNNEESEMEVQASESKQRNAKIDLDNLRYVKQDINDEANNLNNVGSSFLQANNFFDDVKSKLTNVFGKTNENIFSGFVSNAGNVDRTDSRGFIKDRVIDFTKPLLNLDKSTDLFKNAPDDTETKRHSIKSFNSNLQPFKARVAIPESLDITPSLTRAGNTKEMSQKLYNSQRILPKTYASRIYSQNSDEKSGGDERLFYKESDIGSLNSESTILSPKTFKNTLQFPTSKNLLSSNLKGRPDGALKNIHDTLGSSLRSMPFNHDDLLETMASKRNELNERLKSLNYDLNDRLSSLLPSFRVQPLTQNRITSKNRSDVRNKQALDNLRNLPFTNNVSSRSKLNTLSRKSNADSVAFKPLKKSVNVRDRLADLKILRLRDEPSLKRPTSAKSVFTSPVESKSSRTASRTMKPAFRTDSKIISLTPPTSPKTSLPRLKTVKSPEKQPSALYNMEKRPIISKLKSGEPFTLASENTMSRAPFHRTAVADKSALDTSSSDLDARSLHKTSKQDVGKNIAISKLKEVLKARSFNNDLQKPKVIDDVIPAATEVGDPKTGILRQNVQYKLLDRPCAVRTTPRESVVCVCNATYCDEIRREIPTKDNVIIYTSSKSGFRFTKSYGKLQALDSLHSSRFLEIDPTTKYQTIEGFGGAVTDAASMNWKSVSDNLQDKLINSYFSEDGLQYNMLRVPIGGSDFSTHAYAYNDLPENDAFLTNFTLAPEDIMYKIPMIKRIMAVSRTAVHIVATTWSPPPWMKTGRSFAGFSRLKQEYFQTYADYHLKFLQKYNESGIPIWGLTTTNEPINGVFDLCNFNSLGWTVTKMADWIVNNFGPTIRNSSFKDVKIMVGDDQRFTIPYWFIGMVAYRPESLKYVDGVAVHYYTDQFISPIAFEAVTKAHPEKFVLSTEACEGTLPWQKNKVLLGSWQRAKTYVLDILESGLRFHKSYGKWINTDSHYPTLEIDPTVKYQTIEGFGGSVTDAAAINWRSLSEKSQRKLIKSYFSKNGLEYNMVRTPIGASDFSTHAYAYNDLPENDTLLTNFTLTPEDIMYKIPMIKAIMKLSNTPVHIMAATWSPPMWMKTRYSFGGINRLKRSYYQTYADYHLKFLQRYKEAGVPIWALTTTNEPINGVFNLIGFNCLGWDINDMGDWIVNYFGPTIRNSEFKDIKIIGGDDQRFTIPYWIDMMQHTHPKALEYVDGIGVHFYTDEIIPANVFEPVSRQYPDKFIISTEACEGSQHRQRNKVLLGSWHRAKSYITDIIEDLNYNLVGWIDWNLCLDPRGGPNWASNFADAAIIVDKTNDEFIKQPMFYAMGHFSKFIPRGSRRIKTVENESTSKLPLNHVAFITPRETIVIVLHNDVD-