Monarch geneset OGS2.0

DPOGS203813
Transcript: DPOGS203813-TA (1872 bp)
Protein: DPOGS203813-PA (623 aa)
Genomic position: DPSCF300010 + 2101825-2107526
RNAseq coverage: 503x (Rank: top 25%)
Annotation

Homology (source, hit, E-value, identity, description):
Heliconius      HMEL013322        0.0  92.13%
Bombyx          BGIBMGA003720-TA  0.0  91.78%
Drosophila      CG3376-PA         0.0  70.13%
EBI UniRef50    UniRef50_F4X0C3   0.0  73.50%  Sphingomyelin phosphodiesterase n=12 Tax=Pancrustacea RepID=F4X0C3_ACREC
NCBI RefSeq     XP_971230.1       0.0  73.45%  PREDICTED: similar to sphingomyelin phosphodiesterase [Tribolium castaneum]
NCBI nr blastp  gi|91088345       0.0  73.45%  PREDICTED: similar to sphingomyelin phosphodiesterase [Tribolium castaneum]
NCBI nr blastx  gi|195124843      0.0  70.49%  GI21317 [Drosophila mojavensis]

Group

Gene Ontology (term, E-value, name):
GO:0006685  0        sphingomyelin catabolic process
GO:0004767  0        sphingomyelin phosphodiesterase activity
GO:0016787  1.7e-33  hydrolase activity

KEGG pathway: tca:659870 (E-value 0.0)
K12350 (SMPD1, ASM) maps to: Lysosome; Sphingolipid metabolism

InterPro domain (residues, entry, E-value, name):
[4-600]    IPR011160  0        Sphingomyelin phosphodiesterase
[162-427]  IPR004843  1.7e-33  Metallophosphoesterase domain
[47-133]   IPR011001  2e-13    Saposin-like
[49-127]   IPR008139  8.4e-09  Saposin B

Orthology group: MCL11066 (Multiple-copy universal gene)

Nucleotide sequence:

>DPOGS203813-TA
ATGGTGGCCTTAAAACGTAACTACTCACGGCTTTTGTGGGACAGCGACAAAAGCTCCAATTTACCTCTGTTTGTGGACAAGGCTTTAAAATTATTAAATCTGAAACAGGTGTACTATGAGGTCGAACACTCTGTTATGTCTAAAGTATCTTGTACAGCTTGTAAAGCTGGAGCGGGTCTGCTGCAGCACTATATGAGGCTGGGTAAAAGCAAAGATGAGATAAATAAAATGATCTATCAGTTTTGTGTCTCTTTAAACATACAGTCTGCAAGAGTTTGCGAAGGCATCACTAGATTGTTTGGTAGTGAAGTAGTGTACGTTTTGAAACGAATAACCTTAGGACCAAACGAAATATGTAGTTTTGTAATCGGCGATGCATGCGGCGATGTATACAATCCCTATCATGAATGGGAAGTTACTTTCCCTCCGGTTCCAAAACCACCTGTGCTGCCTTTAAAAGTACCTGACGATAAGGCACAGACATTCAAAGTACTTCAAATATCAGACACTCATTTCGATCCCTATTATGCTGAAGGAGCCAACGCAGAATGCAATGAGCCCCTTTGTTGTCGGGCATCAAGTGGACCAGCCTTGACTCCTGGCGACGGCGCTGGACGTTGGGGGGATTACCGCAAGTGTGATACTCCGAAACGAACCATAGATGACATGTTACAACATATAGCTAACACACATCCTGATATAGACTACATCTTATGGACCGGAGACTTACCACCTCACGACGTATGGAATCAAACGAAGGAAGAAAATTTAAAAGTTCTACAAGAGACTGTTGCACAGATGTCAGATATGTTTCCTGGAGTGCCTATCTTTCCTGCACTCGGGAACCATGAGTCGTCACCGGTTAACAGTTTTCCTCCTCCATTCATTTCATCACCGGAGTCAAACATGGCGTGGCTGTACAATGAGCTGGACGCCCAATGGCGTCGCTGGCTGCCCGCGGGAGTGTCTCACACTGTTCGACGCGGCGCCTTCTACTCCGTACTCGTTCGACCTGGATTCCGCATCATCTCGCTCAATATGAACTACTGCAACAATAAAAATTGGTGGCTTCTGCTCAATAGTACAGATCCAGCAACGGAACTTCAATGGCTAATATACGAATTGCAAACGGCTGAGTTTAGCGGTGAAAAGGTCCACCTGATAGGACACATCCCACCCGGACATTCGGACTGTCTCAAGGTCTGGAGCAGAAACTATTACGCGATCGTCAATCGTTATGAATCAACAATAACAGCTCAGTTCTTTGGACACACACATTACGATGAATTTGAAGTTTTTTATGATCCGAATGATTTAGGGAGAGCGACGAGTATAGCATATGTCGGTCCGTCAGTTTCGCCGTATTACGATTTAAATCTCGGATACCGGATTTACTACGTGGACGGTGATCACGAAGCAACAACACGATTAGTAGTCGATCACGAGACCTGGATAATGAACTTGAAGGAGGCTAATCTGTTCGGATACCCGATATGGTACAAGCTGTACTCGGCGCGCTCCGCCTACATGATGCCGGCGCTGCGGCCTCAAGACTGGGACAAGTTCATAGACGATATGACCACCAAGGAAGACGTCTTTAATTTATATTACAAACATTACTGGAAAAGCAGTCCACGCCGCGGTATGTGTGACGGCGAATGTCGGAAGAGGCTCGTGTGTGACGCGCGGTCTGGCCGCTCGCACGACAGGCGCGTGCTCTGTGGACACATCGAGGCGCGCATTGACGGTGCGCCCGCGCCGCAAACATGGCGCGCGTGGTTCTATAACGGATTGTCCGTCTCAATGTCAATGTTAATGCAAATACCGCAAGTGGCATATCAAATACCAAAGTTCGTGATGGGATTGGGTTGA

Protein sequence:

>DPOGS203813-PA
MVALKRNYSRLLWDSDKSSNLPLFVDKALKLLNLKQVYYEVEHSVMSKVSCTACKAGAGLLQHYMRLGKSKDEINKMIYQFCVSLNIQSARVCEGITRLFGSEVVYVLKRITLGPNEICSFVIGDACGDVYNPYHEWEVTFPPVPKPPVLPLKVPDDKAQTFKVLQISDTHFDPYYAEGANAECNEPLCCRASSGPALTPGDGAGRWGDYRKCDTPKRTIDDMLQHIANTHPDIDYILWTGDLPPHDVWNQTKEENLKVLQETVAQMSDMFPGVPIFPALGNHESSPVNSFPPPFISSPESNMAWLYNELDAQWRRWLPAGVSHTVRRGAFYSVLVRPGFRIISLNMNYCNNKNWWLLLNSTDPATELQWLIYELQTAEFSGEKVHLIGHIPPGHSDCLKVWSRNYYAIVNRYESTITAQFFGHTHYDEFEVFYDPNDLGRATSIAYVGPSVSPYYDLNLGYRIYYVDGDHEATTRLVVDHETWIMNLKEANLFGYPIWYKLYSARSAYMMPALRPQDWDKFIDDMTTKEDVFNLYYKHYWKSSPRRGMCDGECRKRLVCDARSGRSHDRRVLCGHIEARIDGAPAPQTWRAWFYNGLSVSMSMLMQIPQVAYQIPKFVMGLG-
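
The transcript and protein lengths in this record are consistent with each other: 623 residues plus one stop codon give (623 + 1) x 3 = 1872 bp. The sketch below shows one way the two FASTA records could be cross-checked by translating the nucleotide sequence with the standard genetic code. It assumes the transcript is a plain coding sequence in frame 1 and that the trailing "-" in the protein record marks the stop codon. The function names (read_fasta_sequence, translate, cds_matches_protein) are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def read_fasta_sequence(text):
    """Return the sequence of a single-record FASTA string, header dropped."""
    lines = [line.strip() for line in text.strip().splitlines()]
    return "".join(line for line in lines if line and not line.startswith(">"))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as stop_symbol ("-" is an assumption taken from
    the protein record above). Codons with ambiguous bases such as N are not
    handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def cds_matches_protein(cds, protein, stop_symbol="-"):
    """Check that translating cds reproduces protein exactly."""
    return translate(cds, stop_symbol) == protein


# Toy input: the first six codons of DPOGS203813-TA followed by a stop codon.
print(translate("ATGGTGGCCTTAAAACGTTGA"))  # MVALKR-
```

To check the full record, the two FASTA blocks above can be passed through read_fasta_sequence and then compared with cds_matches_protein.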