Monarch geneset OGS2.0

DPOGS209007
TranscriptDPOGS209007-TA1821 bp
ProteinDPOGS209007-PA606 aa
Genomic positionDPSCF300209 - 85198-90142
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0025410.066.94% 
BombyxBGIBMGA012548-TA0.061.01% 
DrosophilaCG15533-PA6e-8432.11% 
EBI UniRef50UniRef50_Q7Q4M51e-10736.04%AGAP008487-PA n=7 Tax=Culicidae RepID=Q7Q4M5_ANOGA
NCBI RefSeqXP_316957.42e-10836.04%AGAP008487-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582965724e-10736.04%AGAP008487-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582965723e-10736.24%AGAP008487-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00066854.3e-87sphingomyelin catabolic process
GO:00047674.3e-87sphingomyelin phosphodiesterase activity
GO:00167872.2e-09hydrolase activity
KEGG pathwayaga:AgaP_AGAP0119404e-80 
 K12350 (SMPD1, ASM)maps-> Lysosome
    Sphingolipid metabolism
InterPro domain[1-607] IPR0111604.3e-87Sphingomyelin phosphodiesterase
[24-100] IPR0110012.1e-10Saposin-like
[254-451] IPR0048432.2e-09Metallophosphoesterase domain
Orthology groupMCL10909 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209007-TA
ATGTTAAACGATGTTTTTCATATTTATAGACCAACGTACCTGATGAAAACACAGGAAACTCATACTAGAACAACCCTGACCTGCCTACTATGTCGCAGTATATTTTCAGCATTCATAGACATGGTTGAAGAAAACCAATCCGAGCAGAATCTTATAAACACCATAACCACATTATGCTCCACTTTGGGAGTTCTTAGCCCGAAATCATGCAGTGGTCTTATCGATTTGAATTTGCCTATAATAATTTATATAATAAAAAATACTCCAGGCGTAGCCTCAAGAACTTTTTGTGGATTACTGTTTCAAACTGCCAATAATCCGAACTCTTGTGTTTACAATGATCCGAGATTTGAGTGGTCTGTTGATTTACCAGAACCTTCGGAATTTATGGAGACCGAGTCTCGCCAATCGAACTCGAAGCCACTGAAAATTGCTTTAATTTCGGATGCGCACATAGATCCGTTTTACGAACCAAATGGAGTCGCAGACTGCGATGAACCGACTTGTTGTAGGAAGGAACAAACTCCAAGGAGATTAACCTTCAATTATGACTTACTAGAGACTCATGTAGACAAAAGTCTGTCAAATACAGGTGATACATACATGCTGAATCTAGACGCAGCTACGGGTATTAAAAGTGTAAATATTGTGTCAAGAAATAACAACACTTCACCGCCCGCCGGTTATTGGGGAGATTATAGGAATTGCGATACCCCTCTATGGGCGTACGACGATGTTATTGAAAGAATAGCTTCCACACATAAGGATATAGATGTAGTTTATTATATAGGTGATAATATTGATCATCACGTATGGGAGACGACATTTGAAATGATAAATGGTATGAACCAATATGTTATTGATAAAATGAGAAAAGAATTTGGGGACAATGTGTTGATTGTACCGTGTATAGGAAACCACGAATCTCAACCAACAAACCAATTTGCTCCGAGTTCGATCAAAGGGGATAAACTCAATACGACTTGGCTTTACGAGGCCTTGGTTAAAAAATGGGACTACTACTTAACAGAGGAAGCCAAGATAACTATTTTGGAAAAAGGAGCGTTTACGAGACTCATTAAACCAGGATTAAGAGTGATTTCCATAAACAGTAACATTGCATATAGAAGCAATTGGTGGCTGGTGTATGATCCATTGGAAGCAAAGAGACATTTAGAATGGTTAGTATCCGAACTATACAAAGCAGAAGTCGCCGGAGAGAAGGTTCACATACTGTCTCACATTCCGCCAGGAGTACATGACCTCATTTACACTTGGACAAGGGAATATAATAGAATTATTAATAGATTTAAAAAAACCATAACTGCTGAATTCAATGGTCATTTGCATTCTGATGAATTCAAAATATTCTATAACGGCTCGGATCCTGTCGCGATGGCTTGGGGCGTAGGAAGTTCCACATCTTACTCGGATTACAACGTAAATTACAAAATCGCCACCATAGATAACAATACATTTGAACCATTGAACATAGTCAACTACATATACAATCTTACGGAAGCAAATCTTACACCGAATAGACGTCCTCATTGGTTCCAACTATATGACGTCAGAGGCACCTTTGGAATACCAGATCTATCGCCGGCATCATTAGACAACCTAGTATACCGAATGGTCACCAATCAAACTCAATATTTGGATCTATATGCCGCCTTCTACTCAAAACTCAGCGACACAAGATGGCCGAATTGCAATGACAATTGCAAAATTGACAACCTCTGTAAAACCGTTGTCACCGTGCTATGGGAGCGTTCGAAATGCGAAGAGTTACGTTTGTTATATTTTTCATCGAAATTATAA

Protein sequence:

>DPOGS209007-PA
MLNDVFHIYRPTYLMKTQETHTRTTLTCLLCRSIFSAFIDMVEENQSEQNLINTITTLCSTLGVLSPKSCSGLIDLNLPIIIYIIKNTPGVASRTFCGLLFQTANNPNSCVYNDPRFEWSVDLPEPSEFMETESRQSNSKPLKIALISDAHIDPFYEPNGVADCDEPTCCRKEQTPRRLTFNYDLLETHVDKSLSNTGDTYMLNLDAATGIKSVNIVSRNNNTSPPAGYWGDYRNCDTPLWAYDDVIERIASTHKDIDVVYYIGDNIDHHVWETTFEMINGMNQYVIDKMRKEFGDNVLIVPCIGNHESQPTNQFAPSSIKGDKLNTTWLYEALVKKWDYYLTEEAKITILEKGAFTRLIKPGLRVISINSNIAYRSNWWLVYDPLEAKRHLEWLVSELYKAEVAGEKVHILSHIPPGVHDLIYTWTREYNRIINRFKKTITAEFNGHLHSDEFKIFYNGSDPVAMAWGVGSSTSYSDYNVNYKIATIDNNTFEPLNIVNYIYNLTEANLTPNRRPHWFQLYDVRGTFGIPDLSPASLDNLVYRMVTNQTQYLDLYAAFYSKLSDTRWPNCNDNCKIDNLCKTVVTVLWERSKCEELRLLYFSSKL-