Monarch geneset OGS2.0

DPOGS209451
TranscriptDPOGS209451-TA1245 bp
ProteinDPOGS209451-PA414 aa
Genomic positionDPSCF300275 - 95881-99173
RNAseq coverage245x (Rank: top 42%)
Annotation
HeliconiusHMEL0133784e-14864.66% 
BombyxBGIBMGA005868-TA3e-17271.16% 
Drosophila% 
EBI UniRef50UniRef50_Q16S245e-13656.47%Sphingosine-1-phosphate phosphohydrolase n=1 Tax=Aedes aegypti RepID=Q16S24_AEDAE
NCBI RefSeqXP_001661054.11e-13656.47%sphingosine-1-phosphate phosphohydrolase [Aedes aegypti]
NCBI nr blastpgi|1571271432e-13556.47%sphingosine-1-phosphate phosphohydrolase [Aedes aegypti]
NCBI nr blastxgi|1571271432e-13857.96%sphingosine-1-phosphate phosphohydrolase [Aedes aegypti]
Group
Gene OntologyGO:00160201.8e-17membrane
GO:00038241.8e-17catalytic activity
KEGG pathwaynvi:1001222852e-126 
 K04716 (SGPP1)maps-> Sphingolipid metabolism
InterPro domain[125-232] IPR0003261.8e-17Phosphatidic acid phosphatase type 2/haloperoxidase
[83-232] IPR0161184.8e-15Phosphatidic acid phosphatase/chloroperoxidase, N-terminal
Orthology groupMCL15807 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209451-TA
ATGTGGCATAAAATAATAGAACATCTAAACGATCCGTTGCTTGTGGTTAAAGTTCAGAATTTCTTCGGTGTTATATACAAAAGAAGTTGCCAAAATGAGGCGACGAGTATAATACATTCCGATAGACACGACCGTGAAGAGGAGTGTGATGTCAGACAGCACAAGAGAATCCCCAGTGACATATCTGGCAGTTCCCAGTGTTCTTCCACAACCGACTCATCTGAGGGCCCAGAGGAGTTGGAGTGTCACATCAACAATAAGTTCTGGTACTATCTGTTCGTGATTGGAACATCTCTGGGAGATGAAATTTTCTATGCAACTTTTATACCATTTTGGTTCTGGAATGTCGATGGTGCTGTGGGAAGGAGAGTGGTGCTCGTGTGGACCGTGGTAATGTACATAGGTCAGGGTATCAAGGACGTGGTTCGCTGGCCGCGGCCCGGTCACCCCGTCAGGAAGCTGCAGCAGAAGTGGGCGATCGAGTACGGCATGCCATCCACACACGCCATGGTGGGCGTCTCCATACCGTTCTCTGTTCTGTTGTTCACCATGAACAGGTACCAGTACCCCGTGCACTGGGGCCTGATCCTGGCGGTCTGTTGGTGCACCCTCATATGTGTCAGCAGAGTATATCTTGGGATGCATAGCGTTCTGGATATAGCGGCAGGTCTGTTGCTGGCGAGTCTCCTGCTGCTGCCTCTCGTGCCGCTGGTAGACGCGCTGGACGCCTGGCTGCTGGAGTCTCCCTGGTCGCCGTTGTCGGTGCTCGCCGTCTCGGTGCTGGCTGTGGTCTTCCATCCACAGTCCGACAAGTGGACGCCCACTAGAGGCGATACGACGATGATAGTGAGCGTGTGCGCGGGCCTGCTGGTGGGCGCGTGGGTAAACTTCCAGTGCGGGCACATGTCCCCCAGCCCCACCCCGGCGCCCTACACCATCATCTGGCCATCCGTGGACATGCTGGGCTGCGCGCTGCTCCGGACCACGCTCGGCTTGTGCGGCGTGCTCGCCACTCGCGCCGTCGCCAAGAGCTTCTCGTACGCCTTGATCTGCGCGATCCTCGGCAAGGACAAGAACGAGTTGAGGAACTCCGCCGACAGTCTCGACAACAAAAACAAGATCTTCGTGGAGTGCTGCTACAAGTACTTCACGTACGGCCTCATAGGGTTCAACACCACGTACGTGTTCCCGAACGTGTTCGAGCTGCTGTTCATCAACAGACCCACCTACTACACCGAGATCTGA

Protein sequence:

>DPOGS209451-PA
MWHKIIEHLNDPLLVVKVQNFFGVIYKRSCQNEATSIIHSDRHDREEECDVRQHKRIPSDISGSSQCSSTTDSSEGPEELECHINNKFWYYLFVIGTSLGDEIFYATFIPFWFWNVDGAVGRRVVLVWTVVMYIGQGIKDVVRWPRPGHPVRKLQQKWAIEYGMPSTHAMVGVSIPFSVLLFTMNRYQYPVHWGLILAVCWCTLICVSRVYLGMHSVLDIAAGLLLASLLLLPLVPLVDALDAWLLESPWSPLSVLAVSVLAVVFHPQSDKWTPTRGDTTMIVSVCAGLLVGAWVNFQCGHMSPSPTPAPYTIIWPSVDMLGCALLRTTLGLCGVLATRAVAKSFSYALICAILGKDKNELRNSADSLDNKNKIFVECCYKYFTYGLIGFNTTYVFPNVFELLFINRPTYYTEI-