Monarch geneset OGS2.0

DPOGS207714
TranscriptDPOGS207714-TA762 bp
ProteinDPOGS207714-PA253 aa
Genomic positionDPSCF300042 - 1353985-1359879
RNAseq coverage1073x (Rank: top 12%)
Annotation
HeliconiusHMEL0153212e-12884.58% 
BombyxBGIBMGA009814-TA1e-7982.74% 
Drosophilaaay-PA8e-6853.68% 
EBI UniRef50UniRef50_F4WY572e-7861.43%Phosphoserine phosphatase n=14 Tax=Eumetazoa RepID=F4WY57_ACREC
NCBI RefSeqXP_001812661.17e-8262.21%PREDICTED: similar to pxPhosphoserine phosphatase [Tribolium castaneum]
NCBI nr blastpgi|1179701797e-11482.46%pxPhosphoserine phosphatase [Plutella xylostella]
NCBI nr blastxgi|1179701791e-10882.46%pxPhosphoserine phosphatase [Plutella xylostella]
Group
Gene OntologyGO:00046474.1e-58phosphoserine phosphatase activity
GO:00065644.1e-58L-serine biosynthetic process
GO:00081526e-36metabolic process
GO:00167916e-36phosphatase activity
KEGG pathwaytgu:1002281345e-82 
 K01079 (serB, PSPH)maps-> Glycine, serine and threonine metabolism
InterPro domain[29-245] IPR0044694.1e-58Phosphoserine phosphatase SerB
[36-248] IPR0232141.2e-48HAD-like domain
[42-212] IPR0063836e-36HAD-superfamily hydrolase, subfamily IB, PSPase-like
[53-105] IPR0231901e-14Phosphoserine phosphatase, domain 2
Orthology groupMCL13901 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207714-TA
ATGGCTTTATACAGCTTGAAAAACAATCTCCATTTGACGACATTGAAGACGCTTTCACTCGTTTCAATCAGCGTGATGTCTCAGAAGCAAACAGTCCAAGAAATCTTTCGTACAGCTGACTGCGTGTGCTTCGATGTGGACTCCACCGTTATAAGGGATGAAGGTATCGACGAACTCGCCAACTTCTGCGGAAAAGGGGATGAGGTTAAAAGACTAACCGCGGAAGCTATGGGCGGCAATATGACGTTTCAAGAGGCCCTTAAGAAGAGACTTGATATTATAAGACCCAGTGCTAGTCAAATTAAAGAATTCATTGAAACACATCCTATACATCTCACGCCTGGAATATCTGAATTAGTGAAGACGTTACACGAGAGAGGTGTAGCTGTGTATTTAGTGTCAGGAGGCTTCAGATGTCTTATTGAACCGGTTGCCGAGATCTTGGAAATTCCCAAAGCGAACGTATACGCTAATAGACTCAAATTCTATTTCAATGGTGATTACGCTGGCTTCGATGACACGGAGCCTACTTCTCGTTCCGGTGGTAAGGGTCTGGTGATAAGACGTCTGAAAGAACACCACCACTACCAGAGACTGGTGATGATTGGTGATGGGGCCACTGACGCCGAGGCCAGTCCCCCAGCTGATGCGTTCATTGGTTTTGGTGGAAACGTAGTGAGGGAGGAGGTTAAGAAGAAGGCGTCGTGGTACGTCACAGACTTCCAGGAACTGATTACGTCACTCGCGATCCAGACTAAGTGA

Protein sequence:

>DPOGS207714-PA
MALYSLKNNLHLTTLKTLSLVSISVMSQKQTVQEIFRTADCVCFDVDSTVIRDEGIDELANFCGKGDEVKRLTAEAMGGNMTFQEALKKRLDIIRPSASQIKEFIETHPIHLTPGISELVKTLHERGVAVYLVSGGFRCLIEPVAEILEIPKANVYANRLKFYFNGDYAGFDDTEPTSRSGGKGLVIRRLKEHHHYQRLVMIGDGATDAEASPPADAFIGFGGNVVREEVKKKASWYVTDFQELITSLAIQTK-