Monarch geneset OGS2.0

DPOGS209831
TranscriptDPOGS209831-TA798 bp
ProteinDPOGS209831-PA265 aa
Genomic positionDPSCF300117 + 663620-667284
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0043445e-13081.51% 
BombyxBGIBMGA008053-TA1e-8170.79% 
DrosophilaSply-PB5e-9660.08% 
EBI UniRef50UniRef50_F4WLC43e-9858.59%Sphingosine-1-phosphate lyase n=11 Tax=Neoptera RepID=F4WLC4_ACREC
NCBI RefSeqXP_967792.12e-10262.55%PREDICTED: similar to sphingosine phosphate lyase isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|910767824e-10162.55%PREDICTED: similar to sphingosine phosphate lyase isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|910767821e-9862.55%PREDICTED: similar to sphingosine phosphate lyase isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00197526.7e-131carboxylic acid metabolic process
GO:00168316.7e-131carboxy-lyase activity
GO:00301706.7e-131pyridoxal phosphate binding
GO:00038248.4e-20catalytic activity
KEGG pathwaytca:6627536e-102 
 K01634 (E4.1.2.27)maps-> Sphingolipid metabolism
InterPro domain[1-266] IPR0021296.7e-131Pyridoxal phosphate-dependent decarboxylase
[1-220] IPR0154241.4e-59Pyridoxal phosphate-dependent transferase, major domain
[1-109] IPR0154218.4e-20Pyridoxal phosphate-dependent transferase, major region, subdomain 1
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209831-TA
ATGGACGATATAAAGTCGCTGTCAGACATAGCGCTGGAGTATGACGTGCCGTTACACGTAGACGCCTGTCTGGGTGGGTTCATAGCGGCCTTCATGACGGAGGCTGGGTACAATGTACCCGTGTTCGACTTCAGACTGCCCGGTGTTGCCAGTATATCAGCTGACACCCATAAGTATGGATACGCGCCCAAGGGTACGTCGGTGATAGTGTACCGTAAGGAAGAATACAGACATCACCAGTATACAGTGAGCACCGAGTGGCCGGGAGGGGTGTACGGCTCGCCCACCGTTAATGGCAGTCGTGCGGGCGGCCTGATAGCGGCGTGCTGGGCCACCATGATGTACGTCGGCCGCGAACAGTACGTGCGGATGGCGGGCGAGGTCGTCCACACGGCCAGGAGGATAGAGGACGAGATCCGTAAGATTAATGGATTGTTCATATTCGGCCAGCCCGCGACCACCGTGGTGGCGTTCGGGTCCAACAGCTTTGACATCTTCAAACTAGCGGACCTCCTGCATCAGAAGGGTTGGAGCTTGAACGCCTTGCAATTTCCTTCCGGAATCCATATAGCGGTGACTCATGCCCACACTAGAGCTGGTGTCGCTGACAGGTTCTTGGCTGACCTGAAGGAAACTACAGCTGTTTGTATGAAGGAAGGAAGTGCTCCCGTAGAGGGGAAGATGGCTATTTACGGTGTGGCACAGAGTATACCAGATAGAAGTCTAGTGTCAGACATCACTAAATACTTCATCGATTCCATGTACTATTTACCCAAGCCCGGTGATGATGTGGAATGA

Protein sequence:

>DPOGS209831-PA
MDDIKSLSDIALEYDVPLHVDACLGGFIAAFMTEAGYNVPVFDFRLPGVASISADTHKYGYAPKGTSVIVYRKEEYRHHQYTVSTEWPGGVYGSPTVNGSRAGGLIAACWATMMYVGREQYVRMAGEVVHTARRIEDEIRKINGLFIFGQPATTVVAFGSNSFDIFKLADLLHQKGWSLNALQFPSGIHIAVTHAHTRAGVADRFLADLKETTAVCMKEGSAPVEGKMAIYGVAQSIPDRSLVSDITKYFIDSMYYLPKPGDDVE-