Monarch geneset OGS2.0

DPOGS209830
Transcript: DPOGS209830-TA (1368 bp)
Protein: DPOGS209830-PA (455 aa)
Genomic position: DPSCF300117 + 655375-662746
RNAseq coverage: 127x (Rank: top 57%)
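The coordinates and lengths above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: the helper name `parse_position` is hypothetical, and the span calculation assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into typed parts.

    Hypothetical helper; the field layout is inferred from this record only.
    """
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position("DPSCF300117 + 655375-662746")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

transcript_bp = 1368
protein_aa = 455

# A complete coding sequence has one codon per residue plus one stop codon.
codons = transcript_bp // 3
lengths_consistent = transcript_bp % 3 == 0 and codons - 1 == protein_aa

print(scaffold, strand, genomic_span, lengths_consistent)
```

Under that assumption the gene spans 7372 bp on the scaffold, and the 1368 bp transcript corresponds to 456 codons, that is, 455 residues plus a stop codon.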
Annotation
Database          Hit                E-value   Identity   Description
Heliconius        HMEL004344         0.0       74.53%
Bombyx            BGIBMGA008053-TA   6e-143    65.45%
Drosophila        Sply-PB            9e-135    52.05%
EBI UniRef50      UniRef50_F4WLC4    7e-132    51.08%     Sphingosine-1-phosphate lyase n=11 Tax=Neoptera RepID=F4WLC4_ACREC
NCBI RefSeq       XP_967792.1        7e-142    52.70%     PREDICTED: similar to sphingosine phosphate lyase isoform 1 [Tribolium castaneum]
NCBI nr blastp    gi|91076782        1e-140    52.70%     PREDICTED: similar to sphingosine phosphate lyase isoform 1 [Tribolium castaneum]
NCBI nr blastx    gi|91076782        5e-137    55.42%     PREDICTED: similar to sphingosine phosphate lyase isoform 1 [Tribolium castaneum]
Group
Gene Ontology
  GO:0019752   1.6e-169   carboxylic acid metabolic process
  GO:0016831   1.6e-169   carboxy-lyase activity
  GO:0030170   1.6e-169   pyridoxal phosphate binding
  GO:0003824   2.3e-27    catalytic activity
KEGG pathway
  tca:662753   2e-141
  K01634 (E4.1.2.27) maps-> Sphingolipid metabolism
InterPro domain
  [48-456]    IPR002129   1.6e-169   Pyridoxal phosphate-dependent decarboxylase
  [45-410]    IPR015424   4.6e-83    Pyridoxal phosphate-dependent transferase, major domain
  [154-300]   IPR015421   2.3e-27    Pyridoxal phosphate-dependent transferase, major region, subdomain 1
Orthology group: MCL13786 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209830-TA
ATGAGTGAGAGACCACAACCATTGAAAGCTATCAACCGTTTCTTTGAGGGCAAGGAACCATGGCAGATAGTTACCATGACTGCATCATCAGTACTTGCAATAGTGTGGGCCCATAGTCTATACAATGCTAAAGATCGATCTTACGACTGGAAGGGAGGTAACGTCTCCGGGGCTGTTTATCACTTGGATGAAGACATATCTAAAGTCGCCTGTGAAGCGTATTCGTCTACAGCCTACACAAACCCCTTACACGCTGATGTGTTTCCCGGAATCAACAAAATGGAGGCCGAGATTGTTAGGATGACGGTCAATCTCTTCCACGGGGATGAGAACTGTTGTGGAACGCCAACGTATGCTCAGAGCGCTTGTACTGAGATACGGAATCGTTGGTGCATTTGCACTGAGCCACGTGACCTTGCTGTGCCACTATATATACATAAACGCCTGTACTTGTATACCATGCCGGTCTCGTCAGAAACTTATACCGTTGACATAGAGGCTGTCAAACGAGCTATAGGCAGAAGGACATGTATGATAGTGGGTTCAGCTCCGAACTACCCGTACGGCACCATGGACGATATAAAGTCGCTGTCAGACATAGCGCTGGAGTATGACGTGCCGTTACACGTAGACGCCTGTCTCGGTGGGTTCATAGCGGCCTTCATGACGGAGGCTGGGTACAATGTACCCGTGTTCGACTTCAGACTGCCCGGTGTTGCCAGTATATCAGCTGACACCCATAAGTATGGATACGCGCCCAAGGGTACGTCGGTGATAGTGTACCGTAAGGAAGAATACAGACATCACCAGTATACAGTGAGCACCGAGTGGCCGGGAGGGGTGTACGGCTCGCCCACCGTTAATGGCAGTCGTGCGGGCGGCCTGATAGCGGCGTGCTGGGCCACCATGATGTACGTCGGCCGCGAACAGTACGTGCGGATGGCGGGCGAGGTCGTCCACACGGCCAGGAGGATAGAGGACGAGATCCGTAAGATTAATGGATTGTTCATATTCGGCCAGCCCGCGACCACCGTGGTGGCGTTCGGGTCCAACAGCTTTGACATCTTCAAACTAGCGGACCTCCTGCATCAGAAGGGTTGGAGCTTGAACGCCTTGCAATTTCCTTCCGGAATCCATATAGCGGTGACTCATGCCCACACTAGAGCTGGTGTCGCTGACAGGTTCTTGGCTGACCTGAAGGAAACTACAGCTGTTTGTATGAAGGAAGGAAGTGCTCCCGTAGAGGGGAAGATGGCTATTTACGGTGTGGCACAGAGTATACCAGATAGAAGTCTAGTGTCAGACATCACTAAATACTTCATCGATTCCATGTACTATTTACCCAAGCCCGGTGATGATGTGGAATGA

Protein sequence:

>DPOGS209830-PA
MSERPQPLKAINRFFEGKEPWQIVTMTASSVLAIVWAHSLYNAKDRSYDWKGGNVSGAVYHLDEDISKVACEAYSSTAYTNPLHADVFPGINKMEAEIVRMTVNLFHGDENCCGTPTYAQSACTEIRNRWCICTEPRDLAVPLYIHKRLYLYTMPVSSETYTVDIEAVKRAIGRRTCMIVGSAPNYPYGTMDDIKSLSDIALEYDVPLHVDACLGGFIAAFMTEAGYNVPVFDFRLPGVASISADTHKYGYAPKGTSVIVYRKEEYRHHQYTVSTEWPGGVYGSPTVNGSRAGGLIAACWATMMYVGREQYVRMAGEVVHTARRIEDEIRKINGLFIFGQPATTVVAFGSNSFDIFKLADLLHQKGWSLNALQFPSGIHIAVTHAHTRAGVADRFLADLKETTAVCMKEGSAPVEGKMAIYGVAQSIPDRSLVSDITKYFIDSMYYLPKPGDDVE-
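
The relationship between the two sequences can be illustrated by translating the ends of DPOGS209830-TA with the standard genetic code. The sketch below is an illustration rather than the method used to build the geneset: `translate` is a hypothetical helper, use of the standard codon table is an assumption, and only the first 30 and last 18 nucleotides are copied from the record to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 18 nucleotides of DPOGS209830-TA, copied from the record.
cds_start = "ATGAGTGAGAGACCACAACCATTGAAAGCT"
cds_end = "GGTGATGATGTGGAATGA"

print(translate(cds_start))  # MSERPQPLKA
print(translate(cds_end))    # GDDVE*
```

Both fragments agree with DPOGS209830-PA, which begins MSERPQPLKA and ends GDDVE. The record writes the terminal stop codon (TGA) as "-" where this sketch prints "*".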