Monarch geneset OGS2.0

DPOGS208187
TranscriptDPOGS208187-TA1452 bp
ProteinDPOGS208187-PA483 aa
Genomic positionDPSCF300207 + 215790-222172
RNAseq coverage600x (Rank: top 21%)
Annotation
HeliconiusHMEL0157140.074.90% 
BombyxBGIBMGA010122-TA0.073.74% 
DrosophilaCG5618-PA2e-12846.69% 
EBI UniRef50UniRef50_G8FGR50.071.13%Putative sulfinoalanine decarboxylase n=1 Tax=Bicyclus anynana RepID=G8FGR5_9NEOP
NCBI RefSeqXP_002428040.12e-14054.99%Cysteine sulfinic acid decarboxylase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3532822210.071.13%putative sulfinoalanine decarboxylase [Bicyclus anynana]
NCBI nr blastxgi|3532822210.071.43%putative sulfinoalanine decarboxylase [Bicyclus anynana]
Group
Gene OntologyGO:00197521.3e-192carboxylic acid metabolic process
GO:00168311.3e-192carboxy-lyase activity
GO:00301701.3e-192pyridoxal phosphate binding
GO:00038242e-97catalytic activity
KEGG pathwaydpo:Dpse_GA190094e-125 
 K01594 (E4.1.1.29, CSAD)maps-> Taurine and hypotaurine metabolism
InterPro domain[9-476] IPR0021291.3e-192Pyridoxal phosphate-dependent decarboxylase
[4-476] IPR0154241.4e-100Pyridoxal phosphate-dependent transferase, major domain
[81-351] IPR0154212e-97Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[352-476] IPR0154221.6e-11Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL15137 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208187-TA
ATGGATACTAAGGGTCATTCATTTCTGGACAGAGTTTTTAATATTTTGAAGGATGAAACCGCTGACGAAGTTCCTCTCATTCAGTTTAAGCACCCAAAGGAGTTAGAAGAAATACTCAGCAATGACCTCGACATATCTATAAACGTGGATGACAAAGAGTTGGAAAACTCCGTTAGAAAAATTCTGCAGTACAGTGTTAAAACAAACAAAGTGACCTTTAGGAACCAGTTGTATGGTCCCATGGATCCCTATGGACTGGCCGGGACGTGGATAGCCGAGGCTTTCAACACCAGCCAGTACACATTTGAAGTCGCCCCAGTATTTACATTGATAGAACTCAAAATGATAGAGCACATATTAAAATTGTTTGGTATTCCTGGAGGGGATGGTATCTTCAGTCCAGGAGGGAGCGTGTCCATGTTGTATGCCTTAGTAGCTGCGAGATTTAAAAAATTCCCTGAAGTCAAGAGCAAGGGCATGCAAAAACTGCCAGAGATGACCATATTCACTTCCGAAGATAGTCACTATTCCATTATTAAAGCTGCTCATTGGCTTGGTTTTGGCACAGAAAGTGTTATATCTATTAAAACTAATAGTTCTGGTCAGATGATCGTAAACGAACTAAATAAAGCGATAGAACGTCAGTTGGGTCTAGGAAAATATCCTGTGTTCGTAAACGCTACTGCGGGTACAACGGTTCTTGGAGCCATTGACGACTTGGAAGCAATTGCATCTGTTTGTAAAAAATACGATATTTGGATGCATGTTGACGCCTGTTGGGGTGGAGGCTTGATGCTTTCTGCCACATTAAGAAAAAGATTGCAAGGAATTCAATTTGCAGATTCCATTTCCTGGAATCCACATAAGATGATCGGCGCTCCGTTACAATGTTCGATATTTCTGCTGAAAGAAAAGGGTTTGCTGCACGAAGCCAACGCTGCTGCAGCACAGTACCTGTTCCAGCAAGACAAGTTTTATGACGTTCGATATGACACAGGGGATAAAAGTGTCCAGTGCGGGAGAAAGATAGATTCTTTTAAATTGTGGATGATGTGGAAGGCGCGAGGAGATATCGGGCTCTGTAAGGTGATGGATCATGTAATGAGTATATCAGAGTTCTGTTTGCGATCTATTGCTGAAAGAGAAGGTTTCAGACTAGTTTCGGATACCTTGCAATGTCCTAACATTTGCTTCTGGTACATTCCAGTTTTTATGAGAAAACGGGAGGAAAATGATGAGTGGTGGGCACTCATTCATAAGATTACACCAAAACTGAAAGAGCTGTTGACCTTAAGTTCTCGTTTAATGATAGCTTACACCCCTTTACGTCATCACAAGAATTTTTTCCGTCTAGCGTTCACCTTCCACCCTGTTCTTGAAGAAAATCACGTCCTGGAGATATTAAAATCTATAGAGGAATGTGGGGAAATGGTAAACACGGACATGGTCTAG

Protein sequence:

>DPOGS208187-PA
MDTKGHSFLDRVFNILKDETADEVPLIQFKHPKELEEILSNDLDISINVDDKELENSVRKILQYSVKTNKVTFRNQLYGPMDPYGLAGTWIAEAFNTSQYTFEVAPVFTLIELKMIEHILKLFGIPGGDGIFSPGGSVSMLYALVAARFKKFPEVKSKGMQKLPEMTIFTSEDSHYSIIKAAHWLGFGTESVISIKTNSSGQMIVNELNKAIERQLGLGKYPVFVNATAGTTVLGAIDDLEAIASVCKKYDIWMHVDACWGGGLMLSATLRKRLQGIQFADSISWNPHKMIGAPLQCSIFLLKEKGLLHEANAAAAQYLFQQDKFYDVRYDTGDKSVQCGRKIDSFKLWMMWKARGDIGLCKVMDHVMSISEFCLRSIAEREGFRLVSDTLQCPNICFWYIPVFMRKREENDEWWALIHKITPKLKELLTLSSRLMIAYTPLRHHKNFFRLAFTFHPVLEENHVLEILKSIEECGEMVNTDMV-