Monarch geneset OGS2.0

DPOGS210836
Transcript        DPOGS210836-TA   2136 bp
Protein           DPOGS210836-PA   711 aa
Genomic position  DPSCF300027 + 40443-49889
RNAseq coverage   519x (Rank: top 24%)
Annotation
Heliconius      HMEL009947        0.0     75.17%
Bombyx          BGIBMGA003910-TA  0.0     64.20%
Drosophila      CG1486-PB         1e-109  34.67%
EBI UniRef50    UniRef50_D6WYJ0   1e-154  43.72%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WYJ0_TRICA
NCBI RefSeq     XP_974004.1       3e-155  43.72%  PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum]
NCBI nr blastp  gi|91089397       5e-154  43.72%  PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum]
NCBI nr blastx  gi|91089397       4e-149  43.51%  PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0019752  2.1e-78  carboxylic acid metabolic process
                 GO:0016831  2.1e-78  carboxy-lyase activity
                 GO:0030170  2.1e-78  pyridoxal phosphate binding
                 GO:0003824  1.1e-24  catalytic activity
KEGG pathway     ypi:YpsIP31758_2448  4e-10
                 K13745 (ddc)  maps-> Glycine, serine and threonine metabolism
InterPro domain  [64-661]   IPR002129  2.1e-78  Pyridoxal phosphate-dependent decarboxylase
                 [127-386]  IPR015421  1.1e-24  Pyridoxal phosphate-dependent transferase, major region, subdomain 1
                 [24-507]   IPR015424  1.6e-24  Pyridoxal phosphate-dependent transferase, major domain
Orthology group  MCL13438  Single-copy universal gene

Nucleotide sequence:

>DPOGS210836-TA
ATGGGAGATGCCCCGACTTCCGAAATGGACTCAAATAAAGTTAGCCTTGGAAGCGAAAATCCATCCGATGTGGACCGGCGACCGTTTGGAGGACTTGAATTCCAAGTTTCCGAAGTAGTAGAGAGGTTGGAAGCTGGTGTGAATGCTCAAGATGCCATGGAAGAAGAAAAAAAGCCAGAAGAACGAAAAATAAGCACGGGATTCTTCGAGCCAGAAAAAATGGATATGGATGAAATTTTGAAGGTCTTAGAACAACTAGTACTTCAAACTGATCCAAGTTGTGAAAGTTTGGAACCACCCTTATTGCCAACGGATTCTGTGACGCGAGCCGCAATACTTTCTCATAGTATTTCGGCATTATTCTCGAGGTTGGAGAGGAGCCACGCTGCCCGGCTAGGAACGCACATAGCTACTGAAACCACGCGATGGATGGCGCATTTATTTAGGTTGTCCGATTACGACGCGTTTTATCACCAAGAGCAGCTCGAGGGTCTGGTCAGAGTCACTCGGATGCTGTTACACCACAAGTACCCGAGATATCTCGAAGATGGAGCTCTAGCTTTCTCGAACCGTCTCCCCTCCATCTACAGCTGTGTGGCGAGTCCTCTGGGCGTGGTCCAACACCTGTGCCGGCAGCTGGGTCTGCCGCTGGCCTGCGTCAGACCGGTGCCAGTAGATTCATCTGGTAAGGGTATGGATCTGAATGCTCTGGATCGTCTGTGCGAGGAGGACTCGGCTGGTCGTACTCCGCTGCTGGTGTTAGGCGAGGCGGGCGAGCCTCCCCTCGGCGGGGGATCCCCGCTGAAAGCGCTGGCTGAACTATGTGGACGTAGAGGGGTCCACTTACATGTGAGGGGACACGCCCTCGCCCTCCCCGCCGCCGGGGGATTTGAACAGACGTACAGTATAGCGGACTCGCTGACACTACAACCGGGTCCGTGGTTCGGAATACCGGGGCTGCCGACTGTTACGTTTTACAAAATACCGGAACCGCTGACGGCGAACGATCACTCCAAGGTTGTAAATTCGGCGAGTAGTCGCGAGGGTGCTCTGGCCGCACTGGGCGGTCTGACCGCTGGCGCGGCGCGGCTGGCAGCTCTGCCGCTGTGGACGGCGACGAGGGCGGCCGGCGCTAAGAGGCTCGCAAGACGGATAGACGCCGCCTTCCGCTCCGCCCGTACAGCGCGGGCCTTAATAGCCAGCACTGAGCTGAGATTGCTGAGCGATAGACCCGGCGGTGATGAACCTCCTAACATGGATATAGTCGATGCCATAAGTGAATCCTCAGCGTGCGTGTCCTTCCAATTCGCGCCAGCAGGGTGCGCTGACCGGCCACCCCCCTACTACGATAAACTCAACTCGTGGTTGGGGCAAGTGTTGCAACGAGAGGCTGATATGATCAATATAGAAATCTGCGAGACGGAGAGTTACGGCGTGGTGCTCCGCTACTGTCCGCTCGAGGGTATCTTTCTGGAGGAGGACCGTCTGTCGGAGTGGGCGGCCGTGTTAGACGCTCAGCTGCACGTGCTCACCGCTACGGTCGCGCTACGAGAACCCTTCCAGAAGACGCTACAGACACATCCCTGTCTACGACTTGTACATGTACCGGGATGGGCTGGTCTGGGAGGAGTTCGTTACGTGCCACCCGGTTGGGAGAACGCTCCTCTTGAGGAATTGAACTCCTTGAATAGACAGCTAGTGGAGACATTGAGGGCTACCGACGGAGCCTTCTCGTGTGGGGACGGAGAAGACGGTATGGCATGTGTCAGGTTCGGTATGGTCACCGCTGACACAGACGTGGATGAATTGTTGGATCTGGTGTTGTCAGCGGGCAAGGACGTGGAGGAGAACTCCAAGGCTCTCACTGATATGACCGAGGTGTTGAAAAAAGGTATATCAGCGGCTCAAGAAGAACTGAATCGTTCTGCGTGGCAGGAGGGGCTGCTGCGTCGTGTGCCGGTAGTGGGTCGGGTCGTGTCGTGGTGGGCGCCGCCTCAGCC
CTGCCCCGGCCGCCGGCTACTGTTGACCCACGGCACCCTGCAGGCGACTGATGATATCTACCGATTCGTTCAGAAGAAAGACAAAGAGGAACCAGCCCGCGCTCACTCCCCAACGAGACAGAACACGGTTCCATAA

Protein sequence:

>DPOGS210836-PA
MGDAPTSEMDSNKVSLGSENPSDVDRRPFGGLEFQVSEVVERLEAGVNAQDAMEEEKKPEERKISTGFFEPEKMDMDEILKVLEQLVLQTDPSCESLEPPLLPTDSVTRAAILSHSISALFSRLERSHAARLGTHIATETTRWMAHLFRLSDYDAFYHQEQLEGLVRVTRMLLHHKYPRYLEDGALAFSNRLPSIYSCVASPLGVVQHLCRQLGLPLACVRPVPVDSSGKGMDLNALDRLCEEDSAGRTPLLVLGEAGEPPLGGGSPLKALAELCGRRGVHLHVRGHALALPAAGGFEQTYSIADSLTLQPGPWFGIPGLPTVTFYKIPEPLTANDHSKVVNSASSREGALAALGGLTAGAARLAALPLWTATRAAGAKRLARRIDAAFRSARTARALIASTELRLLSDRPGGDEPPNMDIVDAISESSACVSFQFAPAGCADRPPPYYDKLNSWLGQVLQREADMINIEICETESYGVVLRYCPLEGIFLEEDRLSEWAAVLDAQLHVLTATVALREPFQKTLQTHPCLRLVHVPGWAGLGGVRYVPPGWENAPLEELNSLNRQLVETLRATDGAFSCGDGEDGMACVRFGMVTADTDVDELLDLVLSAGKDVEENSKALTDMTEVLKKGISAAQEELNRSAWQEGLLRRVPVVGRVVSWWAPPQPCPGRRLLLTHGTLQATDDIYRFVQKKDKEEPARAHSPTRQNTVP-