Monarch geneset OGS2.0

DPOGS209335
TranscriptDPOGS209335-TA1323 bp
ProteinDPOGS209335-PA440 aa
Genomic positionDPSCF300194 + 175284-178233
RNAseq coverage286x (Rank: top 38%)
Annotation
HeliconiusHMEL0030984e-14059.89% 
BombyxBGIBMGA004251-TA9e-6837.84% 
DrosophilaOdc1-PA4e-6234.58% 
EBI UniRef50UniRef50_F4WYF59e-7237.19%Ornithine decarboxylase n=9 Tax=Neoptera RepID=F4WYF5_ACREC
NCBI RefSeqXP_968571.25e-8039.45%PREDICTED: similar to ornithine decarboxylase [Tribolium castaneum]
NCBI nr blastpgi|2700108719e-7939.45%hypothetical protein TcasGA2_TC015912 [Tribolium castaneum]
NCBI nr blastxgi|1892390382e-7739.65%PREDICTED: similar to ornithine decarboxylase [Tribolium castaneum]
Group
Gene OntologyGO:00038248.6e-61catalytic activity
GO:00065964.5e-30polyamine biosynthetic process
KEGG pathwaytca:6569851e-79 
 K01581 (E4.1.1.17, ODC1, speC, speF)maps-> Glutathione metabolism
    Arginine and proline metabolism
InterPro domain[45-272] IPR0226448.6e-61Orn/DAP/Arg decarboxylase 2, N-terminal
[258-398] IPR0090064.5e-30Alanine racemase/group IV decarboxylase, C-terminal
[33-57] IPR0024334.5e-30Ornithine decarboxylase
[61-79] IPR0001838.4e-26Ornithine/DAP/Arg decarboxylase
[278-387] IPR0226431.1e-14Orn/DAP/Arg decarboxylase 2, C-terminal
Orthology groupMCL23341 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209335-TA
ATGGTTATTGAAAAGAGGGTTTCGAAGGCATTAGTATTGGAAAACAATACTGCCGAGGATGTCTCCCGAGCTATGATAGAAGGTGGACTTCAGAAGGATCCTTTTTATATATTTGATATGGACAAGGCTTATCAACGCGTACAATATTTTAAAAATATGATGCCAAGGATAAAAATATTTTATGCGGTCAAATCAAATGATAGTGACTTTATGTTAAAGCTAGCAGCCTCACTTGGCCTTGGCTTCGATTGTGCTTCACCGGGTGAAATACATAAAGTATTGAAACTCAAAGTATCGCCACTGAGTATCGTTTTTGCAATGCCAACAAAAAGTCCAGAGTGGATGTTATATGCAAGACAATGCGGGATTAAACACACCACTTTTGACAATTTGTGCGAACTAAATAAAATAAAACAATTCTGGCCCGATGCAAAATTGTTACTGCGGATAAGAGTTCATAGCGACAGTGTTTACGATTTAGGGAAGAAGTTCGGTTGCGATTTTGAAACAGAAGCTATTGATTTACTAGAAGAAGCTGCTGCACTCAATATTACAGTGGTTGGGGTAGCATTCCACGTAGGAAGTGGTTGTTCATCACCGGACAGCTATGTCTTAGGACTTCAACAAGTTAAACAACTGTTCGAGCATGAGGATAAGGCGGGACGAAAAATGAAAATTGTTGATATCGGAGGCGGATTCTTGAGTGATAAAACTGATAGGATTGACAAGGTATCCAAACTGGTAAACAATGCTATAGAAGAATTATTCCCGGACCCAGAGGTCCAAGTGATCGCTGAGCCCGGACGGTATCTGTGCGATAACTCATTTACTTTATATTGCAATATTAACACAGTGCGACCGGTACAAGTTGGTGACTCTTCTATTAATATGCTGTATTTAAATGACGGAGTGTTTGGTTGTTTGAGGTACAATGAACCATGGCACACCGTCAAACGGTTTCGGGTACACTCTGAGGGCGAACAATTAAAACCTACTGTATTATGGGGTCCAACCTGTCATCCAATTGATCGCGTACTTGACAACCTTGTTATAATGTTGCCAGCTTGTACGATTTACGATTGGCTCGTGTTTCCGAGTAGAGGGGCTTATAGCATGACTATGGCGTCTAGGTTTTCCACTTTACCAGAACCGCATATACGAAATGTTATATCACAAGAATTGTTTAACATCCTAAAGGATTCTAAAGTTTTGGGTCTTGATGACTTCCTGGAACAGAACATCGCCACGCCACTTCCACCCACTTTGCCGTCAATAATTGTTCACTCCAAAATATTGCAAACGAATTACACTTTGGCGGTTTGA

Protein sequence:

>DPOGS209335-PA
MVIEKRVSKALVLENNTAEDVSRAMIEGGLQKDPFYIFDMDKAYQRVQYFKNMMPRIKIFYAVKSNDSDFMLKLAASLGLGFDCASPGEIHKVLKLKVSPLSIVFAMPTKSPEWMLYARQCGIKHTTFDNLCELNKIKQFWPDAKLLLRIRVHSDSVYDLGKKFGCDFETEAIDLLEEAAALNITVVGVAFHVGSGCSSPDSYVLGLQQVKQLFEHEDKAGRKMKIVDIGGGFLSDKTDRIDKVSKLVNNAIEELFPDPEVQVIAEPGRYLCDNSFTLYCNINTVRPVQVGDSSINMLYLNDGVFGCLRYNEPWHTVKRFRVHSEGEQLKPTVLWGPTCHPIDRVLDNLVIMLPACTIYDWLVFPSRGAYSMTMASRFSTLPEPHIRNVISQELFNILKDSKVLGLDDFLEQNIATPLPPTLPSIIVHSKILQTNYTLAV-