Monarch geneset OGS2.0

DPOGS209337
TranscriptDPOGS209337-TA1209 bp
ProteinDPOGS209337-PA402 aa
Genomic positionDPSCF300194 + 188891-191057
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0030986e-14562.37% 
BombyxBGIBMGA004251-TA2e-6240.11% 
DrosophilaOdc1-PA4e-6437.03% 
EBI UniRef50UniRef50_B0WNL11e-6841.60%Ornithine decarboxylase n=5 Tax=Culicidae RepID=B0WNL1_CULQU
NCBI RefSeqXP_968571.21e-7539.95%PREDICTED: similar to ornithine decarboxylase [Tribolium castaneum]
NCBI nr blastpgi|2700108712e-7439.95%hypothetical protein TcasGA2_TC015912 [Tribolium castaneum]
NCBI nr blastxgi|1892390384e-7339.95%PREDICTED: similar to ornithine decarboxylase [Tribolium castaneum]
Group
Gene OntologyGO:00038242.7e-57catalytic activity
GO:00065966.7e-23polyamine biosynthetic process
KEGG pathwaytca:6569854e-75 
 K01581 (E4.1.1.17, ODC1, speC, speF)maps-> Glutathione metabolism
    Arginine and proline metabolism
InterPro domain[8-234] IPR0226442.7e-57Orn/DAP/Arg decarboxylase 2, N-terminal
[221-356] IPR0090063.3e-34Alanine racemase/group IV decarboxylase, C-terminal
[23-41] IPR0001832.8e-23Ornithine/DAP/Arg decarboxylase
[21-48] IPR0024336.7e-23Ornithine decarboxylase
[239-349] IPR0226433.2e-18Orn/DAP/Arg decarboxylase 2, C-terminal
Orthology groupMCL23341 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209337-TA
ATGGACAAGGCTTACCAGTTCATACAGCATTTCAGAAAAATGATGCCAAGGATCAAAATGTTTTATGGTGCCGTGAAATCGAACGATAGTTGCATGATGTTAAAGTTAGCCGCTGCACTCGGTGTTGGCTTCGATTGTGCTTCACCGGGTGAAATATATAGAATATTAAAACTCAAAGTATCGCCACAGAGCATAATTTTAGCAGTTCCGACAAAAACACCGGAGTGGATCTCATATGCAAGACAGTCCGGGATTAAACACGCTACTTTCGACAACATTTGCGAACTAAAAAAAATAAAACAGTATTGGCCAGAGGCAAACTTATTACTGCGTATAAGAGTTCACGCCGACAGTGTTTACGATTTAGGAAAAAAATTCGGTTGCGATTTTGAAACAGAAGCTATTGATTTACTAGAAGAAGCTGCAGCGCTCAATATCCGGGTGGTTGGGGTAGCTTTCCATGTAGGAAGTGGTTGTACATCACCAGACAGCTATGTGATGGGACTTCAACAGGCTAAGCTATTGTTCGAGCATGAGGCTAAGGCGGGGCGGAAAATGGAAATTGTTGATATTGGAGGAGGATATATGAGCGATAAAATCGATAGAATAGACGAGGTGTCTAAGCTCATAAATAAGGCTTTGGATGAACTCTTTCCTGATCCAGATATCCAAGTGATCTCTGAACCAGGACGTTACCTGTGCGATAAAGCATTTACTTTATATTGCAATATTAACACAGTGCGACAGGTACAAGTTGGTGACTCTTCTATAAATATGTTGTATTTGAATGACGGATTGTTTGGTTGTTTGCGGTACAATGAACCGTGGCACACCGTCAGGCGGTATAAGCAATGTAAGGAAGGCGAACAATGTGAACCAGTTATTTTATGGGGTCCATCATGTGACTCAGTGGATCGTGTGATGGAGAACGTTGAAGTTATGTTACCACCTTGTACTGTTGATGATTGGCTCGTTTTCCCCAACCAAGGAGCTTATACCATGACTCTCGCCTCCGATTTTTCTTCGTTACCAGAACCGCGTATCCGAAGTGTTATTTCACAGAAATTGTGTGAAAAAATAAAGGAGTCAGAAGTGTTTGATTCAGATGACTTCTTCAAACAAGACATTTCAGAACCACTTCCATCTAGCTTGCCACCACTTGTCACTCAATCGAAAGTTATGGATTCAAATTATACTTTGAAGGCTTAA

Protein sequence:

>DPOGS209337-PA
MDKAYQFIQHFRKMMPRIKMFYGAVKSNDSCMMLKLAAALGVGFDCASPGEIYRILKLKVSPQSIILAVPTKTPEWISYARQSGIKHATFDNICELKKIKQYWPEANLLLRIRVHADSVYDLGKKFGCDFETEAIDLLEEAAALNIRVVGVAFHVGSGCTSPDSYVMGLQQAKLLFEHEAKAGRKMEIVDIGGGYMSDKIDRIDEVSKLINKALDELFPDPDIQVISEPGRYLCDKAFTLYCNINTVRQVQVGDSSINMLYLNDGLFGCLRYNEPWHTVRRYKQCKEGEQCEPVILWGPSCDSVDRVMENVEVMLPPCTVDDWLVFPNQGAYTMTLASDFSSLPEPRIRSVISQKLCEKIKESEVFDSDDFFKQDISEPLPSSLPPLVTQSKVMDSNYTLKA-