Monarch geneset OGS2.0

DPOGS209507
TranscriptDPOGS209507-TA1311 bp
ProteinDPOGS209507-PA436 aa
Genomic positionDPSCF300127 + 9350-26990
RNAseq coverage1882x (Rank: top 7%)
Annotation
HeliconiusHMEL0160132e-12880.14% 
BombyxBGIBMGA007423-TA2e-11975.00% 
DrosophilaSamDC-PA3e-7149.09% 
EBI UniRef50UniRef50_B4L5R41e-7049.11%S-adenosylmethionine decarboxylase proenzyme n=3 Tax=Endopterygota RepID=B4L5R4_DROMO
NCBI RefSeqXP_001658186.13e-8551.60%s-adenosylmethionine decarboxylase [Aedes aegypti]
NCBI nr blastpgi|1582984828e-8354.87%AGAP009619-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|2700115468e-8153.90%hypothetical protein TcasGA2_TC005583 [Tribolium castaneum]
Group
Gene OntologyGO:00040141.1e-123adenosylmethionine decarboxylase activity
GO:00065971.3e-120spermine biosynthetic process
GO:00082951.3e-120spermidine biosynthetic process
KEGG pathwayaag:AaeL_AAEL0011761e-84 
 K01611 (E4.1.1.50, speD)maps-> Arginine and proline metabolism
    Cysteine and methionine metabolism
InterPro domain[4-437] IPR0181671.1e-123S-adenosylmethionine decarboxylase subgroup
[16-396] IPR0019851.3e-120S-adenosylmethionine decarboxylase
[162-432] IPR0160671.8e-83S-adenosylmethionine decarboxylase, core
Orthology groupMCL12188 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209507-TA
ATGGCTGGTACGGAAATAATATCTAATAGTGAGAGTAATGATCAATTTTTTGAGGGTGTTGAAAAATTGATAGAAATTTGGTTTACGCCGGTGAAACACGCGGATCTAAGGAAAATCAGTCGTCAACAATGGGAGAATGTCCTGAAGATAGTCCGCTGTGAGATCATATCTTTCACGCAGAGCGAGCAAGTCGACGCCTATGTCTTAAGCGAGAGCAGCATGTTCGTGTCTCGCAGACGCTGGATATTGAAGACCTGCGGTAAGACTACCCCCCTGCGATGTGTGCGATCTGTTCTCCAGCTTGCGATGGAAACGGCTGGTTATTCGAGGGTGGAGAACGTGTTCTACTCCAGGCGGGAGTTCGCCCGACCAGCCGAACAGCTCAAGCCGCATGACAACTTCGATTCAGAGATCGTGATATCGATCGGAGCCTGCGCGCTGAGATGGAATATGCTTACAGTGTACACAACAGCTCTTGATATCGAGAGCAGCATGTTCGTGTCTCGCAGACGCTGGATATTGAAGACCTGCGGTAAGACTACCCCCCTGCGATGTGTGCGATCTGTTCTCCAGCTTGCGATGGAAACGGCTGGTTATTCGAGGGTGGAGAACGTGTTCTACTCCAGGCGGGAGTTCGCCCGACCAGCCGAACAGCTCAAGCCGCATGACAACTTCGATTCAGAGGTGGAGCTACTGGATTCATTCTTCGGGGACGGTCGCGCGTACATCATGGGCCCCGAGGGAGATTGCTGGTATCTGTACACACTGCTGCCGCTAGAAGGTACAGTGGCAGCCCTGGAGAAGGAGCAGCATCACCAGTCGTCGGAGCCGGACCAGACCATAGAGATCCTCATGTCGGACCTGGACCCCGCGGTCATGGACATCTTCACTAGAGCCACTTCCGCCACCGCCGCTGATGCAACCAGGGCTTCTGGTATAGACAAGTTGATCCCCGGTATGGTGATTGATGACTACCTGTTTGATCCGTGTGGCTACTCCATGAACGGGGTCGCTAAAGATGGCTGCTACATGACGATCCACATAACTCCTGAGCCGTCTTGCTCGTACGTGTCCTTCGAGTCTAACGTGTGTCTGCCGCCGGACGCCCTGCTGGCTCGAGTCCTGGCGGCCTTCAGACCAAACAAGTTCGTGGTCACCGTGTTCGCTACGCCGGACTCCCCGGCGGCGGCGGCCACTCGCCAGCTGAAGCAGTTCCCGTCAGTGGGCGGGTTCCAGCAGAAGGAGGCCCAGCACTGTCGGTTCTCAGGCTACGAGCTGCAATACGCACTGTTCTCTAAGTTCCCGAGCTGA

Protein sequence:

>DPOGS209507-PA
MAGTEIISNSESNDQFFEGVEKLIEIWFTPVKHADLRKISRQQWENVLKIVRCEIISFTQSEQVDAYVLSESSMFVSRRRWILKTCGKTTPLRCVRSVLQLAMETAGYSRVENVFYSRREFARPAEQLKPHDNFDSEIVISIGACALRWNMLTVYTTALDIESSMFVSRRRWILKTCGKTTPLRCVRSVLQLAMETAGYSRVENVFYSRREFARPAEQLKPHDNFDSEVELLDSFFGDGRAYIMGPEGDCWYLYTLLPLEGTVAALEKEQHHQSSEPDQTIEILMSDLDPAVMDIFTRATSATAADATRASGIDKLIPGMVIDDYLFDPCGYSMNGVAKDGCYMTIHITPEPSCSYVSFESNVCLPPDALLARVLAAFRPNKFVVTVFATPDSPAAAATRQLKQFPSVGGFQQKEAQHCRFSGYELQYALFSKFPS-