Monarch geneset OGS2.0

DPOGS204514
TranscriptDPOGS204514-TA1392 bp
ProteinDPOGS204514-PA463 aa
Genomic positionDPSCF300205 - 76404-81339
RNAseq coverage6608x (Rank: top 2%)
Annotation
HeliconiusHMEL0089760.092.22% 
BombyxBGIBMGA012515-TA4e-16488.54% 
DrosophilaSam-S-PC1e-13776.45% 
EBI UniRef50UniRef50_B4P1V00.076.39%S-adenosylmethionine synthase n=11 Tax=Coelomata RepID=B4P1V0_DROYA
NCBI RefSeqXP_002087374.10.076.39%GE16582 [Drosophila yakuba]
NCBI nr blastpgi|2700008310.081.72%hypothetical protein TcasGA2_TC011082 [Tribolium castaneum]
NCBI nr blastxgi|2700008310.081.72%hypothetical protein TcasGA2_TC011082 [Tribolium castaneum]
Group
Gene OntologyGO:00055240ATP binding
GO:00044780methionine adenosyltransferase activity
GO:00065560S-adenosylmethionine biosynthetic process
KEGG pathwaydya:Dyak_GE165820.0 
 K00789 (E2.5.1.6, metK)maps-> Selenoamino acid metabolism
    Cysteine and methionine metabolism
InterPro domain[8-461] IPR0021330S-adenosylmethionine synthetase
[319-456] IPR0226303.1e-76S-adenosylmethionine synthetase, C-terminal
[319-462] IPR0226361.5e-75S-adenosylmethionine synthetase superfamily
[195-317] IPR0226293.9e-46S-adenosylmethionine synthetase, central domain
[26-123] IPR0226289.7e-42S-adenosylmethionine synthetase, N-terminal
Orthology groupMCL11395 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204514-TA
ATGCCGGAAACATCAAAAATGAATGGATATGCAAAAACAAATGGTCACAGCTATGATATGGAAGATGGATCAGTGTTTTTATTTACATCAGAATCTGTTGGCGAGGGCCATCCAGACAAAATGTGTGATCAAATAAGTGATGCTATCCTTGACGCGCATTTGAGACAAGATCCCGACGCAAAAGTGGCCTGCGAAACTGTCACTAAGACGGGAATGATCTTGCTGTGCGGTGAAATAACATCCAAAGCAAACGTTGACTATCAAAATGTTGTCAGAGAAACTGTAAAACACATTGGATATGACGATTCTTCGAAAGGTTTTGATTGGCGTACACTAAATTTGTTAGTGGCTATTGAAGAGCAAAGTCCAAATATTCGTGAGGGGGTCTATCAGGACCGAGAGGAAATTGATATTGGGGCCGGCGATCAGGGCATGTTAGGTCATTCCGGCGAGTCTACTTTCCTGATAATAATCATAATGCATGTTGGGGCAGGATTTGATTACAAGACATGCAGTGTCATGCTTGCATTGGATCAGCAGTCACCAAATATTGCGGCTGGGGTGCATGAAAACAGAAACGATGAGGAAGTCGGTGCCGGAGACCAGGGTTTGATGTTTGGCTATGCGACTGATGAGACAGAAGAATGCATGCCGCTCACTGTGGTACTTGCGCATAGATTGAATCAGAAGATTGCTGAATTACGTCGCAACGGAGAATTCTGGTGGGCGAGACCTGATTCAAAGACACAGGTTACTTGCGAGTATGTGTTTGCTGGAGGTGCCACTGTTCCACAAAGGGTCCACACTGTGGTTGTATCACTACAACACTCTGAGAAGGTAACCTTGGAGACGTTGCGCGAAGAAATTAAGAGCAAAGTCATCAATGAAGTGATTCCAGCTCAGTACCTCGATGAGAGAACTGTCATACACATAAACCCTTGTGGACTTTTCATAATTGGTGGACCTCAGTCGGATGCCGGCCTGACGGGTCGTAAGATCATAGTGGACACGTACGGCGGTTGGGGCGCGCACGGCGGAGGAGCGTTCTCAGGGAAGGATTTCACTAAAGTGGACCGCTCAGCCGCCTACGCGGCTAGATGGGTCGCCAAATCCCTAGTACGAGCTGGGCTCTGCCGTCGCTGTATGGTGCAGGTGGCCTATGCTATTGGAGTCGCCGAGCCGCTATCTATAACCGTATTCGACTACGGCACCTCGCATAAGACACAGCAGGAACTACTTGCGATCGTACGAAAGAATTTCGACCTCCGACCAGGGAAAATAGTCAAGGAACTGAACCTCCGAGCTCCTATATATCAGAAGACCAGCACGTACGGCCACTTCGGGCGAGAGGGTTTCCCGTGGGAGAACCCCAAGCCGCTCGTCGTAGATTGA

Protein sequence:

>DPOGS204514-PA
MPETSKMNGYAKTNGHSYDMEDGSVFLFTSESVGEGHPDKMCDQISDAILDAHLRQDPDAKVACETVTKTGMILLCGEITSKANVDYQNVVRETVKHIGYDDSSKGFDWRTLNLLVAIEEQSPNIREGVYQDREEIDIGAGDQGMLGHSGESTFLIIIIMHVGAGFDYKTCSVMLALDQQSPNIAAGVHENRNDEEVGAGDQGLMFGYATDETEECMPLTVVLAHRLNQKIAELRRNGEFWWARPDSKTQVTCEYVFAGGATVPQRVHTVVVSLQHSEKVTLETLREEIKSKVINEVIPAQYLDERTVIHINPCGLFIIGGPQSDAGLTGRKIIVDTYGGWGAHGGGAFSGKDFTKVDRSAAYAARWVAKSLVRAGLCRRCMVQVAYAIGVAEPLSITVFDYGTSHKTQQELLAIVRKNFDLRPGKIVKELNLRAPIYQKTSTYGHFGREGFPWENPKPLVVD-