Monarch geneset OGS2.0

DPOGS214709
TranscriptDPOGS214709-TA918 bp
ProteinDPOGS214709-PA305 aa
Genomic positionDPSCF300022 - 728650-729567
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0057531e-16688.52% 
BombyxBGIBMGA004772-TA4e-15884.59% 
DrosophilaCG14683-PA1e-9353.59% 
EBI UniRef50UniRef50_UPI0002246F6E1e-9756.68%UPI0002246F6E related cluster n=1 Tax=unknown RepID=UPI0002246F6E
NCBI RefSeqXP_971437.19e-11364.05%PREDICTED: similar to s-adenosyl-methyl transferase mraw [Tribolium castaneum]
NCBI nr blastpgi|910947472e-11164.05%PREDICTED: similar to s-adenosyl-methyl transferase mraw [Tribolium castaneum]
NCBI nr blastxgi|910947472e-10664.05%PREDICTED: similar to s-adenosyl-methyl transferase mraw [Tribolium castaneum]
Group
Gene OntologyGO:00081683.4e-130methyltransferase activity
KEGG pathway 
InterPro domain[1-306] IPR0029033.4e-130S-adenosyl-L-methionine-dependent methyltransferase, MraW
[86-197] IPR0233971.8e-30S-adenosyl-L-methionine-dependent methyltransferase, MraW, recognition domain
Orthology groupMCL14976 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214709-TA
ATGACTTTTGGAGCAGGTGGACATTCAAGAAAAATACTCGAGTCAGCAAATTGTCAACTTATTACACTAGACAGAGATCCAACAGCATATGAGAAAGCAAGGAAACTGGCTGAGGAATATCCAAAACGGGTCATACCCCTAATAGGAAGATTTAGTGAACTGCCAGAGCAGTTAAAATCTATAGGTATTAAACAGTCCTCGATAGATGGAATCTTATTTGATTTTGGATGTTCTTCCATGCAATTAGATAATGGAGAAAGAGGTTTCTCAGTTAGTAAAAATGGCTTCCTAGATATGAGGATGGATGCAGGGAGAGATCCAAGCCAAATAACAGCAAGGGAAGTCCTAGCGACTGCCGATGAACATGAACTATACAAAATATTCAAAATATATGGTGAAGAAAAGAAAGCGGCAAAAATAGCTCAAACAATCATACAAGCGAGATACATGATCAAAAATATAGACACCACAAAAGAATTAGTTGACATTGTTGACTCCTGTTGTCCAGACGAAGTGAGGTTAGATAAATTACAACGACCACAATCCAATGCCACCAAAGTGTTCCAAGCTCTAAGAATTTTCGTAAACAATGAGTTAAATGAGATCAATTATGGCATGGTTTTATCAAAATATTATTTAAAATTAAATGGGAGGTTAGTCACCCTTTGCTTTCACTCTCTTGAAGATACTATTGTTAAACGTCACATAGCTGGCAACATTATCAATGAGACAGCGAACCCTGTACCTCTAAGATATCTTAGCCCCATGTTAGTACAAGATCAGGACACCATAAATCAATTCCTGGACTCTCCATGGAAAGCCATTAATAAACATGTAGAAGTACCGTCTGAAGATGAAGTAGAGAGAAATCCTAGAAGTAGGAGTGCCAGATTAAGGGCTGCCATCAAAATTAAATGA

Protein sequence:

>DPOGS214709-PA
MTFGAGGHSRKILESANCQLITLDRDPTAYEKARKLAEEYPKRVIPLIGRFSELPEQLKSIGIKQSSIDGILFDFGCSSMQLDNGERGFSVSKNGFLDMRMDAGRDPSQITAREVLATADEHELYKIFKIYGEEKKAAKIAQTIIQARYMIKNIDTTKELVDIVDSCCPDEVRLDKLQRPQSNATKVFQALRIFVNNELNEINYGMVLSKYYLKLNGRLVTLCFHSLEDTIVKRHIAGNIINETANPVPLRYLSPMLVQDQDTINQFLDSPWKAINKHVEVPSEDEVERNPRSRSARLRAAIKIK-