Monarch geneset OGS2.0

DPOGS201008
TranscriptDPOGS201008-TA1143 bp
ProteinDPOGS201008-PA380 aa
Genomic positionDPSCF300147 + 127669-131022
RNAseq coverage324x (Rank: top 35%)
Annotation
HeliconiusHMEL0137210.095.01% 
BombyxBGIBMGA009097-TA0.088.37% 
DrosophilaCG7818-PA2e-15069.63% 
EBI UniRef50UniRef50_Q9VLP73e-14869.63%Methyltransferase-like protein 14 homolog n=58 Tax=Eumetazoa RepID=MTL14_DROME
NCBI RefSeqXP_974982.16e-16072.26%PREDICTED: similar to N6-adenosine-methyltransferase IME4 [Tribolium castaneum]
NCBI nr blastpgi|910897191e-15872.26%PREDICTED: similar to N6-adenosine-methyltransferase IME4 [Tribolium castaneum]
NCBI nr blastxgi|910897196e-16272.77%PREDICTED: similar to N6-adenosine-methyltransferase IME4 [Tribolium castaneum]
Group
Gene OntologyGO:00081681e-52methyltransferase activity
GO:00061391e-52nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
KEGG pathway 
InterPro domain[168-320] IPR0077571e-52MT-A70
Orthology groupMCL13857 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201008-TA
ATGAGTGAGAAGTTGAAAGAAATTCGTGAAAGATCACAAAAACGGAAGAAACTTCTAATACAGACGTTAGGTGTTTCAAGTGTGGGCGAGCTGCGTAATGCTCTTGGCGCCACCCTCGACGTAACACCTAAGAAACAAGCCTCCTCAGAGTACACCTCTGATGGCAATGAGAAACCTGTTCACAAGGAACCAGACAGCCTTGTGTACACTGATTCATCCACATTCTTGAAGGGAACACAGTCTTCAAATCCTCACAATGACTACTGCCAACATTTTGTGGACACGGGTCAAAGACCACAGAACTATATTCGGGATGTAGGACTCGCTGATCGGTTTGAAGAGTATCCTAAACTCAGAGAACTGATCAAATTGAAGGATGAACTCATCGCCCGGACAGCAACACCACCTATGTACTTAAAATGTGATTTGAAGACATTTGATCTTAAGACGATGGGTAGCAAGTTTGACGTGGTGCTAGTGGAACCTCCTCTGGGAGCCGGCTGGCGCTGGAGGGATGTCCTCGCCCTGGAGCTGCATCACCTGGCTCAGCCCCGGTCCTTTGTGTTCCTGTGGTGCGGCAGCTCGGAGGGTTTAGATATGGGAAGGGAATGTCTGAAGAAATGGGGATTTCGTCGTTGTGAGGACATATGCTGGATCAAGACTAATATCAAAAACCCAGGACACTCCAAGAACCTGGAGCACAATGCGGTGTTCCAGAGGACCAAGGAACACTGTCTCATGGGAATCAAAGGGACGGTGAGGAGGTCGGTGGACGGGGACTTCATACACGCCAACGTCGATATAGACCTCATCATATCTGAAGATCCCGAGTTCGGCAGCACGGAGAAGCCCATCGAAATATTCCACATCATGGAACACTTCTGTTTAGGACGAAGAAGAGTCCATCTGTTCGGTCGAGATTCCACGATCCGTCCGGGCTGGGTGACCATCGGCCACGAGCTCACCAACTCCAACTTCAACGCGGAGCTGTACGCGTCCTACTTCACGGAAGGTCGCGACACGACTGGCTGTACTGAACGGATAGAAGCCCTCCGACCGAAGAGCCCTCCCAACACCAGCAAGACACGACCCAGGGGCCGGGGGGGCTTCAGGGGTCGCGGGCGGGGCCGGGGGGCCCTCTAG

Protein sequence:

>DPOGS201008-PA
MSEKLKEIRERSQKRKKLLIQTLGVSSVGELRNALGATLDVTPKKQASSEYTSDGNEKPVHKEPDSLVYTDSSTFLKGTQSSNPHNDYCQHFVDTGQRPQNYIRDVGLADRFEEYPKLRELIKLKDELIARTATPPMYLKCDLKTFDLKTMGSKFDVVLVEPPLGAGWRWRDVLALELHHLAQPRSFVFLWCGSSEGLDMGRECLKKWGFRRCEDICWIKTNIKNPGHSKNLEHNAVFQRTKEHCLMGIKGTVRRSVDGDFIHANVDIDLIISEDPEFGSTEKPIEIFHIMEHFCLGRRRVHLFGRDSTIRPGWVTIGHELTNSNFNAELYASYFTEGRDTTGCTERIEALRPKSPPNTSKTRPRGRGGFRGRGRGRGAL-