Monarch geneset OGS2.0

DPOGS212929
TranscriptDPOGS212929-TA2052 bp
ProteinDPOGS212929-PA683 aa
Genomic positionDPSCF300057 - 834265-839146
RNAseq coverage301x (Rank: top 37%)
Annotation
HeliconiusHMEL0113110.066.86% 
BombyxBGIBMGA011862-TA0.069.44% 
DrosophilaArt7-PB2e-15741.83% 
EBI UniRef50UniRef50_E2C2971e-16245.45%Protein arginine N-methyltransferase 7 n=12 Tax=Coelomata RepID=E2C297_HARSA
NCBI RefSeqXP_967203.10.048.48%PREDICTED: similar to protein arginine n-methyltransferase [Tribolium castaneum]
NCBI nr blastpgi|2700031680.048.48%hypothetical protein TcasGA2_TC002133 [Tribolium castaneum]
NCBI nr blastxgi|2700031680.048.48%hypothetical protein TcasGA2_TC002133 [Tribolium castaneum]
Group
Gene OntologyGO:00081683.1e-225methyltransferase activity
GO:00064793.1e-225protein methylation
GO:00082763.9e-07protein methyltransferase activity
GO:00057373.9e-07cytoplasm
KEGG pathway 
InterPro domain[1-684] IPR0146443.1e-225Protein arginine N-methyltransferase
[56-143] IPR0104563.9e-07Ribosomal L11 methyltransferase, PrmA
Orthology groupMCL11746 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212929-TA
ATGCAGGTATTTACGCAAAAACGGAACCCAATCACTGGCGGTACGGAATGGGACGTGCAGCACGAAGATTATGATTATCACCAGGAGATAGCACGTTCAGCATTCGCTGATATGTTGCATGATACCGAGAGAAATAAGAAGTATCAAAGGGCCTTACAGTTGGCAATAGAAAAGATGCACAATCTTGGCAAGAAGGCTAATGTATTGGACATAGGCACTGGAACAGGACTACTATCGATCATGGCAGCAAGGGCTGGAGCTGACAGTATAATTGCATGTGAGGCCTTTAAACCAATGGCAGAATGCTGTATTAAAATCCTTGAATGTAATGGAGTGAAACACAAAATAAAAGTTATCCCTAAGAGGTCAACGGAACTCACTGTGGGAGAAGATGGAGATATGAAGGAAAAAGCTAATATATTAGTCACAGAGGTGTTTGATACTGAACTAATCGGAGAAGGTGCAATTTCTACATTTACACATGCACACAAATTTTTACTTGAAGAAGACTGTATTGTGATTCCTGACTCGGCAGTAATATATGCTCAAGTTGTTGAATGTCCTACACTTCAGAAATGGAATAGACTTAATGACTTAGCTGATGAAGATCTACAAATAATTTTACGAACACCTCAAAAAATGAAAGATTGCTCTGGATCGGCCGCAGTTCACGATCTCCAGCTCTCCCAGCTGCCTCGTTTGGCTTTTAAGGAGTTGTCTGATCAAATACCAATCTTTTATTACGATTGGTCTGGCCGTTCACCAATCGACAGAAACAGAACAGTGAAACAACAGTTTGTTGTCACAAACACTGGCATAGCACAAATAGTTTTTATGTGGTGGGAGTTGAATATGGATACCGAAGGTAAAATATGCCTAAGTTGCGCGCCATGGTGGGCGCATCCTGATGTGAATTTATCCTGTGAAAGACCACAAGATTCGATCCCTTGGCGCGACCATTGGATGCAAGCTGTGTATTACTTACCAAAAGACATAACCGTCCAAAAAGGCTCCGAGCTGTCCTTGGTGTCCTGTCAAGACGAATACTCCTTATGGTTTTATGTTGACGACGGCAAAACCACGTACAAAAATTATAAAAGACCTATCTGTGAATGTGGTATACATATGGCGTTGTCAAGAACACATGTGTCCTATCTCAACGATGGTAGGAGAAGTAAAAGGTTTTTGGCACAGTTGAGGGAAGATTTACACAAAGACGCTGTGGTTTTAGACTTAAATGGGAGCAGTTTTATTGGTCTGGCTTCAAGTAGAATGGGAGCTAAAACTGTATATGTACTGGAAAATTCCAATCTGAATATATCAATTTTAAATGACTACATAAAGGAAAATGATTTACAAAATGTACAGATCATATCGGAGGTCACAGATGATATTTTGAATAGTGTAACAAATGTCATAGGTGATCCTAATTTCAGCAGCGCTATATTGCCATGGGAGAATCTGAAGATTGCATATTTATTATACAAATATAGAAGTAAATTAAGGGATAGTGTTGTCATAATGCCAGATTGTTGTGAGTTCTGGGCTATGCCGGTTGAGTTTCAAGATTTACACAAAATAAGAATACCATTAAATAAATGCGAAGGTATCGATATGACGATATTCGATAATCTCGTTGAGAGTTCCCGTATCATAAGTGACGCGGATATCGAGGCGCAGCCTCTATGGGAGTACCCGTGTATATGTCGTGGGGAGGCGCGAAAAATTATTGAACTTAACATGTCAGATCTCAAACCCACGATCGCATCTGATGGGACTTATAAAGTTAGTGATAATGAAGATAATGATGCAAATCCAGTTAACGGTTTTGCCATATGGTCGGTGTGGAGAGTTGGTGGGAAACCTATTAGTAGTGGCCCAATAGATTGGCCCAACGTTGGTCAAAGAGTTGTTTGGGACATGCATACAAGACAGGCTGTTAAAATATTAAAAAAATCCAGCTCAATAAATTCTGCGGACAAGTGGAACTACCAAGTGGCTTGTGATTTAAAGACTGGAGAGTTCAGCCTTAACATCAATTGGGAACAATAA

Protein sequence:

>DPOGS212929-PA
MQVFTQKRNPITGGTEWDVQHEDYDYHQEIARSAFADMLHDTERNKKYQRALQLAIEKMHNLGKKANVLDIGTGTGLLSIMAARAGADSIIACEAFKPMAECCIKILECNGVKHKIKVIPKRSTELTVGEDGDMKEKANILVTEVFDTELIGEGAISTFTHAHKFLLEEDCIVIPDSAVIYAQVVECPTLQKWNRLNDLADEDLQIILRTPQKMKDCSGSAAVHDLQLSQLPRLAFKELSDQIPIFYYDWSGRSPIDRNRTVKQQFVVTNTGIAQIVFMWWELNMDTEGKICLSCAPWWAHPDVNLSCERPQDSIPWRDHWMQAVYYLPKDITVQKGSELSLVSCQDEYSLWFYVDDGKTTYKNYKRPICECGIHMALSRTHVSYLNDGRRSKRFLAQLREDLHKDAVVLDLNGSSFIGLASSRMGAKTVYVLENSNLNISILNDYIKENDLQNVQIISEVTDDILNSVTNVIGDPNFSSAILPWENLKIAYLLYKYRSKLRDSVVIMPDCCEFWAMPVEFQDLHKIRIPLNKCEGIDMTIFDNLVESSRIISDADIEAQPLWEYPCICRGEARKIIELNMSDLKPTIASDGTYKVSDNEDNDANPVNGFAIWSVWRVGGKPISSGPIDWPNVGQRVVWDMHTRQAVKILKKSSSINSADKWNYQVACDLKTGEFSLNINWEQ-