Monarch geneset OGS2.0

DPOGS210179
TranscriptDPOGS210179-TA1599 bp
ProteinDPOGS210179-PA532 aa
Genomic positionDPSCF300393 + 91325-92923
RNAseq coverage526x (Rank: top 24%)
Annotation
HeliconiusHMEL0127540.097.56% 
BombyxBGIBMGA014193-TA0.094.36% 
DrosophilaArt4-PA0.063.87% 
EBI UniRef50UniRef50_Q7Q2B70.064.84%Histone-arginine methyltransferase CARMER n=37 Tax=Coelomata RepID=CARM1_ANOGA
NCBI RefSeqXP_394933.30.066.16%PREDICTED: similar to Arginine methyltransferase 4 CG5358-PA [Apis mellifera]
NCBI nr blastpgi|3071918680.066.16%Probable histone-arginine methyltransferase CARMER [Harpegnathos saltator]
NCBI nr blastxgi|3071918680.066.16%Probable histone-arginine methyltransferase CARMER [Harpegnathos saltator]
Group
Gene OntologyGO:00081681.1e-16methyltransferase activity
GO:00057371.1e-16cytoplasm
KEGG pathway 
InterPro domain[137-408] IPR0078571.1e-16Skb1 methyltransferase
Orthology groupMCL14242 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210179-TA
ATGGCGAACACGTTTCGCGCGGTGACGGTTTCCGCCGTCAACAATGACGGTGCTTTAACTCCGCGATTTAATTTCCCCACGAACGTAAACGTGGATTACGACCCGCAAGGTCTTAGTGTTAAAGTGATACGTGCTGATCCCGTCTTGGCTCAAGAGGTGCTTGAATTCCCCGTCCACAGTCAGAGCGAGTGTTCGCGAGTCGCCGGCCAATCTTACATTTTTACAATAGATAATGAAACACTGTTTTTCAAATTCTCCTCTGATGTTGACTGTCAGAACTTCCATTTGCTGGTTAGTAAGATTAAAGCTGGTCGATCATCATCTGTGTTTACGGTGCGCACGGAGGATTCCTCGGCGATGCAGTATTTCCAGTTTTATGGATACCTAAGTCAGCAACAGAACATGATGCAGGACTATGTGAGAACAAGTACATACCAGCGTGCCATTTTATCGAACATCAACGATTTCAAAAACAAAGTAGTGTTGGACGTCGGCGCCGGTTCCGGTATTTTGTCATTTTTCGCGGCACAGGCTGGTGCTCGCAAAGTGTATGCCGTAGAAGCCAGCAACATGGCGCATCATGCACAGGCTCTTGTCAGAGCGAACGGGCTTCACGATCGTATATCTGTTGTTGCTGGTAAAATAGAAGAAATCGAGTTACCAGAGAGCGTCGATGTAATCATATCCGAGCCAATGGGTTACATGCTTTATAATGAGCGTATGTTGGAGACTTATTTACACGCAAAGAAGTGGCTAAAACCCAACGGTAACATGTATCCTACCAGAGGAGACTTACACATAGCGCCATTCACAGATGACGCTCTATTCATGGAACAATATAACAAAGCCAACTTCTGGTATCAGGCATGTTTCCATGGAGTTGATCTGAGTGCATTACGCACTGCTGCTATGAAGGAGTATTTCAGACAGCCCATTGTAGATACTTTTGATGTACGCATTTGTATGTCGAGATCTGTGAGGCATGTAGTCGACTTTTTGAATGCCAATGAAACCGATTTACATCGTATAGAGGTGCCATTTAGATTTGAGTTGACACAGTCTGGAACGTGCCATGGACTGGCTTTCTGGTTTGATGTTCTATTTGCTGGTAGTACACAGCACATCTGGTTGTCTACTTCACCGACAGAACCACTAACACATTGGTACCAAGTCAGATGCTTACTCGAGACACCGATTTTTGCCAAACAAGGACAGGCATTGACAGGTCGAGTGCTTCTTTTAGCAAATAAACGTCAAAGCTATGATGTTACAATGGAGATTAATCTAGAAGGAACAAATATATCATCTTCCAACACATTAGACTTAAAGAACCCATACTTCAGATATACCGGTGCACCTGCGGCACCACCACCAGGAGTAAATACCACATCTCCAAGCGAATCTTACTGGAACTCCATCGAATCTGCCGGCTCATTGAGCAATGGGCAAGGTGTTATCCTTACAGATGGTACACAACAGTATTGCTCAACTCCAGATCAAACAATTGCTTATGGGGCTCATCCGAACATGATCAAAACCGTCATGTTGCAAGAGGAGTTCATCAAGAGGATCGGGATAAGCCAAAATGGCGATATTTAA

Protein sequence:

>DPOGS210179-PA
MANTFRAVTVSAVNNDGALTPRFNFPTNVNVDYDPQGLSVKVIRADPVLAQEVLEFPVHSQSECSRVAGQSYIFTIDNETLFFKFSSDVDCQNFHLLVSKIKAGRSSSVFTVRTEDSSAMQYFQFYGYLSQQQNMMQDYVRTSTYQRAILSNINDFKNKVVLDVGAGSGILSFFAAQAGARKVYAVEASNMAHHAQALVRANGLHDRISVVAGKIEEIELPESVDVIISEPMGYMLYNERMLETYLHAKKWLKPNGNMYPTRGDLHIAPFTDDALFMEQYNKANFWYQACFHGVDLSALRTAAMKEYFRQPIVDTFDVRICMSRSVRHVVDFLNANETDLHRIEVPFRFELTQSGTCHGLAFWFDVLFAGSTQHIWLSTSPTEPLTHWYQVRCLLETPIFAKQGQALTGRVLLLANKRQSYDVTMEINLEGTNISSSNTLDLKNPYFRYTGAPAAPPPGVNTTSPSESYWNSIESAGSLSNGQGVILTDGTQQYCSTPDQTIAYGAHPNMIKTVMLQEEFIKRIGISQNGDI-