Monarch geneset OGS2.0

DPOGS207041
TranscriptDPOGS207041-TA831 bp
ProteinDPOGS207041-PA276 aa
Genomic positionDPSCF300001 + 1795816-1797202
RNAseq coverage200x (Rank: top 47%)
Annotation
HeliconiusHMEL0068578e-15087.68% 
BombyxBGIBMGA012982-TA3e-12681.89% 
DrosophilaCG10903-PA9e-11367.63% 
EBI UniRef50UniRef50_Q9CY214e-10061.37%Uncharacterized methyltransferase WBSCR22 n=190 Tax=root RepID=WBS22_MOUSE
NCBI RefSeqXP_972432.21e-12575.46%PREDICTED: similar to CG10903 CG10903-PA [Tribolium castaneum]
NCBI nr blastpgi|1892386602e-12475.46%PREDICTED: similar to CG10903 CG10903-PA [Tribolium castaneum]
NCBI nr blastxgi|1892386603e-11975.46%PREDICTED: similar to CG10903 CG10903-PA [Tribolium castaneum]
Group
Gene OntologyGO:00081522.2e-11metabolic process
GO:00081682.2e-11methyltransferase activity
KEGG pathwaymdo:1000186762e-105 
 K00599 (E2.1.1.-)maps-> Naphthalene and anthracene degradation
    Tyrosine metabolism
    Histidine metabolism
    Selenoamino acid metabolism
InterPro domain[201-275] IPR0222382.8e-18Uncharacterised protein family, methyltransferase, Williams-Beuren syndrome
[55-128] IPR0132162.2e-11Methyltransferase type 11
Orthology groupMCL15223 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207041-TA
ATGGCTCGAAGACCAGAACATCAAGCACCACCCGAAATTTTTTATAATGAAGATGAAGCGAGAAAGTATACACAAAACACTAGGATCATAGACATTCAGGGGCAGATGACTGAAAGATGCATAGAATTATTGATACTCCCTGAAGATACCCCATGTTTGCTGTTGGATATTGGATGTGGATCAGGCTTATCCGGAACTGTATTAGAAGAAAATGGACATATGTGGATTGGATTAGACATTTCCCCTGCCATGTTAGATGTAGCTTTAGAGAGGGAGACCGAGGGTGACCTCATTCTTTCAGACATGGGTCAAGGAGTGCCATTTAAGGCTGGCAGCTTTGATGGTGCAGTCTCCGTGTCTGCTATACAGTGGCTGTTTAATGCAGACAAAAAATCACACAATCCAGTCAAAAGACTTTATAATTTCTTCAGTTCTTTATATGCCTCACTGTCAAGGTCAGCAAGGGCAGTGTTTCAATTTTACCCTGAAAATGAGAGTCAGTTAGATCTGTTAACATCTCAAGCCATGAAGGCAGGTTTCTATGGAGGAGTTGTTGTTGATTACCCTAACTCGGCTAAGGCGAAGAAATTCTTCTTAGTGTTAATGACGGGAGGTGCAGCACCTTTGCCTCAAGCTCTTGGTACAGATGAAAGTAATAATTCATTACAAGTTAAATATGCTAAAAGGGAAGCCATGAGAGCTGCAAGAGGGAAACCTTTAAAAAATACTAAAGCTTGGTTACTAGAGAAAAAGGAAAGGAGAAGGAAACAGGGCAAGGATACGAAACCTGATACTAAATACACTGGAAGGAAGAGAAGTGGAAGATTCTGA

Protein sequence:

>DPOGS207041-PA
MARRPEHQAPPEIFYNEDEARKYTQNTRIIDIQGQMTERCIELLILPEDTPCLLLDIGCGSGLSGTVLEENGHMWIGLDISPAMLDVALERETEGDLILSDMGQGVPFKAGSFDGAVSVSAIQWLFNADKKSHNPVKRLYNFFSSLYASLSRSARAVFQFYPENESQLDLLTSQAMKAGFYGGVVVDYPNSAKAKKFFLVLMTGGAAPLPQALGTDESNNSLQVKYAKREAMRAARGKPLKNTKAWLLEKKERRRKQGKDTKPDTKYTGRKRSGRF-