Monarch geneset OGS2.0

DPOGS208120
TranscriptDPOGS208120-TA2001 bp
ProteinDPOGS208120-PA666 aa
Genomic positionDPSCF300154 - 86679-89715
RNAseq coverage418x (Rank: top 29%)
Annotation
HeliconiusHMEL0122720.076.66% 
BombyxBGIBMGA006772-TA0.074.49% 
DrosophilaCG8545-PA0.071.97% 
EBI UniRef50UniRef50_A1Z9090.071.97%CG8545 n=24 Tax=Eukaryota RepID=A1Z909_DROME
NCBI RefSeqXP_001958672.10.057.39%GF12515 [Drosophila ananassae]
NCBI nr blastpgi|3838512520.061.08%PREDICTED: putative ribosomal RNA methyltransferase nop2-like [Megachile rotundata]
NCBI nr blastxgi|220241260.059.53%CG8545 [Drosophila melanogaster]
Group
Gene OntologyGO:00087574.6e-100S-adenosylmethionine-dependent methyltransferase activity
GO:00037234.6e-100RNA binding
GO:00063644.6e-100rRNA processing
GO:00081682.3e-19methyltransferase activity
KEGG pathway 
InterPro domain[332-606] IPR0110234.6e-100Nop2p
[320-605] IPR0016782.4e-99Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
[380-394] IPR0232673.3e-28RNA (C5-cytosine) methyltransferase
[283-303] IPR0232732.3e-19RNA (C5-cytosine) methyltransferase, NOP2
Orthology groupMCL13547 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208120-TA
ATGGGTCGTAAAGCTAAATTTGATGAATCAGTAAAGATTAAAAAAGGGCCGGGTAGGAAAGCGAGGAAGCAACCGGATCCCGTATTCAAAAAAGGACTAATTGATGATGATAAAGAAGAAAAGAAGTTGAGCCATAGACAAAAACAGAGAGCTGCTAGGAGGCTTAAAAAAAAGAAAGAACTTGTGGAAAAGAAGAAAGCTTTGAAAGAAGCAAAGAAGAATGTTGCAAAGGTAAATGAGAAAGTAGTTGAGGACAAGTCCGAAGATGAATCTGAAGATTCAGAACAAGAAAATGTGGAAGGTTTTACAGATGACAACAAAGAGTGGCTTAAACCTAAACAGAAGAGTAAATCCAAAGAATACAATTCTGAAGATGATGAGAATGACTCTGGAAGTGAAGTAGAAGATGAAGATGTTGAGGAAGCACCAAAAAAATCTGGGAAAGAAAAATATAAGGTCGGCAAACTTGATGATTTATTTGTGGACACAGATGAGGAACAGGATAATGATCCTGATGAAGAGGATGATGCTGATAATGAGAATAAAATGGACTATAACTCAGACTCTAGTGATGAAGAAAAAGATGATGATGATGATGATGATGATGACGATGATGACATGTTACCAATAGAAAAAGCTAACATCAAACTTAAAAAGAAACAAAAGATCGACAAGAAGCTTGCTGACGATGAATTGCAACTGAACATTTCTAAACAAGATGTATTTGCATTCCCATCTGAAGAAGAACTGCAAAATCCAACGAGTTTACAGGACATCCATCAACGGATTAAAGATGTTGTAACGGTTTTAAGTGATTTTAACCGTTTGAAGGATCAAGAAAGATCGAGGTGTGAATACACTGAGCTATTGATGAAGGACTTATGTATGTATTACAGTTATAATGAATTTCTCATGGAAGTTCTCATGCAAATATTTCCAGTACAGGAATTAGTGGAATTTCTTGAAGCAAGTGAAGTAGCTCGCCCATTGACTATTAGGACTAACAGCTTGAAAACAAGAAGAAGGGATTTAGCTCAAGCCCTTATTAACAGAGGAGTTAATTTAGATCCGGTTGGAAAGTGGAGCAAAGTTGGTCTAGTAGTTTATAGTTCTACAGTACCAATTGGTGCTACTCCGGAATATTTGGCTGGCCATTACATTTTACAAGGGGCATCTAGTTTCTTGCCAGTAATGGCTTTAGCGCCACAAGAGAATGAGAGAATATTAGACATGTGTGCGGCCCCTGGTGGTAAAGCATCTCATATAGCTGCCATCATGAAAAATACAGGTGCCTTATTTGCTAATGATGCCAATAAAGATAGAACTAAAGCGATTGTCGGTAACTTCCACAGGCTGGGAATTGTTAATGCTGTTATTTGTAACTATGATGGACGTCAATTCCCAGAGGTTATTAAGGGCTTCGATAGAGTATTACTTGATGCCCCCTGTACAGGAACGGGGGTTATAGCTAAAGATCCTAGCGTGAAGACTTCGAAGGAACAAAAGGATATTCAGAGATGTTTCAATCTACAAAGACAGCTTTTACTGGCCGCTATAGATTGTTGTAATGCTAAATCCAGTACAGGCGGTTACATTGTTTATTCGACATGTTCTATATTACCAGAAGAAAATGAATGGGTTGTAAATTATGCATTGAAAAGAAGAAATGTCAAGTTAGTGCCGACCGGTCTTGACTTTGGTACAGAGGGATTCGTTAAATACAGACATCATAGATTCCATCCATCATTAAAACTAACAAGAAGATTCTATCCCCATACACATAATATGGATGGTTTTTTTGTGGCCAAATTTAAGAAGTTCTCTAATGTTATACCTGAGCCATTTAAGGATGAAGAAGAAGACAATGAAGAAATAAAGGAAGATGCAGAACAGAATGGTGATGCGACAATGCAGAAGAAATCCAAAAAAAAAAATACACCAGTTAAGAGGCCAGCGGAATCTGTTGCTGTTGAACCACAGAATAAAAAAAATAAAAACTAA

Protein sequence:

>DPOGS208120-PA
MGRKAKFDESVKIKKGPGRKARKQPDPVFKKGLIDDDKEEKKLSHRQKQRAARRLKKKKELVEKKKALKEAKKNVAKVNEKVVEDKSEDESEDSEQENVEGFTDDNKEWLKPKQKSKSKEYNSEDDENDSGSEVEDEDVEEAPKKSGKEKYKVGKLDDLFVDTDEEQDNDPDEEDDADNENKMDYNSDSSDEEKDDDDDDDDDDDDMLPIEKANIKLKKKQKIDKKLADDELQLNISKQDVFAFPSEEELQNPTSLQDIHQRIKDVVTVLSDFNRLKDQERSRCEYTELLMKDLCMYYSYNEFLMEVLMQIFPVQELVEFLEASEVARPLTIRTNSLKTRRRDLAQALINRGVNLDPVGKWSKVGLVVYSSTVPIGATPEYLAGHYILQGASSFLPVMALAPQENERILDMCAAPGGKASHIAAIMKNTGALFANDANKDRTKAIVGNFHRLGIVNAVICNYDGRQFPEVIKGFDRVLLDAPCTGTGVIAKDPSVKTSKEQKDIQRCFNLQRQLLLAAIDCCNAKSSTGGYIVYSTCSILPEENEWVVNYALKRRNVKLVPTGLDFGTEGFVKYRHHRFHPSLKLTRRFYPHTHNMDGFFVAKFKKFSNVIPEPFKDEEEDNEEIKEDAEQNGDATMQKKSKKKNTPVKRPAESVAVEPQNKKNKN-