DPGLEAN08224 in OGS1.0

New model in OGS2.0DPOGS208120 
Genomic Positionscaffold61:- 69258-72294
See gene structure
CDS Length2001
Paired RNAseq reads  1143
Single RNAseq reads  2917
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006772 (2e-11)
Best Drosophila hit  CG8545 (3e-172)
Best Human hitputative ribosomal RNA methyltransferase NOP2 (1e-156)
Best NR hit (blastp)  GF12515 [Drosophila ananassae] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG8545 CG8545-PA [Tribolium castaneum] (2e-177)
GeneOntology terms


  
GO:0005730 nucleolus
GO:0006364 rRNA processing
GO:0003723 RNA binding
GO:0008757 S-adenosylmethionine-dependent methyltransferase activity
InterPro families



  
IPR001678 Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
IPR023267 RNA (C5-cytosine) methyltransferase
IPR023273 RNA (C5-cytosine) methyltransferase, NOP2
IPR018314 Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p, conserved site
IPR011023 Nop2p
Orthology groupMCL13688

Nucleotide sequence:

ATGGGTCGTAAAGCTAAATTTGATGAATCAGTAAAGATTAAAAAAGGGCCGGGTAGGAAA
GCGAGGAAGCAACCGGATCCCGTATTCAAAAAAGGACTAATTGATGATGATAAAGAAGAA
AAGAAGTTGAGCCATAGACAAAAACAGAGAGCTGCTAGGAGGCTTAAAAAAAAGAAAGAA
CTTGTGGAAAAGAAGAAAGCTTTGAAAGAAGCAAAGAAGAATGTTGCAAAGGTAAATGAG
AAAGTAGTTGAGGACAAGTCCGAAGATGAATCTGAAGATTCAGAACAAGAAAATGTGGAA
GGTTTTACAGATGACAACAAAGAGTGGCTTAAACCTAAACAGAAGAGTAAATCCAAAGAA
TACAATTCTGAAGATGATGAGAATGACTCTGGAAGTGAAGTAGAAGATGAAGATGTTGAG
GAAGCACCAAAAAAATCTGGGAAAGAAAAATATAAGGTCGGCAAACTTGATGATTTATTT
GTGGACACAGATGAGGAACAGGATAATGATCCTGATGAAGAGGATGATGCTGATAATGAG
AATAAAATGGACTATAACTCAGACTCTAGTGATGAAGAAAAAGATGATGATGATGATGAT
GATGATGACGATGATGACATGTTACCAATAGAAAAAGCTAACATCAAACTTAAAAAGAAA
CAAAAGATCGACAAGAAGCTTGCTGACGATGAATTGCAACTGAACATTTCTAAACAAGAT
GTATTTGCATTCCCATCTGAAGAAGAACTGCAAAATCCAACGAGTTTACAGGACATCCAT
CAACGGATTAAAGATGTTGTAACGGTTTTAAGTGATTTTAACCGTTTGAAGGATCAAGAA
AGATCGAGGTGTGAATACACTGAGCTATTGATGAAGGACTTATGTATGTATTACAGTTAT
AATGAATTTCTCATGGAAGTTCTCATGCAAATATTTCCAGTACAGGAATTAGTGGAATTT
CTTGAAGCAAGTGAAGTAGCTCGCCCATTGACTATTAGGACTAACAGCTTGAAAACAAGA
AGAAGGGATTTAGCTCAAGCCCTTATTAACAGAGGAGTTAATTTAGATCCGGTTGGAAAG
TGGAGCAAAGTTGGTCTAGTAGTTTATAGTTCTACAGTACCAATTGGTGCTACTCCGGAA
TATTTGGCTGGCCATTACATTTTACAAGGGGCATCTAGTTTCTTGCCAGTAATGGCTTTA
GCGCCACAAGAGAATGAGAGAATATTAGACATGTGTGCGGCCCCTGGTGGTAAAGCATCT
CATATAGCTGCCATCATGAAAAATACAGGTGCCTTATTTGCTAATGATGCCAATAAAGAT
AGAACTAAAGCGATTGTCGGTAACTTCCACAGGCTGGGAATTGTTAATGCTGTTATTTGT
AACTATGATGGACGTCAATTCCCAGAGGTTATTAAGGGCTTCGATAGAGTATTACTTGAT
GCCCCCTGTACAGGAACGGGGGTTATAGCTAAAGATCCTAGCGTGAAGACTTCGAAGGAA
CAAAAGGATATTCAGAGATGTTTCAATCTACAAAGACAGCTTTTACTGGCCGCTATAGAT
TGTTGTAATGCTAAATCCAGTACAGGCGGTTACATTGTTTATTCGACATGTTCTATATTA
CCAGAAGAAAATGAATGGGTTGTAAATTATGCATTGAAAAGAAGAAATGTCAAGTTAGTG
CCGACCGGTCTTGACTTTGGTACAGAGGGATTCGTTAAATACAGACATCATAGATTCCAT
CCATCATTAAAACTAACAAGAAGATTCTATCCCCATACACATAATATGGATGGTTTTTTT
GTGGCCAAATTTAAGAAGTTCTCTAATGTTATACCTGAGCCATTTAAGGATGAAGAAGAA
GACAATGAAGAAATAAAGGAAGATGCAGAACAGAATGGTGATGCGACAATGCAGAAGAAA
TCCAAAAAAAAAAATACACCAGTTAAGAGGCCAGCGGAATCTGTTGCTGTTGAACCACAG
AATAAAAAAAATAAAAACTAA

Protein sequence:

MGRKAKFDESVKIKKGPGRKARKQPDPVFKKGLIDDDKEEKKLSHRQKQRAARRLKKKKE
LVEKKKALKEAKKNVAKVNEKVVEDKSEDESEDSEQENVEGFTDDNKEWLKPKQKSKSKE
YNSEDDENDSGSEVEDEDVEEAPKKSGKEKYKVGKLDDLFVDTDEEQDNDPDEEDDADNE
NKMDYNSDSSDEEKDDDDDDDDDDDDMLPIEKANIKLKKKQKIDKKLADDELQLNISKQD
VFAFPSEEELQNPTSLQDIHQRIKDVVTVLSDFNRLKDQERSRCEYTELLMKDLCMYYSY
NEFLMEVLMQIFPVQELVEFLEASEVARPLTIRTNSLKTRRRDLAQALINRGVNLDPVGK
WSKVGLVVYSSTVPIGATPEYLAGHYILQGASSFLPVMALAPQENERILDMCAAPGGKAS
HIAAIMKNTGALFANDANKDRTKAIVGNFHRLGIVNAVICNYDGRQFPEVIKGFDRVLLD
APCTGTGVIAKDPSVKTSKEQKDIQRCFNLQRQLLLAAIDCCNAKSSTGGYIVYSTCSIL
PEENEWVVNYALKRRNVKLVPTGLDFGTEGFVKYRHHRFHPSLKLTRRFYPHTHNMDGFF
VAKFKKFSNVIPEPFKDEEEDNEEIKEDAEQNGDATMQKKSKKKNTPVKRPAESVAVEPQ
NKKNKN