New model in OGS2.0 | DPOGS208120  |
---|---|
Genomic Position | scaffold61:- 69258-72294 |
See gene structure | |
CDS Length | 2001 |
Paired RNAseq reads   | 1143 |
Single RNAseq reads   | 2917 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006772 (2e-11) |
Best Drosophila hit   | CG8545 (3e-172) |
Best Human hit | putative ribosomal RNA methyltransferase NOP2 (1e-156) |
Best NR hit (blastp)   | GF12515 [Drosophila ananassae] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG8545 CG8545-PA [Tribolium castaneum] (2e-177) |
GeneOntology terms    | GO:0005730 nucleolus GO:0006364 rRNA processing GO:0003723 RNA binding GO:0008757 S-adenosylmethionine-dependent methyltransferase activity |
InterPro families    | IPR001678 Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p IPR023267 RNA (C5-cytosine) methyltransferase IPR023273 RNA (C5-cytosine) methyltransferase, NOP2 IPR018314 Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p, conserved site IPR011023 Nop2p |
Orthology group | MCL13688 |
Nucleotide sequence:
ATGGGTCGTAAAGCTAAATTTGATGAATCAGTAAAGATTAAAAAAGGGCCGGGTAGGAAA
GCGAGGAAGCAACCGGATCCCGTATTCAAAAAAGGACTAATTGATGATGATAAAGAAGAA
AAGAAGTTGAGCCATAGACAAAAACAGAGAGCTGCTAGGAGGCTTAAAAAAAAGAAAGAA
CTTGTGGAAAAGAAGAAAGCTTTGAAAGAAGCAAAGAAGAATGTTGCAAAGGTAAATGAG
AAAGTAGTTGAGGACAAGTCCGAAGATGAATCTGAAGATTCAGAACAAGAAAATGTGGAA
GGTTTTACAGATGACAACAAAGAGTGGCTTAAACCTAAACAGAAGAGTAAATCCAAAGAA
TACAATTCTGAAGATGATGAGAATGACTCTGGAAGTGAAGTAGAAGATGAAGATGTTGAG
GAAGCACCAAAAAAATCTGGGAAAGAAAAATATAAGGTCGGCAAACTTGATGATTTATTT
GTGGACACAGATGAGGAACAGGATAATGATCCTGATGAAGAGGATGATGCTGATAATGAG
AATAAAATGGACTATAACTCAGACTCTAGTGATGAAGAAAAAGATGATGATGATGATGAT
GATGATGACGATGATGACATGTTACCAATAGAAAAAGCTAACATCAAACTTAAAAAGAAA
CAAAAGATCGACAAGAAGCTTGCTGACGATGAATTGCAACTGAACATTTCTAAACAAGAT
GTATTTGCATTCCCATCTGAAGAAGAACTGCAAAATCCAACGAGTTTACAGGACATCCAT
CAACGGATTAAAGATGTTGTAACGGTTTTAAGTGATTTTAACCGTTTGAAGGATCAAGAA
AGATCGAGGTGTGAATACACTGAGCTATTGATGAAGGACTTATGTATGTATTACAGTTAT
AATGAATTTCTCATGGAAGTTCTCATGCAAATATTTCCAGTACAGGAATTAGTGGAATTT
CTTGAAGCAAGTGAAGTAGCTCGCCCATTGACTATTAGGACTAACAGCTTGAAAACAAGA
AGAAGGGATTTAGCTCAAGCCCTTATTAACAGAGGAGTTAATTTAGATCCGGTTGGAAAG
TGGAGCAAAGTTGGTCTAGTAGTTTATAGTTCTACAGTACCAATTGGTGCTACTCCGGAA
TATTTGGCTGGCCATTACATTTTACAAGGGGCATCTAGTTTCTTGCCAGTAATGGCTTTA
GCGCCACAAGAGAATGAGAGAATATTAGACATGTGTGCGGCCCCTGGTGGTAAAGCATCT
CATATAGCTGCCATCATGAAAAATACAGGTGCCTTATTTGCTAATGATGCCAATAAAGAT
AGAACTAAAGCGATTGTCGGTAACTTCCACAGGCTGGGAATTGTTAATGCTGTTATTTGT
AACTATGATGGACGTCAATTCCCAGAGGTTATTAAGGGCTTCGATAGAGTATTACTTGAT
GCCCCCTGTACAGGAACGGGGGTTATAGCTAAAGATCCTAGCGTGAAGACTTCGAAGGAA
CAAAAGGATATTCAGAGATGTTTCAATCTACAAAGACAGCTTTTACTGGCCGCTATAGAT
TGTTGTAATGCTAAATCCAGTACAGGCGGTTACATTGTTTATTCGACATGTTCTATATTA
CCAGAAGAAAATGAATGGGTTGTAAATTATGCATTGAAAAGAAGAAATGTCAAGTTAGTG
CCGACCGGTCTTGACTTTGGTACAGAGGGATTCGTTAAATACAGACATCATAGATTCCAT
CCATCATTAAAACTAACAAGAAGATTCTATCCCCATACACATAATATGGATGGTTTTTTT
GTGGCCAAATTTAAGAAGTTCTCTAATGTTATACCTGAGCCATTTAAGGATGAAGAAGAA
GACAATGAAGAAATAAAGGAAGATGCAGAACAGAATGGTGATGCGACAATGCAGAAGAAA
TCCAAAAAAAAAAATACACCAGTTAAGAGGCCAGCGGAATCTGTTGCTGTTGAACCACAG
AATAAAAAAAATAAAAACTAA
Protein sequence:
MGRKAKFDESVKIKKGPGRKARKQPDPVFKKGLIDDDKEEKKLSHRQKQRAARRLKKKKE
LVEKKKALKEAKKNVAKVNEKVVEDKSEDESEDSEQENVEGFTDDNKEWLKPKQKSKSKE
YNSEDDENDSGSEVEDEDVEEAPKKSGKEKYKVGKLDDLFVDTDEEQDNDPDEEDDADNE
NKMDYNSDSSDEEKDDDDDDDDDDDDMLPIEKANIKLKKKQKIDKKLADDELQLNISKQD
VFAFPSEEELQNPTSLQDIHQRIKDVVTVLSDFNRLKDQERSRCEYTELLMKDLCMYYSY
NEFLMEVLMQIFPVQELVEFLEASEVARPLTIRTNSLKTRRRDLAQALINRGVNLDPVGK
WSKVGLVVYSSTVPIGATPEYLAGHYILQGASSFLPVMALAPQENERILDMCAAPGGKAS
HIAAIMKNTGALFANDANKDRTKAIVGNFHRLGIVNAVICNYDGRQFPEVIKGFDRVLLD
APCTGTGVIAKDPSVKTSKEQKDIQRCFNLQRQLLLAAIDCCNAKSSTGGYIVYSTCSIL
PEENEWVVNYALKRRNVKLVPTGLDFGTEGFVKYRHHRFHPSLKLTRRFYPHTHNMDGFF
VAKFKKFSNVIPEPFKDEEEDNEEIKEDAEQNGDATMQKKSKKKNTPVKRPAESVAVEPQ
NKKNKN