DPGLEAN14087 in OGS1.0

New model in OGS2.0  DPOGS212221
Genomic Position  scaffold959:+ 16124-22714
CDS Length  2139
Paired RNAseq reads  1045
Single RNAseq reads  2471
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004408 (0.0)
Best Drosophila hit  CG6133 (0.0)
Best Human hit  tRNA (cytosine-5-)-methyltransferase NSUN2 isoform 2 (2e-132)
Best NR hit (blastp)  PREDICTED: similar to CG6133-PA [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG6133-PA [Apis mellifera] (0.0)
GeneOntology terms
GO:0003886 DNA (cytosine-5-)-methyltransferase activity
GO:0003723 RNA binding
InterPro families
IPR018314 Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p, conserved site
IPR023267 RNA (C5-cytosine) methyltransferase
IPR023270 RNA (C5-cytosine) methyltransferase, NCL1
IPR001678 Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
Orthology group  MCL14539

Nucleotide sequence:

ATGGGTAGACGAAACAGAAATGTTAATAAATTCGCACAACGCAAAAGAGAACGGAAAGAA
CAGGAAAAAAATCCACAACAGAAACCGGCCGACACGCGAAAGCATTATGAAGATATTGTG
CGTGAAAATGCAATCTTCGAAGAGTATTATAAGGCGCAGAAAGTTTGTCCTGATGATCAA
TGGGACGACTTCATGAGAGCTATCAAGAAAGATCTGCCAACAGCATTCAGAATAACGGGC
TCCAAATGTGAAACAGATGCGCTACTCAATATAGTTAAGAGCCAATATTTTTCAGAGATA
TTAAATCATAAACTTAAATTGGATGATGAGAAAGAAGAAGAGGAAATCAAACCTGTTAAC
TTGCCGTGGTATCCCGGAGGTCTAGTGTGGCAACTGCCAGTATCTCGCACACATATCCGT
CGGAATGAACCACTCTATCGTCTCCATAACTTTTTAGTGGCTGAAACAGAAGCGGGCGGT
GTTTCACGTCAAGAGGCTGTATCTATGATACCGCCTGTTGTGTTACAAGTTGAACCTCAT
CATAAGGTATTAGACATGTGTGCTGCACCGGGCTCCAAGACTGCACAACTCATTGAGTTC
TTACATTCTGATGAAGACAAAATGCCTACAGGTTTCGTAATGGCCAATGATGTTGACAAC
AGTCGCTGTTACATGTTGGTGCACCAGGCGAAGAGGTTGAACTCACCCTGCATCATCATC
ACCAACCATGACTCCGCAGTGTTGCCATCACTGGTTGTGAGTGATGAGGAGAACCCGAGT
GCGACGAAGCCGCTTAAGTTCGACCGCGTTCTGTGTGATGTTCCATGTTCCGGAGACGCC
ACCTTGAGGAAGAACCCTGATATATGGACGAAATGGTCGACCGGAAACGGAAATAACTTA
CACGGTATTCAGTATAGAATCCTCCGTCGTGGCGTTGAGTTGTTGTCTGTGGGCGGAAGA
TTGGTCTATTCCACCTGTTCCTTCAACCCTGTGGAGAACGAGGCCGTGGTGCACAGAATC
CTTCAGGAGACCGGCGCCAGTGTGACCCTCGTGGATGTACAGGATCTACTGCCCGGACTA
AAGTTCCATAAAGGCATGACACATTGGCGGCCGGCGTCTAAAGACATGGTGTTCTACAAC
AGTTATGATGAGGTTCCAGAGAAATGGCAGACGGTGGTGAGGCCGCAAATGTTCCCTCCC
AAGACTGAAGACTTGGACAAATATAATCTGGATAGATGCATAAGAATTCTGCCTCATCAC
CAAGATACTGGAGGGTTTTTCGTGGCAGTGTTTGAAAAAACCGCCCTCCTGCCATGGGAG
AAGGACCCAACCAAGAAACCGGATGTGGCAGCCGATGAACCGGCAGAGGAACCGGAAAAG
AAGGAACCACCAAAGAAGAGAAGAAGAATGGGAGGATATAGGGAGGATCCTTTTGTATTT
TTCTCCGGTGAAAATGAAGATGTGTTCCCTTCTATCAAGGAATTCTACGATCTTGATACA
AAATTCGACCCTACCTGTCTCTTGACGAGATGTCATGTTGGGAAAAAGAAGAATATTTAC
CTGGTGTCAGCCATGGTGAAAGAAGTTGTACAGAAAAATGAGAATAGTATTAAGATTATA
AACACAGGCGTCAAAACATTTGTTAGGTGTGATAATAAAAATATGAAGTGCCCATTCAGA
CTATCTCAAGAAGGTCTTCAGAGTATAGCCCAGTACATCGGTCCAAAACGACGCGTGACC
ATTCTTAAGGAGGATCTCATACTAATATTACAATGTGACAACCCTAGCAAACCCCCAGAA
CTAAAACTGTTCACAGAACACACTCAGAATATGGTGAAAGATTTCGCTACTGGTAGCTGC
GTGTTGGAGTATAAGGACACGTCATCAGGGTTGTCACTCCGCCTGGTCGGTTGGCGAGGT
GTTCACTCTCTACGCGCGTACACCGCCGCCCCTGACACCGTGCACTACCTAAGACTGTTG
GGAGCTGACTACAGCAAATATGACGTAAATAAGTTCAAGAAGGCTGCAGAAGTCCCCAAG
GATGATAGTATTGAGGTTAGCGGAACAGCCTCTACCAGTGAAAATGATCCTAACAAGACC
AACGCTATGGAGACTGAAGAGGGAATGAATGTAACATGA

Protein sequence:

MGRRNRNVNKFAQRKRERKEQEKNPQQKPADTRKHYEDIVRENAIFEEYYKAQKVCPDDQ
WDDFMRAIKKDLPTAFRITGSKCETDALLNIVKSQYFSEILNHKLKLDDEKEEEEIKPVN
LPWYPGGLVWQLPVSRTHIRRNEPLYRLHNFLVAETEAGGVSRQEAVSMIPPVVLQVEPH
HKVLDMCAAPGSKTAQLIEFLHSDEDKMPTGFVMANDVDNSRCYMLVHQAKRLNSPCIII
TNHDSAVLPSLVVSDEENPSATKPLKFDRVLCDVPCSGDATLRKNPDIWTKWSTGNGNNL
HGIQYRILRRGVELLSVGGRLVYSTCSFNPVENEAVVHRILQETGASVTLVDVQDLLPGL
KFHKGMTHWRPASKDMVFYNSYDEVPEKWQTVVRPQMFPPKTEDLDKYNLDRCIRILPHH
QDTGGFFVAVFEKTALLPWEKDPTKKPDVAADEPAEEPEKKEPPKKRRRMGGYREDPFVF
FSGENEDVFPSIKEFYDLDTKFDPTCLLTRCHVGKKKNIYLVSAMVKEVVQKNENSIKII
NTGVKTFVRCDNKNMKCPFRLSQEGLQSIAQYIGPKRRVTILKEDLILILQCDNPSKPPE
LKLFTEHTQNMVKDFATGSCVLEYKDTSSGLSLRLVGWRGVHSLRAYTAAPDTVHYLRLL
GADYSKYDVNKFKKAAEVPKDDSIEVSGTASTSENDPNKTNAMETEEGMNVT
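
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code, and the function name `translate` is hypothetical. It translates a CDS string codon by codon and stops at the first stop codon.

```python
# Minimal sketch: translate a CDS with the standard genetic code (assumed).
# `translate` is an illustrative helper, not something provided by the database.

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # codons starting with T
         "LLLLPPPPHHQQRRRR"   # codons starting with C
         "IIIMTTTTNNKKSSRR"   # codons starting with A
         "VVVVAAAADDEEGGGG")  # codons starting with G
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO,
))


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)


# First line of the nucleotide sequence in this record.
first_line = "ATGGGTAGACGAAACAGAAATGTTAATAAATTCGCACAACGCAAAAGAGAACGGAAAGAA"
print(translate(first_line))
```

Applied to the full nucleotide sequence, the result should equal the listed protein sequence: 712 residues plus one stop codon account for the CDS length of 2139 (3 × 713).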