DPGLEAN05424 in OGS1.0

New model in OGS2.0DPOGS211967 
Genomic Positionscaffold71:- 215699-217832
See gene structure
CDS Length2055
Paired RNAseq reads  1652
Single RNAseq reads  3948
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000919 (0.0)
Best Drosophila hit  Suv4-20 (3e-99)
Best Human hithistone-lysine N-methyltransferase SUV420H1 isoform 1 (1e-76)
Best NR hit (blastp)  PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum] (6e-140)
Best NR hit (blastx)  PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum] (2e-132)
GeneOntology terms




  
GO:0005634 nucleus
GO:0040029 regulation of gene expression, epigenetic
GO:0042799 histone methyltransferase activity (H4-K20 specific)
GO:0016571 histone methylation
GO:0034773 histone H4-K20 trimethylation
GO:0034772 histone H4-K20 dimethylation
InterPro families  IPR001214 SET domain
Orthology groupMCL17026

Nucleotide sequence:

ATGGTTGTTGGATCCGCGAGTTATCCAGGGAGGCAAGTGCTCCACAAAATGCAGCCCATG
GGAATGACCCCGCGCGAGTTGTCGGAGTACGACGACCTGGCGACGGCTCTCATAATCGAC
CCTTATCTCGGAATAACCACTCACAAAATGAACATCAGGTACAGACCGTTGAAAACGAAC
AAAGAGGAACTGAAGAATATCATTAAAGAATTCCTCCAGACTCAGGACTACAACAAGGCT
TACTCCAAACTGGCTAATGGTGAATGGATGCCGAGACACTTCAGCAAAAATAAACACCAG
CAAAACAAACTAAGAGAACACATTTACCGTTATCTTAGAATTTTTGACAAGAAGGCTGGT
TTTGTTATAGAACCTTGTTATAGATATTCATTAGAAGGAAGAATTGGAGCTAAAATTTCA
ACTACAAAAAAATTTTTTAAGCATGAGAGAATCGACTTCCTAGTGGGTTGTATCGCTGAG
ATGACCGAGGAAGAGGAGAAGCAACTTCTTCATCCAGGGAAAAATGACTTTTCAGTTATG
TATAGTTGTAGAAAAAATTGTGCCCAGCTGTGGTTGGGACCCGCTGCTTATATAAATCAT
GACTGCAGGCCCACCTGTACCTTTGAAGCAACAGACCGGGGAAAGGCATTTGTACGAGTG
TTGAGGGATATAGAAGTTGGGGAAGAGATAACCTGCTTCTACGGAGAAGACTTCTTTGGC
AATGGGAATTGTTACTGTGAATGTGAGACATGTGAGAGACGAGGGAAGGGGGCATTTTCA
GTACAGAATGCTCACAATGACGAGCAGGCCACGAGGTACAGGTTTAGAGAAACTGATAAT
AGAATAAATAGGACTAAAGCAAAGCAAATTCAAAAACCTGTGAACAGTAAAAACTCTGAA
AAGTCAATAGCGCCCGCAGGACAAGTCTCTACTATAGTGTCTCCATTGAGTATGAAGGAA
ATGAAGCAGAAGGGTTTGACGAAGTATGATGCGGAGTTGTTAATAGCACAAGGTTGCATA
GCGGACATCATGGACGGGAACGGTAAGAAGAACGCGCAAGGGAGCCGGGAGTCATCGGCG
TCCAGCCGCGGGGAGCGTCTCCGGCGGAGAGCGGACAGCAGTCGTCCTGTCAACGGCACC
TCCACACACACACCCAGCGCCAGTCAGGCTGCTCAGTCGCGATGTTGTAGCGTCACAAGC
TCGTGTAGCAGTCGCGACTCGCATTCTGGAATAGTTTTAAGAAGTCACAGGCGTCTCACC
GAGTCCAGTGTCCCCGCTGTCTGTTCGAAAGTAAGGAATTCTTCGAAAGCCACAACCGAA
ACCAAAATTTGCCGAAATCACAGCACGCCAAAAACAGAACCAGATTTACCAGAGCCGGAA
GTGGATACACAGCCGGTGAAAATGGAACCTCCGCAGGAACACGAGGAGGTCACGCACGAC
CCGGAAACACACATAGAAACTAGGAGTGGCGATGAAGCTGCAATCACGGGCGAGGAATGC
CACAAGAATGAATCTCCGCTGCAAGCAACAGAGACACCTCCCCGAGGAAATGTAAAGACG
GACACCGAGGACACAGAGCCCGCGGAGAAGCGGCTCGTAGAGAGCAAGTGTGTGGGCGAA
GACGCGTGCATCAGCGAGAGTTGTGACTTTAGAGAAAACGTGAACCCGAGCGAGGCGAAG
GAGAGTAAGGCAGTGAAACAGAAAGCGGATGTGGCAAGGAGCGACGAGAACAAGAAGGAG
TGCGAGGGCCAATGGCTGGCGGACAAGTCCAACTGCGGCGGAGAGTGTCCCTGCACCCCG
CCCAGGAGGGGCCTGAAGTTGACGCTCAGGGTGAAGAGGAGCCCCGTGGTGGAGGAAGAG
GTGCCCGAGTACGAGGTGCTGCGGCTGGAGGGCGTCGACCCCGACACGGCCCGCCGCCTC
AAGAAGAGACGCCGCTCCAAGGAACGACGGAAACACAGCCCCGTCCGCCCGCTGCCTCCC
ATGAAGCGACTCAGGTTGATCTTCGGCAACGAGAGCCGCACCATCGACCTCCCGCCCGCC
CTCACGGCAGACTGA

Protein sequence:

MVVGSASYPGRQVLHKMQPMGMTPRELSEYDDLATALIIDPYLGITTHKMNIRYRPLKTN
KEELKNIIKEFLQTQDYNKAYSKLANGEWMPRHFSKNKHQQNKLREHIYRYLRIFDKKAG
FVIEPCYRYSLEGRIGAKISTTKKFFKHERIDFLVGCIAEMTEEEEKQLLHPGKNDFSVM
YSCRKNCAQLWLGPAAYINHDCRPTCTFEATDRGKAFVRVLRDIEVGEEITCFYGEDFFG
NGNCYCECETCERRGKGAFSVQNAHNDEQATRYRFRETDNRINRTKAKQIQKPVNSKNSE
KSIAPAGQVSTIVSPLSMKEMKQKGLTKYDAELLIAQGCIADIMDGNGKKNAQGSRESSA
SSRGERLRRRADSSRPVNGTSTHTPSASQAAQSRCCSVTSSCSSRDSHSGIVLRSHRRLT
ESSVPAVCSKVRNSSKATTETKICRNHSTPKTEPDLPEPEVDTQPVKMEPPQEHEEVTHD
PETHIETRSGDEAAITGEECHKNESPLQATETPPRGNVKTDTEDTEPAEKRLVESKCVGE
DACISESCDFRENVNPSEAKESKAVKQKADVARSDENKKECEGQWLADKSNCGGECPCTP
PRRGLKLTLRVKRSPVVEEEVPEYEVLRLEGVDPDTARRLKKRRRSKERRKHSPVRPLPP
MKRLRLIFGNESRTIDLPPALTAD