New model in OGS2.0 | DPOGS211967  |
---|---|
Genomic Position | scaffold71:- 215699-217832 |
See gene structure | |
CDS Length | 2055 |
Paired RNAseq reads   | 1652 |
Single RNAseq reads   | 3948 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000919 (0.0) |
Best Drosophila hit   | Suv4-20 (3e-99) |
Best Human hit | histone-lysine N-methyltransferase SUV420H1 isoform 1 (1e-76) |
Best NR hit (blastp)   | PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum] (6e-140) |
Best NR hit (blastx)   | PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum] (2e-132) |
GeneOntology terms    | GO:0005634 nucleus GO:0040029 regulation of gene expression, epigenetic GO:0042799 histone methyltransferase activity (H4-K20 specific) GO:0016571 histone methylation GO:0034773 histone H4-K20 trimethylation GO:0034772 histone H4-K20 dimethylation |
InterPro families   | IPR001214 SET domain |
Orthology group | MCL17026 |
Nucleotide sequence:
ATGGTTGTTGGATCCGCGAGTTATCCAGGGAGGCAAGTGCTCCACAAAATGCAGCCCATG
GGAATGACCCCGCGCGAGTTGTCGGAGTACGACGACCTGGCGACGGCTCTCATAATCGAC
CCTTATCTCGGAATAACCACTCACAAAATGAACATCAGGTACAGACCGTTGAAAACGAAC
AAAGAGGAACTGAAGAATATCATTAAAGAATTCCTCCAGACTCAGGACTACAACAAGGCT
TACTCCAAACTGGCTAATGGTGAATGGATGCCGAGACACTTCAGCAAAAATAAACACCAG
CAAAACAAACTAAGAGAACACATTTACCGTTATCTTAGAATTTTTGACAAGAAGGCTGGT
TTTGTTATAGAACCTTGTTATAGATATTCATTAGAAGGAAGAATTGGAGCTAAAATTTCA
ACTACAAAAAAATTTTTTAAGCATGAGAGAATCGACTTCCTAGTGGGTTGTATCGCTGAG
ATGACCGAGGAAGAGGAGAAGCAACTTCTTCATCCAGGGAAAAATGACTTTTCAGTTATG
TATAGTTGTAGAAAAAATTGTGCCCAGCTGTGGTTGGGACCCGCTGCTTATATAAATCAT
GACTGCAGGCCCACCTGTACCTTTGAAGCAACAGACCGGGGAAAGGCATTTGTACGAGTG
TTGAGGGATATAGAAGTTGGGGAAGAGATAACCTGCTTCTACGGAGAAGACTTCTTTGGC
AATGGGAATTGTTACTGTGAATGTGAGACATGTGAGAGACGAGGGAAGGGGGCATTTTCA
GTACAGAATGCTCACAATGACGAGCAGGCCACGAGGTACAGGTTTAGAGAAACTGATAAT
AGAATAAATAGGACTAAAGCAAAGCAAATTCAAAAACCTGTGAACAGTAAAAACTCTGAA
AAGTCAATAGCGCCCGCAGGACAAGTCTCTACTATAGTGTCTCCATTGAGTATGAAGGAA
ATGAAGCAGAAGGGTTTGACGAAGTATGATGCGGAGTTGTTAATAGCACAAGGTTGCATA
GCGGACATCATGGACGGGAACGGTAAGAAGAACGCGCAAGGGAGCCGGGAGTCATCGGCG
TCCAGCCGCGGGGAGCGTCTCCGGCGGAGAGCGGACAGCAGTCGTCCTGTCAACGGCACC
TCCACACACACACCCAGCGCCAGTCAGGCTGCTCAGTCGCGATGTTGTAGCGTCACAAGC
TCGTGTAGCAGTCGCGACTCGCATTCTGGAATAGTTTTAAGAAGTCACAGGCGTCTCACC
GAGTCCAGTGTCCCCGCTGTCTGTTCGAAAGTAAGGAATTCTTCGAAAGCCACAACCGAA
ACCAAAATTTGCCGAAATCACAGCACGCCAAAAACAGAACCAGATTTACCAGAGCCGGAA
GTGGATACACAGCCGGTGAAAATGGAACCTCCGCAGGAACACGAGGAGGTCACGCACGAC
CCGGAAACACACATAGAAACTAGGAGTGGCGATGAAGCTGCAATCACGGGCGAGGAATGC
CACAAGAATGAATCTCCGCTGCAAGCAACAGAGACACCTCCCCGAGGAAATGTAAAGACG
GACACCGAGGACACAGAGCCCGCGGAGAAGCGGCTCGTAGAGAGCAAGTGTGTGGGCGAA
GACGCGTGCATCAGCGAGAGTTGTGACTTTAGAGAAAACGTGAACCCGAGCGAGGCGAAG
GAGAGTAAGGCAGTGAAACAGAAAGCGGATGTGGCAAGGAGCGACGAGAACAAGAAGGAG
TGCGAGGGCCAATGGCTGGCGGACAAGTCCAACTGCGGCGGAGAGTGTCCCTGCACCCCG
CCCAGGAGGGGCCTGAAGTTGACGCTCAGGGTGAAGAGGAGCCCCGTGGTGGAGGAAGAG
GTGCCCGAGTACGAGGTGCTGCGGCTGGAGGGCGTCGACCCCGACACGGCCCGCCGCCTC
AAGAAGAGACGCCGCTCCAAGGAACGACGGAAACACAGCCCCGTCCGCCCGCTGCCTCCC
ATGAAGCGACTCAGGTTGATCTTCGGCAACGAGAGCCGCACCATCGACCTCCCGCCCGCC
CTCACGGCAGACTGA
Protein sequence:
MVVGSASYPGRQVLHKMQPMGMTPRELSEYDDLATALIIDPYLGITTHKMNIRYRPLKTN
KEELKNIIKEFLQTQDYNKAYSKLANGEWMPRHFSKNKHQQNKLREHIYRYLRIFDKKAG
FVIEPCYRYSLEGRIGAKISTTKKFFKHERIDFLVGCIAEMTEEEEKQLLHPGKNDFSVM
YSCRKNCAQLWLGPAAYINHDCRPTCTFEATDRGKAFVRVLRDIEVGEEITCFYGEDFFG
NGNCYCECETCERRGKGAFSVQNAHNDEQATRYRFRETDNRINRTKAKQIQKPVNSKNSE
KSIAPAGQVSTIVSPLSMKEMKQKGLTKYDAELLIAQGCIADIMDGNGKKNAQGSRESSA
SSRGERLRRRADSSRPVNGTSTHTPSASQAAQSRCCSVTSSCSSRDSHSGIVLRSHRRLT
ESSVPAVCSKVRNSSKATTETKICRNHSTPKTEPDLPEPEVDTQPVKMEPPQEHEEVTHD
PETHIETRSGDEAAITGEECHKNESPLQATETPPRGNVKTDTEDTEPAEKRLVESKCVGE
DACISESCDFRENVNPSEAKESKAVKQKADVARSDENKKECEGQWLADKSNCGGECPCTP
PRRGLKLTLRVKRSPVVEEEVPEYEVLRLEGVDPDTARRLKKRRRSKERRKHSPVRPLPP
MKRLRLIFGNESRTIDLPPALTAD