DPGLEAN10295 in OGS1.0

New model in OGS2.0DPOGS212341 
Genomic Positionscaffold101:- 301703-309860
See gene structure
CDS Length2202
Paired RNAseq reads  3208
Single RNAseq reads  7496
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014476 (1e-160)
Best Drosophila hit  enhancer of zeste, isoform B (0.0)
Best Human hithistone-lysine N-methyltransferase EZH1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to enhancer of zeste 2 isoform a [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to enhancer of zeste, ezh [Nasonia vitripennis] (0.0)
GeneOntology terms






  
GO:0018024 histone-lysine N-methyltransferase activity
GO:0003677 DNA binding
GO:0045449 regulation of transcription
GO:0008168 methyltransferase activity
GO:0016568 chromatin modification
GO:0005634 nucleus
GO:0016740 transferase activity
GO:0006350 transcription
InterPro families

  
IPR001214 SET domain
IPR009057 Homeodomain-like
IPR001005 SANT domain, DNA binding
Orthology groupMCL11128

Nucleotide sequence:

ATGCGGCTTCGGCAAGCAAAGAGATTCAAGCGGGCTGATGAGGTAAAGGTGGCATGGGCT
CGGAACTTGAGGCTCATGTCGGAGTCTGTTGAAATGCGTGATAGTGAATGTTTGGAGCGT
GGAAGAAGACCTTTTTGGCCTCCACCAGCACCAACACCCAGCCACGAGAGCCTCATGAAA
CGTGCTGAAGTTTCTTGTACAGATGCATCTGGAGTAGTAACAACTCAGCAAGTACCAATT
CGAATCATTAACTCTGTGAATCCCATCCCTACTATGTACACCTGGGCTCCGACACAGAAA
AACTTCATGGTGGAGGATGAAACGGTGCTGCACAACATTCCCTACATGGGAGACGAAGTT
CTGGACCAGGATGGTACATTCATCGAGGAGCTCATAAAGAACTACGATGGGAAAGTGCAT
GGTGATAAAGAAGGTGGTTTTATTGATGACCAGCTGTTTGTGGACCTGGTGCATGCGTTG
GTGTCATTCCAGACAGGAGACGAGGTCGCTGAGGAGAGGAAGGAGAGGGAGCTGAGGAGA
TCTAAAGAAGACAAGGAGAAAGAAACATCATCCGAAGTGAAAGACAAAGAAGAACCCAAG
GAGGGAGACAAGTCTCTGGTGTACAATGAGAAACAGTTCCCAATCTTCACAATATTCCAG
GCCATCAGCTCACAGTTCCCCGACAAGGGCACGGCGCAAGAACTCAGAGAAAAGTACGTG
GAGCTGACGTCTCGCGCGGACCCGCTGGCGCTGCCGCCGTCCTGCACGCCCAACATAGAC
GGACCTCATGCGGAGTGCGTGTCCCGGGACCAGACCATGCACTCCTTCCACACATTGTTC
TGTCGACGGTGCTTCAAGTACGACTGCTTCCTGCACCGCCTGCAGGCGTGTCACCCGCGA
CCCAACCTGTCCAAGCGGAAGGGTCCGGATCTCAAACCCTTCTCCGAGCCGTGCGGCTCC
AGCTGCTATATGTTACTGGAAGGAATGAAGGAGAAATTAGCTCGCGAGCAAGCGGCGGCT
GGCGGGGAGGGCAGGGACGGGCGCGAAGGGAGGGAGGGGAGGGACAGGGCGCTGGACTCA
CCCAACGACGCCTCCTCGGAGGACAGTAATGACAGCAACAGGTATCAGAAAGGCAGCAAC
AGCAACTCCAGCAACAGCAACTGGAGCGCTCTCTGCAGCAAGCAGCCTCAGGAGCACACG
GACGCCCCCTACAACGTACTGGGTTTGACAGTAGGCGACATAGAGTCGGAGTGGACTGGG
TCGGACCAGTCCCTGTTCCGCGCCCTCCACAAGGTGTTCCCCTCCAACTACTGCGCCATC
GCCCAGCTCATGCTCTCCAAGACTTGTCAGCAGGTCTACACGTACTGGATCCGCACCGGA
CAAGAGCAGTGCCGCGTGGAAGCCGAGCTGACGCCGCCCAGGAAGAAGAAGAAGAAGCAT
CGCCTGTGGTCCGTGCACTGCCGCAAGATACAGCTCAAGAAGGACTCCGCCTCGCATCAC
GTCTACAACTACACTCCGTGCGATCATCCCAACCAGCCGTGCGACAGTCTGTGTCCGTGT
CTCCAGTCGCAGAACTTCTGCGAGAAATTCTGTCAATGTAGCAGCGACTGTCAGAACCGC
TTCCCCGGTTGCCGCTGCAAGGCCCAGTGTAACACCAAGCAGTGTCCGTGCTACCTCGGC
GTGCGGGAGTGCGACCCCGACCTGTGCACCGCCTGCGGCGCCGACGCCCCCTCGCCCGCC
GCCCCCCGAGCGCCCCTCTACTGCAGGAACGTGTCCGTCCAGCGAGGTCTCCACAAGCAC
CTGCTGCTGGCTCCGTCCGACGTGGCGGGCTGGGGCATCTTCTTGAAGGAAGCCGCCCAC
AAGAACGAGTTCATCTCCGAGTACTGCGGCGAGGTGATCTCGCAGGACGAGGCGGACCGC
CGCGGGAAGGTCTACGACAAATACATGTGCTCCTTCCTCTTCAACCTCAACAACGACTTC
GTAGTTGACGCGACTCGTAAGGGCAATAAGATCCGGTTCGCGAACCACTCGATAAACCCG
AACTGCTACGCTAAGGTCATGATGGTGAACGGAGACCATCGCATCGGCATCTTCGCCAAG
CGAGCCATCCAGCCCGGAGAGGAACTGTTCTTTGACTACAGATACGGACCGACTGAACAA
CTGAAGTTTGTGGGAATCGAAAGGGAAATGGAGTTCTTATGA

Protein sequence:

MRLRQAKRFKRADEVKVAWARNLRLMSESVEMRDSECLERGRRPFWPPPAPTPSHESLMK
RAEVSCTDASGVVTTQQVPIRIINSVNPIPTMYTWAPTQKNFMVEDETVLHNIPYMGDEV
LDQDGTFIEELIKNYDGKVHGDKEGGFIDDQLFVDLVHALVSFQTGDEVAEERKERELRR
SKEDKEKETSSEVKDKEEPKEGDKSLVYNEKQFPIFTIFQAISSQFPDKGTAQELREKYV
ELTSRADPLALPPSCTPNIDGPHAECVSRDQTMHSFHTLFCRRCFKYDCFLHRLQACHPR
PNLSKRKGPDLKPFSEPCGSSCYMLLEGMKEKLAREQAAAGGEGRDGREGREGRDRALDS
PNDASSEDSNDSNRYQKGSNSNSSNSNWSALCSKQPQEHTDAPYNVLGLTVGDIESEWTG
SDQSLFRALHKVFPSNYCAIAQLMLSKTCQQVYTYWIRTGQEQCRVEAELTPPRKKKKKH
RLWSVHCRKIQLKKDSASHHVYNYTPCDHPNQPCDSLCPCLQSQNFCEKFCQCSSDCQNR
FPGCRCKAQCNTKQCPCYLGVRECDPDLCTACGADAPSPAAPRAPLYCRNVSVQRGLHKH
LLLAPSDVAGWGIFLKEAAHKNEFISEYCGEVISQDEADRRGKVYDKYMCSFLFNLNNDF
VVDATRKGNKIRFANHSINPNCYAKVMMVNGDHRIGIFAKRAIQPGEELFFDYRYGPTEQ
LKFVGIEREMEFL