DPGLEAN15585 in OGS1.0

New model in OGS2.0DPOGS207038 
Genomic Positionscaffold1:+ 1333875-1344810
See gene structure
CDS Length4476
Paired RNAseq reads  1726
Single RNAseq reads  4243
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012978 (9e-35)
Best Drosophila hit  CG40351, isoform H (6e-109)
Best Human hithistone-lysine N-methyltransferase SETD1A (4e-88)
Best NR hit (blastp)  GM18767 [Drosophila sechellia] (1e-141)
Best NR hit (blastx)  PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum] (1e-157)
GeneOntology terms



  
GO:0006338 chromatin remodeling
GO:0005634 nucleus
GO:0018024 histone-lysine N-methyltransferase activity
GO:0000166 nucleotide binding
GO:0003676 nucleic acid binding
InterPro families



  
IPR000504 RNA recognition motif domain
IPR001214 SET domain
IPR003616 Post-SET domain
IPR015722 Histone-lysine N-methyltransferase
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology groupMCL14407

Nucleotide sequence:

ATGAATGGAGGAATGGAGCACAAAACACCAGGCCACAATGCAGTTCTTCACAAGGGACCT
AAAAATTATAAACTTCTAATAGACCCATTCCTAGTAAAGGGAGCGACGAAAGTGTACAGA
TATGATGGTACTGTTCCTGGTACGTCTTACCCGTCGATACAATGCAGAGACCCGAGACCT
CAACCGTCCAGAATATGGAATAAATTAGAACCAGCAGATTTACCGATACCTAGGTTTAGA
ATTGATAAGAATTATGTTGGTGTTCCGCCACAGTTAGAAATAACTATTGTAAATTTGAAT
GATAACATCGATAAAGCTTTCTTGTCCGACATGATGAATAAAGTTGGACCTTATGAAGAA
TTGACAATATTTTATCATCCGATGACTAATAGGCATTTAGGTTTTGCTAGAATTGTATTT
CAGGATGTCAAATATTCCAAAATATGTATCGAAAAATATAATGGAAAATCTGTCATGGGG
CAGGTACTTGAAGTTTTCCATGATTCATTTGGTAAGAAGTGTCAGGAGATGTTTGAAGAT
AAAACGGTGGAGAAAAAGCCCCAGCCGGCTCCGATCAAGCCTCCCGAGGATGCCCGAGTG
GCCAAGCTAGATCCAGCTCTCAGCAAGAGATTAGAGGATAGCAAACTAGTTGATAAGGAC
CCATACCTTCGCAAGGAGCTGGAACACAATGACAGCAACACAAGATGGTCAGATGATGAA
AGGGACAGAGAGTACAAACATCGCCTCCGCAGTAGAAGTGAGAGAGACAGAGATATTGAC
AGAGATAGAGATGGCAACAGAGAGAGAGAAAGACACAGAGACCGCTATGCCCGTACCAGT
GAACAGAGTGAGTCCTATTCCAGCGCCCACGTTGAGATACCGTACGCTCCAACGCCAGTA
CCATACGACCCATACTACCAGACCCCTGGCTATGGATACGGCTACGGCACGAGCGCCGGC
GCTGTTTGGTGGGGGGACTGGAGACAGCCGCACACCTCTCACCATCACTCACATTCGGAG
CAGAGCAGCAGCAGCACGTGGACGGCCGCGGGCGAGCCGACCCCCTCCCCGCGGCACACA
CCCCTCGCGGCCCCTCAAACCCCGTACACGCCCGCGCCCCCCACTCCGCTCCCGGAAAAG
GAGGTGAAGTGTAAGCCCGAGGAGCCTCTTCCTCCGAGTACGTCCGTCGTCAGCGACCCT
GAGCCCAAGCCGCCTCCGCCCAGCGACGAACCCAAGAACGTCGACCTTGACACAAGGATA
GCTTTGTTGCTGAAGGGAGCCAGCGGCGGAGGGGGTCTGGCGCCTCCCTTCCTGTCTCTG
GGAATGACCTCTGAAGAGGAAGACGAAGACAGGAAGCCTAGGAACATACCTGACCTGGAC
ACACACAACCCGCCGTCAGACGATGAAGGTTCTGTAAGTGAGGATAGAGAGAGTATAATA
TCATTGAACCAGAATAGAGAAGTTAACCCGGAACCGTTGTCTAATACTCCTTCACCATAC
CTATCAAGAGAGTTCTATCTTGAATGTCTTAAAGCGACGGTCGAGAGGAAAGCAAAAGAA
GAGGAACGCAAAAAGTTCCCGCCAATAGACAAAATAGGTTCCGATATATCGTCGTCTGAA
GACGAACTGCTCACTGGGGAGGAACCACGACGTTCACCTGTTAACCCGCCCGATAGAGAT
CAAGATAATTTGGACGATGATCAGATGTCTTTATCGTCTCTGTCGTCAACGGAGGCCAAA
ATCGAGGAGCAAGTCCCCGCTGAGGCGTATTACTACCCGCCCGCACACCCGCATTACTAT
CAGGCGATGTGGCCGACAACTGCCTATCCTCCGGGTGCTGTGGGCGCTATGGGGTCGGTT
GCCGCTATGGGGCCCGTGGGAGCGGTGGGCCCAGTGGGCGCGGTGGGCGCTGCATACCCA
GCCGCCGGCGACATGTCATTGTATGCTGGTGGCTTCGCTCCTCCCGTGATACACAGCTAC
CCTCCACCACGCACCGTCACTCACGAGGAGCTCGACAACCCTTACTACCCAACAATCAAT
AGTGTGATAGAGCGCGTCACGACTGAGCTTAAACAAATACTTAAGAAGGATTTCAATAAG
AAAATGATAGAGAGCACTGCCTTTAAGAACTTCGAGGTTTGGTGGGACGAGCAGAGTCGG
AAGACGAGACAGACTGTGAAACAAACTAAAGAAGATGTCGGACAACCATTACAAGATGTA
TCAAATAAGAAGGAGGAATCGGTGGATTCAATAAAATCTATAATGGAGTCTAGAGATCTG
GGTCTAGATCTAGGCGGGTACAGTGTTGGTATTGGTCTTGGTCTCAGGGCGACCATACCA
AAGATGCCCAGTTTCAGGAAGAAGAGAAAAATACCTTCGCCTGTTGTTATGGACGAGGAC
TCCAGTAAGAGACTGAGTGATCAGGAGGAAATCGTCCAGAACTCTGACGAAGAGAAGGAA
GTACCGACCAGTCCTCGGAATAGGACAACAGGTTCATACCTCTCGACTGGCAGAAGAAGA
CAGTCGAGCAGCTCATCGAGGTCTTCGTCGTCGTCGTCTTCGCGATCCTCTTGGTCGGGT
TCGGAGCGCTCTGTGAGGAAGGTCGCCCCAAGAATATACTCCGACACAGACGACTCGGAC
CTCGAAGACGCTGAAGTGCAGCAAATCAAGTTGGTGTCCAACAAGGAGAGACTCAGGCGA
GTGTACTCATCGTCATCGGACAGCGAGGAAGAACAGAGAAGAAGAGAAAAAACTCCGATA
CCGGAAGTGGAAGCATCAGACGACCGCCTCGGTTCACCTATTCTGTCGCCGGAAGAGGAA
CCCAGAGATACAATACTCGATCGTGTATACTCGGACTCTGAGGAAGAGAGGGAATACCAG
GAGCGTCGCCGTCGTAACACTGAGTATATGGAACAGATCGAACGAGAGTTCCTCGAGGAA
CAGCGTCGAGGACAACAGACCTCTGACACGGATGCACAGCAACAAAACGATAGTATAACG
GAACCGAAACAAGAAAAGAGTAGAAGTAGTCCAAGCAAGAACTATCTGAAATCACCTGAG
AAGAATAAAATGGCAGCTGAGGGTGATGTTGAAGAAGGTGAAATAAGTTCAGAGGAAGAA
CCTTTAGAAGTTAGAAGAAAGAAGGAGAAGAAACAGAAAAAGAAGACTGACAAGAGACGA
AGAGTCACATCCGTCAGCGACCACAGCTTTACCGAGTCTGCGGTTAGCGTGAATGGCGTT
AAGGAGGCTAGCGGTGCTGTGTCGGAGACGTCTTCGCCTCAGTCGCAGGCGTCCCAGGCT
TCTCAGGCGTCCCAAGTGGCGTTGGACCACTCGTACTGTCGCCCGCCGCCCACTGAACGA
CCTACCACTACACATCTACAACACGATCACGGTTACACTTGGATGGCTGAACCGGAACCG
GAAGCAGAATCGCCACCAGTCGCCATGGAAGAGAAGAGACGGGAGAAGACGGAGAGACCG
TACAAAAGAAAACATCAGAATAAAAAGTTATCTGAAATTCAGAATAAATTATACGACGGT
CGCGATGATTATAACAATAAGTACTCGTCCGTGACATTCAAGCAGCGCGATATAATGGCG
GAGGTCCAAGTGATGTACGAGTTCCTCACCCGCGGTATAGACAGAGAGGATATAGAGTAC
CTCAGGCGGGCGTACGAGGCTCTGTTGGCGGAGGATGCTCAGGGGTACTGGCTCAATGAC
ACCCACTGGGTCGAACATCCGCCCACTGACCTCACGTACTCACCGCCCAAGAAGAAATCC
AAGCGATACAATAACATCTACGAGGACTTGCAAGGCCACTCGAGCGGTTCAGCACGTACG
GAGGGCTACTACAAGATGGACGCCAAGTTGAAGGCGAAGTACAAGTATCACCACGGAAGA
ACCGCTGCGTTACCCCCGCCTGATGATAAGAAAGCCAGCAAGATGCAGCTGCTGTCGAGA
GAGGCGCGCTCCAACCAGAGGAGACTGCTCACGGCATTCGGAACCGACACTGACTCAGAT
CTCCTCAAGTTCAATCAGCTCAAGTTCAGGAAGAAACAGCTCAAGTTCGCCAAATCTGGT
ATACACGACTGGGGTCTCTTTGCTCAGGAGGCGATAGCGGCGGACGAGATGGTTATCGAA
TACGTCGGTCAAATGGTCCGTCCCATAGTAGCGGATGTCCGCGAGGCTCACTACGAGGCC
ACTGGCATCGGTTCTTCATATCTGTTCCGTATAGACTTGGACACTATTATTGATGCAACC
AAGTGCGGTAACCTGGCGCGTTTCATCAACCACAGCTGCAATCCAAACTGTTACGCAAAG
ATAATAACTATAGAATCACAGAAGAAAATCGTCATATACTCGAAACAGCCCATAGGAGTC
GACGAGGAGATAACCTACGACTACAAGTTCCCTCTCGAAGACGAGAAGATACCTTGCCTG
TGTGGAGCGCCGCAATGCCGTGGCTACCTTAACTAG

Protein sequence:

MNGGMEHKTPGHNAVLHKGPKNYKLLIDPFLVKGATKVYRYDGTVPGTSYPSIQCRDPRP
QPSRIWNKLEPADLPIPRFRIDKNYVGVPPQLEITIVNLNDNIDKAFLSDMMNKVGPYEE
LTIFYHPMTNRHLGFARIVFQDVKYSKICIEKYNGKSVMGQVLEVFHDSFGKKCQEMFED
KTVEKKPQPAPIKPPEDARVAKLDPALSKRLEDSKLVDKDPYLRKELEHNDSNTRWSDDE
RDREYKHRLRSRSERDRDIDRDRDGNRERERHRDRYARTSEQSESYSSAHVEIPYAPTPV
PYDPYYQTPGYGYGYGTSAGAVWWGDWRQPHTSHHHSHSEQSSSSTWTAAGEPTPSPRHT
PLAAPQTPYTPAPPTPLPEKEVKCKPEEPLPPSTSVVSDPEPKPPPPSDEPKNVDLDTRI
ALLLKGASGGGGLAPPFLSLGMTSEEEDEDRKPRNIPDLDTHNPPSDDEGSVSEDRESII
SLNQNREVNPEPLSNTPSPYLSREFYLECLKATVERKAKEEERKKFPPIDKIGSDISSSE
DELLTGEEPRRSPVNPPDRDQDNLDDDQMSLSSLSSTEAKIEEQVPAEAYYYPPAHPHYY
QAMWPTTAYPPGAVGAMGSVAAMGPVGAVGPVGAVGAAYPAAGDMSLYAGGFAPPVIHSY
PPPRTVTHEELDNPYYPTINSVIERVTTELKQILKKDFNKKMIESTAFKNFEVWWDEQSR
KTRQTVKQTKEDVGQPLQDVSNKKEESVDSIKSIMESRDLGLDLGGYSVGIGLGLRATIP
KMPSFRKKRKIPSPVVMDEDSSKRLSDQEEIVQNSDEEKEVPTSPRNRTTGSYLSTGRRR
QSSSSSRSSSSSSSRSSWSGSERSVRKVAPRIYSDTDDSDLEDAEVQQIKLVSNKERLRR
VYSSSSDSEEEQRRREKTPIPEVEASDDRLGSPILSPEEEPRDTILDRVYSDSEEEREYQ
ERRRRNTEYMEQIEREFLEEQRRGQQTSDTDAQQQNDSITEPKQEKSRSSPSKNYLKSPE
KNKMAAEGDVEEGEISSEEEPLEVRRKKEKKQKKKTDKRRRVTSVSDHSFTESAVSVNGV
KEASGAVSETSSPQSQASQASQASQVALDHSYCRPPPTERPTTTHLQHDHGYTWMAEPEP
EAESPPVAMEEKRREKTERPYKRKHQNKKLSEIQNKLYDGRDDYNNKYSSVTFKQRDIMA
EVQVMYEFLTRGIDREDIEYLRRAYEALLAEDAQGYWLNDTHWVEHPPTDLTYSPPKKKS
KRYNNIYEDLQGHSSGSARTEGYYKMDAKLKAKYKYHHGRTAALPPPDDKKASKMQLLSR
EARSNQRRLLTAFGTDTDSDLLKFNQLKFRKKQLKFAKSGIHDWGLFAQEAIAADEMVIE
YVGQMVRPIVADVREAHYEATGIGSSYLFRIDLDTIIDATKCGNLARFINHSCNPNCYAK
IITIESQKKIVIYSKQPIGVDEEITYDYKFPLEDEKIPCLCGAPQCRGYLN