DPGLEAN01117 in OGS1.0

New model in OGS2.0DPOGS200457 
Genomic Positionscaffold1378:+ 13817-41183
See gene structure
CDS Length7131
Paired RNAseq reads  1312
Single RNAseq reads  4039
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005445 (0.0)
Best Drosophila hit  eggless (2e-116)
Best Human hithistone-lysine N-methyltransferase SETDB1 isoform 1 (2e-56)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum] (0.0)
GeneOntology terms


  
GO:0005634 nucleus
GO:0018024 histone-lysine N-methyltransferase activity
GO:0048477 oogenesis
GO:0051038 negative regulation of transcription, meiotic
InterPro families




  
IPR001739 Methyl-CpG DNA binding
IPR001214 SET domain
IPR007728 Pre-SET domain
IPR003616 Post-SET domain
IPR016177 DNA-binding, integrase-type
IPR003606 Pre-SET zinc-binding sub-group
Orthology groupMCL12327

Nucleotide sequence:

ATGGACAGCAATATAAATAGCGGGTTAAATAAGCCGGTAAGTGAGGAAAAAACTAAGGAT
AGTGAGAAAATAGCTGATTTTGAATTAATTTTACTACGTAGAAGACAGAATGCCGCGAGA
GTTCGTGCCTGTAGGGAAAGGAAAAAAGCTCTTGGCCTCCAGTCATCTTTACCAGTTGAT
ATTAAAACTGAGCCGCCAGATGACATCAGTGGGCTAACAGATTCAATGCCAAGTACAAGC
TGGAGTCCAGCGTCAAATCTTGGATTTAATGCAATTTCAAGTGAGAAGCCCGCAGTTGAT
GATATTGATGCTGAAAAGGATCTGTATCAAAAGCAGCTAAATGCAGAAAGATGTCGTAGG
TATCGTCAAAAATGTAAACTGAAATCTGGCGTCCGTACTAAGCGAACAACAGGGGAACTG
GATACTAGTTCCAGTCGAGACTTAGTTGCTGGGGGAGATGGATTTGAAAGCTCTACCAGT
CAGACACAAGGTTCTTCCAACGACGAATCTACATCATCAGGTGCTAAAAGTACAACCGCC
GCTACAAACAACGCTAATTCCAGCTCTGCTGCGGGGTTGTACTGTCGTCGGTACAGGGAG
AAACTGAATGCACGAAGAAAACGAGTAAAAGAAGATCCCTCATCATTCTATACATTGTAT
GTCAAACATAACGGAGCTCATAACTTATTCGAAAACTTATTCGATAATAATCCATTTGGT
TTTTCGTGTACTGTTTGTGATAGACTATGGTTTGAAAATGAATTGAGAAGTCCTCCTTCG
TCTTGCGGTGAAATTTTAAGACAGATATGCCCAAATGTCCCCTGTCAAGATATTGTAGTT
TGTGCCGCCTGTAAGGTCTCATTAGTTGCTGGAAAAATTCCGAATTTGGCGGTGTACAAT
GGTTTTAAGTATCCGCCTAAGCCGAATCTCCCTCAAATGGATATGGTTTCAGAGCGATTA
ATATCACCAAGATTACCCTTTATGCAAATTAGACGTTTGCGATACGTTGAGGGACAACAT
AGTGTCACTGGTCAAGTTATCAATGTGCCCATTCATGTTGACAACTTGGTCCAAACTCTG
CCTAGAAATATGGTCGACGATTTCTGTATAAATGTTCACGTAAAAAACAAATCACTGCAC
AAATCTAGTTATTTGCAAGGATTATTAAAAAAGCGGGTTATTAGAGACTGGCTTGATTAT
CTTATAGACACCCCGCTTTATAAGCATTATAATATAAAAATCAATCCGTATTTCTTAGAG
GATTTGAACAACGAGAGTGAAATGCCAGACATTGATTTAAAGGATATCGCTGAACCAATT
GTAATTGGTGATAGCCTAGTTGCTGAACAGCATACACTTTTATGGTCCACGGAAAGAGAT
CTACAAATAGCACCTGGAGAGAATAAAAGGTCTTTGAGTTTACTGTTTGATGCTTATGCG
GAAGAATTATCTTTCCCAACTATTTATTACGGTCAATTTCGGAAATTCAAAGACGGTGTT
AATTGTAAAGCACACTCAATTGCAACAAGTGAAATTCGGCGGACCGATCGACGAGGGGCC
ATTCCTCGCCACTTGCTCTTCTTAGTCATGAAGGTTATTTTATTTAGACTGAGTGAAAAC
ATCGGTATTGCTTCTAAATATATCGTGGAAGACACAAAAGTAACGAAAGAGCAAATATTA
TCTTCCGACTATCTTAACGACTGTCGGGAACCAAATTTGTCTTTTCTCAAATTCATACCA
AATTCAGTACAATATTGGCAAAATCGAAAAAAAGATCTCTTTGCAATGATAAGACAACTT
GGGACCCCAACAGTTTTTTTGTCCCTGAGCGCTAATGAAATATCATGGAAATGGCTTTTA
AAAACTTTGCATAAACTGAAACACGGCACGGAGATCTCTGATTTAGAAATTGATCAGATG
CATTATAAAGTCAAGGCAGAATTAATAAATGAAGATGCTGTGACTTGCGCCATTTATTTT
AATAAACTCGTCAATGTTATAATGACAATTCTTCAAAATAAAACCGTTAGCCCATTCGGT
AAACATTATGTACGACATTATTTTAGGAGGATTGAATTTCAACACAGAGGAAATACTCAT
GCACATATCTTGCTGTGGTTAAATCAAGCGCCTAACGATGCTTTTGGAGGTGACATGACT
TCTGCTATTAAACTTATCGATAACTTAATTTCAGTATCAAAGACAGAATGTTCAGGGCAC
ATTGAATTGGTCACACACCATCACACATATTCGTGTTATAAAAATAATCAAAACCAACTA
AAGTGCAGATTCAATGCTCCATATATGCCTAGTAGAACTACCGTTTTGCTTGAACCTATG
GCTAAATCGTCTGATGAGGAAAAACGAATTTATAACGAGTATAAAAAGAGATATCACATC
ATACACCAAAAACTGGAATGTCATGACTATTATAATATTGATGATTTTTATCGTAAAAAT
GGCATAAAGTCAGATGTAGAGTATTATAAAATTCTGTCCGCTGGAATTCTACGACCGATG
GTTTTCGTTAAACGTCACCCTAATGAAAAATGGCACAACTTTTTCAATCCCTTCATATTT
CACCATTTACAATCGAGCATGGATATCCAATACATCACAGACGAGTATTCTTGTGCTGCT
TATATTGCGGAGTGTGTAAACAAATCCGATCGTAGCGTCAGTAATCTTCAACGGGAGCTG
TTGGATCTTTTGGAGAAAAATCCAAATCTAGATTTGGTTGACATGACCAAACATATGAGT
GTTAATATTTTGAATGCAATGGAAATGTCCAGCCAAGAAGCAGCATGGTTCCTTCTTAGG
GAACCATTGTGTAAGTCGACTCTTAAAGTAGAATTCATACCCACAATGTGGCCTCAGGAA
CGTCATCGTTTTAGAAAAACTGAAAAAGAATTAGACCGTCGTCCCGATGAAGACACAAGT
GTTTGGAAAGAAAACTATTTTGAGAATTACGAAAATCGACCAGCAGAGTTGGAAGATGTC
TCACTGATTCAGTTTGTTGCTTGGTATAAGACCAGAACTAGAAAAAAAATGTCAGGACCT
CAAATTGCGAACCAAAACTGGGATTCGGAAGACGAAGAAGACGTTGAAGAAGATATAACA
CCAGAAGAAAATATGAATCAGCAAAGTGAAAAAGTATTTTACCGTCGTAAAACTCCAAGA
GTCGTAAGCTATTTGCGTTACGATATGACTGATCATGAGCTAGACTACAAACGAGAAATG
GTTACCTTGTTTATTCCATTTCGTAACGAAGAAAGGGACATTTTAGCTGATATGAGTTTC
AATCAAATCTATGAAGAAAATGAAGAACTCATTTTAGTTCGCCGAGAAGACTTCGAAGGA
AATTTGGACATTGATAAAATTTTTGCGGCATACAGAATATTATGTCACCCTGATCAGAAT
GGCAGAGACCTAGAAATTTTTCCGCTTATGCATCACGATGCGGATCCATTCAGGGAATTT
TCCAACAATCCTATTCCAGAGCAGGAAGTTGTTATATTAAATGCTGAAGATGAGTCAGAA
GAAGTCATTGATAGTAGAAAATTAGATGTAGCTGAGAGCTATGTTGATTTACGAGACAGT
TCCGAGGAAGACACTCAAGACACGAAGAAAACTGATCATGAAGTAGACAATCTTAATACA
ACGGAAGATATTGATGAAGATGAGCCAGCGAAGAAAGAAGACACGGTATTGCCCATGGTG
AGGTGCGTCAACAAGATGTGCGCTCGCACTTCCTTCGACTTCTACACAGCTGAGCGCAGC
ACAGTGGACTTCTATGATCCAGAGAGAAAGAAAAGAGGTTATGTTTGCAGAACCTGTCTC
AGTCTGGTCGAAGAAAGGAATCAGCTGTTGATCAGCGCCTTTAAATCCCAGACGCCCCTC
CTACAATTGGAGACTCGTCAGCAGGAAGAGGATCTGGTAGAGATATCAGAATCGCAGTCC
GAAGATAATCTAATACCGGAAGTTGATGATGACGTCATAGGCGAGGAGGGGGCTAGGTTT
ATAGAGGAGAAGTTGACTGATGTCCTGAACGAGACCTGGGTCAAGTACAACATGGATGAC
CGGCTCCAGGAGGCACAGGATCAGCTAAAACAACAGCTGGAACAACTGCAAAAGCACAGT
TTGGAGATCGACCAGTTACTGGACGAGTGCCAGATGTCCACAGATAAGCTCCGCACAGAG
ATTTACTCTACGTTCAAGCCAGACATCAGGAAACTTCCGTCGCTACTTATATACGACGTG
CCAGATTGCTCTTACACCTTCGTCGACCTCGCTGAACAGGGAAGTAGACTCTTAAATCTT
CGGAAATCATCTTTATCTGAGTCTCCAACAAAAAAATCCACAACAGATCAGGATTCTGAT
GAATCAGTGGTACATATATCTGTGGAATCCGCTCCCTCCCACCTGCCTCCCGTGGGGGAG
TTGTCTTATCCTCAGTTGGAGGTGGGAATGATAGTCTACGCATCTAAGAATGCTTTGGGG
ACCTGGATGAAGGGCAAGATATTGGAGATAACACCAAAGTCAGAAGATATTGAACGGATC
TACTTGTGGGATCTTCAGACCAGAGTTGCTATTCTAATTTACTCTATCAACATTGGTCAT
CCAGCCAATCAAGCTACTCTCATTGATCATTCAGTCAATCATCCAATCCATTTAAGCAGA
GAACTGCCACACTGTACACGTGTGATAGCCTTGTTCAAGGACATCATGAGGCGCGAGTTT
TTTTACCCGGGTATCGTCGCAGAAATGCCCAACCCAAGGAATAGTTACCGCTACCTGATA
TTCTTCGACGATGGCTACTCTCAATACGCGCCGCACTCTAAGGTCCGTCTGGTGTGCGAG
TGCGCGTCTCACGTGTGGGAGGAAGTACAGCCCAAGTCGCGAGAATTCGTCCGAAAATAT
CTCCTGGCTTACCCTGAGAGACCCATGGTGAGGTTGCACCCTGGACAGAGCTTGAAGACG
GAATGGAAGGACAACTGGTGGTCATCCGTGGTGGTGTCGGTGGACGCGTCGCTGGTGGAA
ATCCAGTTCCTCCAGCTGGAGAGACGAGAGTGGATCTATCGAGGATCCACGAGACTCGCC
CCCCTGTACCTGGAACTGCAGGCCGCGGAGAGACACAGGCCCAGAGCCCTGCCACGGACA
CAGACCACGAGGACGAACATGCCCTACGTGGAGTACACCAGATCTGAAGAACAGACGAGC
AAACAAGCCAAGACTTCGCCACAGCAACAACAGAGCGAGGGATTTCCTCGTCAGCGAGCC
GTTGCCAAGAAGACTACCACGAAGACTCGCCAACCACCCCGTACAGCCGTACAGAGCCTC
GACCACTTTACTAGTAAACTAGTGGGATTTCCTCGTCAGCGCGCCGTTGCCAAGAAGACT
ACCACGAAGACTCGCCAATCATCCCGTACAGCCGTACAGAGCCTCGACCACTTTACTAGT
AAACTAGTGTACTACAGTCCAAAGAAACATGTGAAGCCATACAAGATGGTGCCCCATACT
TGCTCGACTGCGTGCAAGAGGACGGATGTTTTGGAACTCAAAGATTTAAAATCTTACAAT
CCATTAGCCAAGCCACTGTTGAGTGGCTGGGAGAGACAGATAGCCAATTTCAAGGGCAAC
AAGGTTGTATTGTACTTGTCTCCGTGCGGTCGCCGCGTCCGCTCTCCGCGGGAGCTACAT
CGCTATCTGCGAACCGTTGGTAGTCTGGACGGTCAGCTGGAGAAGCTCTTCACACCATCC
ACGCACTGTCTGGCCGAGTTTGTGCTCAACAAATACTGCGTCAGCAAGAAGGACTTATCA
AATGGCAAAGAGAACGTCCCAGTGGCTTGCGTCAATTACTACGACGGATCACTGCCAGAG
TTCTGTTTCTACAACACTGAGCGGACTCCGACCGCTGGGGTTCCACTCAACCTGGACCCG
GAGTTCCTGTGTGGCTGTGACTGCGAGGACGACTGCGAGGACAAGAGCAAGTGCGCCTGC
TGGCAGCTGACTCTGGAGGGCGCTAGGACGATAGGTCTGGAGGGGGAGAACGTCGGTTAC
GTTTATAGAAGACTAATGGAACCGCTCCCGACTGGTATTTACGAGTGCAACTCTAGGTGC
AAGTGTAAAGACACTTGTCTTAACCGCGTCGCTCAATATCCACTTCAGCTAAATTTGCAG
GTGTTCAAGACCCAGAACCGCGGTTGGGGCATTCGCACCCTGAATGACATACCCAAGGGG
AGCTTCCTCTGTACTTACGCAGGGAAACTACTAACAGAGGCCACAGCTACCCTCGACGGT
CTGAACGAGGGTGACGAGTACCTGGCGGAGTTGGACTACATCGAGGTCGTGGAACAGATG
AAGGAGGGTTACGAAGAGGACATACCAGAGAACATCAAGAAGATGGATGAGGCTCAAATA
GCGGAACAACTCTCGATGGCGGGCGAAGAAACACAGTCATCGTCTTCAGGGGAAAGCAGC
CCCAAAAGCGCTGAAAATGACGACCTTAGCCTCGAAGACATTGGTCCGGGGGTCACAGAG
TCCAGCAAAGAACTAAGGGGGAAAGACTCAAAGACAGACGAAGAAATAGAGAGTGCGGTG
CTGAAAGTTACCGAGAGATTAGTGCCCACAGAAGAAGATGAAACAGTTTTCACAGAGGAA
CAGAAATCTGTTGTCATAGAAGTGGAAAGTTCAGTGCCCACGGAGGACGAACTCTCTGAA
ATGCAGGAGGAAATCGATGAAGATTATGATTCTTCGAGTGATGACGGAGAAGATCGAGAA
CCTTCGAATTTCTCAGCCAGTGCTGGGATGGGAGCAAAGAAGTTTAAATCAAAGTATAGG
TCTGTCCGTAGTCTGTTTGGTGAAGATGAAGCCTGCTACATCTTGGACGCCAAGGTTCAA
GGGAATATAGGCAGATATCTCAATCACTCGTGCGTGCCGAACGTGTTCGTCCAGAACGTG
TTCGTGGACACGCACGACCCTCGCTTCCCGTGGGTGGCTTTCTTCGCTCTCACAGCCGTG
CGGGCCGGGGGCGAGCTCACCTGGAACTACAACTACGACGTAGGTTCCGTGCCCGGGAAG
GTCCTCTACTGTTACTGCGGGGCTCCGACGTGTCGCGGCAGGTTACTGTGA

Protein sequence:

MDSNINSGLNKPVSEEKTKDSEKIADFELILLRRRQNAARVRACRERKKALGLQSSLPVD
IKTEPPDDISGLTDSMPSTSWSPASNLGFNAISSEKPAVDDIDAEKDLYQKQLNAERCRR
YRQKCKLKSGVRTKRTTGELDTSSSRDLVAGGDGFESSTSQTQGSSNDESTSSGAKSTTA
ATNNANSSSAAGLYCRRYREKLNARRKRVKEDPSSFYTLYVKHNGAHNLFENLFDNNPFG
FSCTVCDRLWFENELRSPPSSCGEILRQICPNVPCQDIVVCAACKVSLVAGKIPNLAVYN
GFKYPPKPNLPQMDMVSERLISPRLPFMQIRRLRYVEGQHSVTGQVINVPIHVDNLVQTL
PRNMVDDFCINVHVKNKSLHKSSYLQGLLKKRVIRDWLDYLIDTPLYKHYNIKINPYFLE
DLNNESEMPDIDLKDIAEPIVIGDSLVAEQHTLLWSTERDLQIAPGENKRSLSLLFDAYA
EELSFPTIYYGQFRKFKDGVNCKAHSIATSEIRRTDRRGAIPRHLLFLVMKVILFRLSEN
IGIASKYIVEDTKVTKEQILSSDYLNDCREPNLSFLKFIPNSVQYWQNRKKDLFAMIRQL
GTPTVFLSLSANEISWKWLLKTLHKLKHGTEISDLEIDQMHYKVKAELINEDAVTCAIYF
NKLVNVIMTILQNKTVSPFGKHYVRHYFRRIEFQHRGNTHAHILLWLNQAPNDAFGGDMT
SAIKLIDNLISVSKTECSGHIELVTHHHTYSCYKNNQNQLKCRFNAPYMPSRTTVLLEPM
AKSSDEEKRIYNEYKKRYHIIHQKLECHDYYNIDDFYRKNGIKSDVEYYKILSAGILRPM
VFVKRHPNEKWHNFFNPFIFHHLQSSMDIQYITDEYSCAAYIAECVNKSDRSVSNLQREL
LDLLEKNPNLDLVDMTKHMSVNILNAMEMSSQEAAWFLLREPLCKSTLKVEFIPTMWPQE
RHRFRKTEKELDRRPDEDTSVWKENYFENYENRPAELEDVSLIQFVAWYKTRTRKKMSGP
QIANQNWDSEDEEDVEEDITPEENMNQQSEKVFYRRKTPRVVSYLRYDMTDHELDYKREM
VTLFIPFRNEERDILADMSFNQIYEENEELILVRREDFEGNLDIDKIFAAYRILCHPDQN
GRDLEIFPLMHHDADPFREFSNNPIPEQEVVILNAEDESEEVIDSRKLDVAESYVDLRDS
SEEDTQDTKKTDHEVDNLNTTEDIDEDEPAKKEDTVLPMVRCVNKMCARTSFDFYTAERS
TVDFYDPERKKRGYVCRTCLSLVEERNQLLISAFKSQTPLLQLETRQQEEDLVEISESQS
EDNLIPEVDDDVIGEEGARFIEEKLTDVLNETWVKYNMDDRLQEAQDQLKQQLEQLQKHS
LEIDQLLDECQMSTDKLRTEIYSTFKPDIRKLPSLLIYDVPDCSYTFVDLAEQGSRLLNL
RKSSLSESPTKKSTTDQDSDESVVHISVESAPSHLPPVGELSYPQLEVGMIVYASKNALG
TWMKGKILEITPKSEDIERIYLWDLQTRVAILIYSINIGHPANQATLIDHSVNHPIHLSR
ELPHCTRVIALFKDIMRREFFYPGIVAEMPNPRNSYRYLIFFDDGYSQYAPHSKVRLVCE
CASHVWEEVQPKSREFVRKYLLAYPERPMVRLHPGQSLKTEWKDNWWSSVVVSVDASLVE
IQFLQLERREWIYRGSTRLAPLYLELQAAERHRPRALPRTQTTRTNMPYVEYTRSEEQTS
KQAKTSPQQQQSEGFPRQRAVAKKTTTKTRQPPRTAVQSLDHFTSKLVGFPRQRAVAKKT
TTKTRQSSRTAVQSLDHFTSKLVYYSPKKHVKPYKMVPHTCSTACKRTDVLELKDLKSYN
PLAKPLLSGWERQIANFKGNKVVLYLSPCGRRVRSPRELHRYLRTVGSLDGQLEKLFTPS
THCLAEFVLNKYCVSKKDLSNGKENVPVACVNYYDGSLPEFCFYNTERTPTAGVPLNLDP
EFLCGCDCEDDCEDKSKCACWQLTLEGARTIGLEGENVGYVYRRLMEPLPTGIYECNSRC
KCKDTCLNRVAQYPLQLNLQVFKTQNRGWGIRTLNDIPKGSFLCTYAGKLLTEATATLDG
LNEGDEYLAELDYIEVVEQMKEGYEEDIPENIKKMDEAQIAEQLSMAGEETQSSSSGESS
PKSAENDDLSLEDIGPGVTESSKELRGKDSKTDEEIESAVLKVTERLVPTEEDETVFTEE
QKSVVIEVESSVPTEDELSEMQEEIDEDYDSSSDDGEDREPSNFSASAGMGAKKFKSKYR
SVRSLFGEDEACYILDAKVQGNIGRYLNHSCVPNVFVQNVFVDTHDPRFPWVAFFALTAV
RAGGELTWNYNYDVGSVPGKVLYCYCGAPTCRGRLL