DPGLEAN18987 in OGS1.0

New model in OGS2.0DPOGS205923 
Genomic Positionscaffold3744:- 26198-30148
See gene structure
CDS Length1431
Paired RNAseq reads  777
Single RNAseq reads  1947
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005894 (0.0)
Best Drosophila hit  CG5585 (5e-175)
Best Human hitretinoblastoma-binding protein 5 isoform 1 (1e-158)
Best NR hit (blastp)  AGAP010575-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  GE22415 [Drosophila yakuba] (5e-180)
GeneOntology terms





  
GO:0045449 regulation of transcription
GO:0006350 transcription
GO:0016568 chromatin modification
GO:0005515 protein binding
GO:0006974 response to DNA damage stimulus
GO:0035097 histone methyltransferase complex
GO:0005634 nucleus
InterPro families





  
IPR019775 WD40 repeat, conserved site
IPR011046 WD40 repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
IPR001680 WD40 repeat
Orthology groupMCL14841

Nucleotide sequence:

ATGAACTTAGAATTACTTGAATCTTTTGGTCAAAATTATCCAGAAGAGTTTGATGGCACA
TTAGATTCCATTTCCCTTGCTGCGACGTGCGCCTTCAACCGTCGCGGCACTCTTCTGGCA
GTGGGTTGTAACGATGGAAGAATATTCATCTGGGACTTTCTTACTAGAGGCATTGCTAAA
TCTATATCGGCACATGTTTATCCAGTGTGTAGCTTGAGTTGGTCAAGAAACTGTAAAAAG
TTACTCTCAGCATCCACAGACCACAATGTATGTATTTGGGATGTTCTCTCTGGTGAATGC
GAACAACGGTACAGATTCCCAACACCGATATTACGTGTACAGTTTGATCCTAGAAATGAT
AAGAGATTCCTTGTATGTCCAATGAGACACGCCGCTTTGTTGGTTGATACCGACGGTGAA
CATAAAATCCTGCCCATTGATGAAGATGGTGATACGAATGTGATAGCTTCATTTGACAGG
CGAGGTGACTATGTGTATACAGGGAATGCGAAGGGTAAAATATTAATACTCGACTCACAA
CAGTTAACAGTGAAGGCGAGCTTCAAAATTACCGTTGGTACTTCAAGCACTACCGGCATA
AAGAGTATTGAGTTTGCCCGACGAGGAGACTGTTTCCTTGTAAATACATCCGACAGAGTG
ATCAGAGTTTACGACGCCAACACGGTTGTGAAGTGTGGTGTGAACGGGGAACCGGAACCG
ATACAAAAGCTACAAGATCTCGTCAATAAAACTACATGGAAGAAGTGCTGTTTTTCTGGA
GACGGTGAATACATCTGCGCGGGATCAGCTCGCCAGCACGCGCTTTACATATGGGAGAAG
TCCATAGGGAACCTCGTCAAGATACTCCACGGAACTAAAGGAGAACTGCTCTTGGATGTT
GTGTGGCATCCCGTCAGACCCATCATAGCTAGCATAAGCGCTGGCGTGGTGTCAATATGG
GCTCAGAATCAAGTGGAAAATTGGTCTGCGTTCGCTCCGGACTTCAAGGAGTTGGATGAG
AATGTGGAGTATGAGGAGAGGGAGAGCGAGTTCGATGTGGAAGACGAGGACCGCTCCGTG
GACCAGGGCGCGGACAGCAGAGACGACGAGGAAGTAGAGGTGGATGTGACCACGTGTTCC
GCTGTGGCCGCCTTCTGCAGCTCAGACGAAGACGCCGAGGACGACACGCTGCTGGCCTTC
CTGCCGATAGCACCGGAAATAGAGGATCCAGAGGACGGTTGGGCGGCGACTCAGGAGACG
GTGACCCCGAGCGAGACCCCCGAGAAACTGGAACCGGCGGCGAAGAGACCCAAGAGTAAG
ACCTACGACATATCGCTGAAGATAGCGCCCCAGGAACAACCGCTCAGCTTCGGCGGCAAG
AACAAGCAGGCGGCGGGGAACAAGAAGCTCGCGGGCAGACCAAGGAAATAA

Protein sequence:

MNLELLESFGQNYPEEFDGTLDSISLAATCAFNRRGTLLAVGCNDGRIFIWDFLTRGIAK
SISAHVYPVCSLSWSRNCKKLLSASTDHNVCIWDVLSGECEQRYRFPTPILRVQFDPRND
KRFLVCPMRHAALLVDTDGEHKILPIDEDGDTNVIASFDRRGDYVYTGNAKGKILILDSQ
QLTVKASFKITVGTSSTTGIKSIEFARRGDCFLVNTSDRVIRVYDANTVVKCGVNGEPEP
IQKLQDLVNKTTWKKCCFSGDGEYICAGSARQHALYIWEKSIGNLVKILHGTKGELLLDV
VWHPVRPIIASISAGVVSIWAQNQVENWSAFAPDFKELDENVEYEERESEFDVEDEDRSV
DQGADSRDDEEVEVDVTTCSAVAAFCSSDEDAEDDTLLAFLPIAPEIEDPEDGWAATQET
VTPSETPEKLEPAAKRPKSKTYDISLKIAPQEQPLSFGGKNKQAAGNKKLAGRPRK