DPGLEAN04505 in OGS1.0

New model in OGS2.0DPOGS210714 
Genomic Positionscaffold2513:- 16335-21915
See gene structure
CDS Length1239
Paired RNAseq reads  186
Single RNAseq reads  746
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006325 (0.0)
Best Drosophila hit  escl, isoform A (4e-128)
Best Human hitpolycomb protein EED isoform a (1e-139)
Best NR hit (blastp)  extra sex combs [Junonia coenia] (0.0)
Best NR hit (blastx)  extra sex combs [Junonia coenia] (0.0)
GeneOntology terms







  
GO:0035098 ESC/E(Z) complex
GO:0045449 regulation of transcription
GO:0001739 sex chromatin
GO:0003682 chromatin binding
GO:0016571 histone methylation
GO:0005515 protein binding
GO:0016568 chromatin modification
GO:0045120 pronucleus
GO:0006349 regulation of gene expression by genetic imprinting
InterPro families





  
IPR001680 WD40 repeat
IPR019781 WD40 repeat, subgroup
IPR015943 WD40/YVTN repeat-like-containing domain
IPR011046 WD40 repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019775 WD40 repeat, conserved site
Orthology groupMCL14520

Nucleotide sequence:

ATGAATTTTTCTGACAATGAAGCAGACGACACTTCTAGTGTTGAAAGTACTTCGAATACC
GACAATACTTCTCGCAGTGAAACTCCGACGAACACTCGCGTGAAGAAAAGAAGACGTGGT
AAGAAAAAGGCCGTTACTAAACCAGTGAAACCTCCATATAAATTCAATTGCAGTGCAAAA
GAGGATCATGGACAACCCCTTTTTGGTTGTCAATTCAACCATCACCTTAGGGAAGGGGAA
CCTCAAATATTTGCTGTTGTCGGCAGCAACAGAGTCTCTATATATGAGTGCCCAGAATCG
GGAGGTTTTAAATTTCTGCAATGCTATGCTGATCCTGATGTTGATGAGACATTTTACACA
TGTGCATGGTCGTATGAGGAAGAAACTGGTTTACCACTCCTAGCTGTGGCCGGATCCCGT
GGGATAGTGAGAATTTTTCATCCCGCAACCCAAACATGTATAAAGCACTACATAGGCCAT
GGTCATGCTATCAACGAAGTCAAATTCCATCCTCGCGATCCGAATTTGTTGCTGTCTGCG
AGCAAGGACCATGCTTTACGGCTATGGAATATCATGACGGATGTCTGCATCGCCATTTTC
GGTGGGGTCGAAGGTCACAGGGACGAGGTCCTCAGCGCCGATTTCGACTTAAAAGGCGAA
AGGATAATGTCATGTGGCATGGACCACTCGCTGAAACTCTGGAGGCTGGATAAACCATCC
ATGAACGAAGCCATCAAACAAAGTTATAGTTTTAATCCGCACAGAGCACTCCGGCCATTC
AATTCGCTCAAAGAACATTTCCCCGACTTCTCAACCAGAGATATTCACAGGAACTACGTG
GATTGTGTGAGGTGGATGGGTGATTTAATATTATCGAAGTCGTGTGAAAACGCTATCATA
TGCTGGAAACCTGGACGGCTGGAGGACACAGACTTAAGACCTGGAGATAACTCGGTGACG
ATCGTTCACAGATTTGACTACAAGGAGTGTGAGATATGGTTCATAAGATTTGCTGTTGAT
TATAGTCAAAGAGTTATAGCTCTCGGTAACCAGTGCGGGAAGACGATGGTTTGGGAGTTG
GGCGGCGTGGCGGGAGGGTCGCGCGTGTCGCTACTAGTTCATCCGAGATGTGTGGCCGCC
GTCAGACAGGTGACTCTGTCTCGAAACGGCAAAATACTACTGACCTGCTGCGACGACGGC
ACTATATGGAGATGGGATCGGGTCCACAACGGAAGCTGA

Protein sequence:

MNFSDNEADDTSSVESTSNTDNTSRSETPTNTRVKKRRRGKKKAVTKPVKPPYKFNCSAK
EDHGQPLFGCQFNHHLREGEPQIFAVVGSNRVSIYECPESGGFKFLQCYADPDVDETFYT
CAWSYEEETGLPLLAVAGSRGIVRIFHPATQTCIKHYIGHGHAINEVKFHPRDPNLLLSA
SKDHALRLWNIMTDVCIAIFGGVEGHRDEVLSADFDLKGERIMSCGMDHSLKLWRLDKPS
MNEAIKQSYSFNPHRALRPFNSLKEHFPDFSTRDIHRNYVDCVRWMGDLILSKSCENAII
CWKPGRLEDTDLRPGDNSVTIVHRFDYKECEIWFIRFAVDYSQRVIALGNQCGKTMVWEL
GGVAGGSRVSLLVHPRCVAAVRQVTLSRNGKILLTCCDDGTIWRWDRVHNGS