DPGLEAN12449 in OGS1.0

New model in OGS2.0  DPOGS209702
Genomic Position  scaffold567:+ 37337-41850
CDS Length  1611
Paired RNAseq reads  285
Single RNAseq reads  689
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001602 (2e-113)
Best Drosophila hit  CG17829, isoform B (4e-48)
Best Human hit  histone H4 transcription factor (4e-51)
Best NR hit (blastp)  PREDICTED: similar to MBD2 (methyl-CpG-binding protein)-interacting zinc finger protein [Tribolium castaneum] (4e-62)
Best NR hit (blastx)  PREDICTED: similar to MBD2-interacting zinc finger [Nasonia vitripennis] (1e-63)
GeneOntology terms
GO:0000077 DNA damage checkpoint
GO:0010843 promoter binding
GO:0006281 DNA repair
GO:0001701 in utero embryonic development
GO:0019899 enzyme binding
GO:0046872 metal ion binding
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043193 positive regulation of gene-specific transcription
GO:0045184 establishment of protein localization
GO:0000083 regulation of transcription involved in G1/S phase of mitotic cell cycle
GO:0015030 Cajal body
GO:0042393 histone binding
GO:0016566 specific transcriptional repressor activity
GO:0045445 myoblast differentiation
GO:0003713 transcription coactivator activity
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0005654 nucleoplasm
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL14476
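
The Genomic Position field in this record packs scaffold, strand, and coordinates into one string. The sketch below shows one way to split it and compare the locus span with the CDS length. The function names `parse_position` and `locus_span` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    match = re.fullmatch(r"([^:\s]+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

def locus_span(start, end):
    """Length of the locus, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

scaffold, strand, start, end = parse_position("scaffold567:+ 37337-41850")
span = locus_span(start, end)
cds_length = 1611  # value of the CDS Length field in this record
print(scaffold, strand, span, span - cds_length)
```

Under that assumption the locus covers more bases than the CDS, and the difference would correspond to sequence within the gene model that is not coding.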

Nucleotide sequence:

ATGGAGGAAACAGTAACTAATGCGTCGGATGTTGATAATTTTAAGCTCAAAAGATGTACA
GATTGGTTGCTTCAGCAGAATAATCCAGACAAGTTGACCCGACAGAATCAGAATGATATT
CAGTTTATTATCGAAACGAATGCCCATAGAAAGAAATTTTTCCTTTCCGCTGCAGAGGAT
GAGGCAACAGTTCCAACTGGGGTCGATGAAAACACAGTCGAACCGGCAACCGTGCCTTTG
GCCCGTTTACGTAAAGACAATATACGGATGGAGTGTGAATGGCAATCCTGCAGGAAGTTC
TTCACTAACTATGAAGTGTTTCAGAAGCATGTAACAAAACATGCCTCTGACTTACATGTT
ATTGATATGGAAGGTGATGTGGACTATGTATGTCTGTGGGATATTTGCGGTCATCGCACC
AAGGACTTTGTTGAGATGGTGCGTCATATTAGCTATCACGCTTATCACGCAAGACTTCTA
GCCATCGGTTACAATGCTCGAGCTACACTTAAACTGGACCAGTGTAAGAAGGACTCCAGC
CGCCGTAATAGACTACCCTCGCTGAAGTCCGATCATTGTTGTATGTGGATTGGATGTTCG
GAAACTTTTTTTTCTATACAGACGTTTCTAGACCACATGAAGCATCATATTTTCTACTCC
GACGACTATCTCTGCTCGTGGGCCGGTTGTGGAGCGACCTTCACTAAACGACATTCCCTC
GTACTGCATCTGAGATCACATACACAAGAGAAAACCATAGCCTGCTTCCATTGCGGCAGA
CATTTCACATGTAACAGGAAACTTAGCGATCATTTAGCGAGGCAGAACGTAGACCCGTCG
ACGGGCTACCCGTGTAACATGTGCGGCACCGTACTGGCGAGCGCGTACCTCCAGCGCGAG
CACGCCCGCCAGCACGTGTCGGCGTACGCGTGCACTCTGTGCGACATGTCGGCGACCACC
CCCGCCGCTCTCGCCCACCACGTGCGGTACCGACACCTCGCCGACCACGCCAGGAGCTTC
GCCTGCCCGCATTGTGTATACAGAGCGGTGACTAAATGTGATCTGCGCAAGCACATACTG
ACACATACAAGAAAAGCAAAGAAAAAGACTAAAGACGATAGCGAGGATTCCGATGTTTCT
GATGCAGAAGTCAAAAAGAAAAAGGAGCCAAAGAAATACGTGTGTCACTTGTGTCAGAAA
GACAGCAAGATATTCTCGCGCGGGACACGACTCACCACACACTTAGTAAAGGTACACGGA
GCACAATGGCCGTTTGGACACAGTAGGTTTAGGTATCAAATCAGCGAGGACGGCATGTAC
AGGCTGACTACAACGAGATTTGAAGTTCTAGAAGTCTCCAAGAAGATTGTTGACGGCTAC
AGCGGTCCGAAGGAATCACTGACTAATACATTCGAATTCGATCTGAAGCAGACGGCGGAC
GCCACGGAGACCACGCCCAAGAGGTTCGAAATAACCTTGAAGAATACCAACAAGAGTGAC
GAGGGAGGCTGCAAGCAGGCCGGGGCTGTGGAGATAATGATGTGCGATGTAGACCAACAA
GGAAACATTATAAGCACCGAGACCATTAAGTCTGACGTTGTTTATACCTAA

Protein sequence:

MEETVTNASDVDNFKLKRCTDWLLQQNNPDKLTRQNQNDIQFIIETNAHRKKFFLSAAED
EATVPTGVDENTVEPATVPLARLRKDNIRMECEWQSCRKFFTNYEVFQKHVTKHASDLHV
IDMEGDVDYVCLWDICGHRTKDFVEMVRHISYHAYHARLLAIGYNARATLKLDQCKKDSS
RRNRLPSLKSDHCCMWIGCSETFFSIQTFLDHMKHHIFYSDDYLCSWAGCGATFTKRHSL
VLHLRSHTQEKTIACFHCGRHFTCNRKLSDHLARQNVDPSTGYPCNMCGTVLASAYLQRE
HARQHVSAYACTLCDMSATTPAALAHHVRYRHLADHARSFACPHCVYRAVTKCDLRKHIL
THTRKAKKKTKDDSEDSDVSDAEVKKKKEPKKYVCHLCQKDSKIFSRGTRLTTHLVKVHG
AQWPFGHSRFRYQISEDGMYRLTTTRFEVLEVSKKIVDGYSGPKESLTNTFEFDLKQTAD
ATETTPKRFEITLKNTNKSDEGGCKQAGAVEIMMCDVDQQGNIISTETIKSDVVYT
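
The protein sequence above can be checked against the nucleotide sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)

first_codons = "ATGGAGGAAACAGTAACTAATGCG"  # first 24 nt of the CDS above
last_codons = "GTTGTTTATACCTAA"            # last 15 nt, including the stop codon

print(translate(first_codons))  # MEETVTNA
print(translate(last_codons))   # VVYT

# A 1611-nt CDS ending in one stop codon encodes 1611 / 3 - 1 residues.
expected_residues = 1611 // 3 - 1
print(expected_residues)        # 536
```

Passing the full nucleotide block to `translate` (line breaks are ignored) should reproduce the listed protein if the record is internally consistent.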