DPGLEAN14064 in OGS1.0

New model in OGS2.0DPOGS213651 
Genomic Positionscaffold218:- 165761-168845
See gene structure
CDS Length1539
Paired RNAseq reads  93
Single RNAseq reads  289
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001685 (9e-11)
Best Drosophila hit  combgap, isoform C (3e-24)
Best Human hitendothelial zinc finger protein induced by tumor necrosis factor alpha (9e-34)
Best NR hit (blastp)  krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis] (1e-45)
Best NR hit (blastx)  krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis] (1e-50)
GeneOntology terms





  
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005730 nucleolus
GO:0046872 metal ion binding
GO:0008270 zinc ion binding
GO:0005622 intracellular
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupND

Nucleotide sequence:

ATGTGTGAATGCAAGTGTAGATTGTGTTTAAAGGTATCATCTAATATTGTGGAAGAATTG
TGGGAAGGATGTAAAATTTTGGAACAATTGTTTACCTGTCTGGGGCACCATGTAGTAATA
AACAAAGATCTACCTAACAAAATATGCAATAATTGTGTCATGAAAATAGAAGATATATAC
AAATATCAACAATTTATCAGACAAAATGAAATAAAATTACAACAAGAATTAAATAATATT
ATAATTCATAGTGAAATTGTTAATATCAATGAAGTAAAAATAGAAGATCAAATATTAGAT
GAAGAAAAATCTATAGATCAAGAGAAAAATGTTAAAATTAAATCAGAAATTACTGAAGAA
AATGTAAAAAATATAACAGAATTACATGGAGAAGTAATTCAGAAAAATGATGTTGAATTA
TATCCAGTCAAACAAGAAATTGAAAACAAAATTGTTTATAGTAATGAAGACAAAGATATT
AATAAAGAAATATCAAATGCAATTAAACTAGACGAAACAAAGAGATTTTCATGTTTAACA
TGCTTTGAAGTATTCCCAAATCAATTGGAACTCTTAAGGCACTACCAGAATGTTGAATTA
GAAAAATATAACAAGAATAATACAGATGTTGAAAGTAAACCTGTAAAGTACAACGTGTTC
ACAGACGATAATGGTTTATATTACAAATGTGAGAGGTGTTACAAAAAGTACCGACAGAAA
TCATACATAAACAGACATGTATTGAGTCACATAGAAAGAAGACCATTCCTCTGCAAGCTG
TGCGGTAAAACTTACCAAACAGCATCAATAATAGTTTCCCATGGAAAGATACACACGGGA
GATATATATGCATGTACATACAACTGTGGCTACCGATCTGTACACAAACATGTTGTCAAA
AATCATGAGAAAAGACACAAAGGAGAATTTAAGTATAAATGCCAGACATGCGGCAAAGGA
TTCCAAGTGAGATCATGGTACCAACAGCATCAGAACATTCATGATGGAGTCAAGCCATTC
AAATGCGATATCTGTGGCATGAGTTTTCATCTGCATCGATATCTAACTACACATCGAAGC
AATGTACACCCACAGTCGTCTGTTCGCAAACCGTGGGTTTGCAAGCAATGTGAATATCCC
TGTGACTCCAAAAATAGTCTAAATTTGCATTTGAAGGATAAACATGGCCTAGTTATAAAG
AAATCAAACTTGTGTGATGTTTGTGGTAAAGTCTTGAAGGATTCCCAACAGTTGAAAGTA
CACAAGAGAGCCGTACACTTGAACATCAAGCCGTATGTCTGTGGTACATGTAACAAGTCG
TTCCCCAAAAAATATACTCTCAAGAACCACGAACAGACACACAAAGGAAAGACGTTTTTA
TGTTCCATGTGTGACAAGATGTTCGCTAAAGACGCCAGTCTACAGAAACACGTACAAAGG
TGTCATATAACAAGCAAGTACAGGTGTCACGAATGTGACAAATCATTTTCATCAAAGATG
ACATTGACAGTTCACGTGAAGAATTGCAACCGGAAGTGA

Protein sequence:

MCECKCRLCLKVSSNIVEELWEGCKILEQLFTCLGHHVVINKDLPNKICNNCVMKIEDIY
KYQQFIRQNEIKLQQELNNIIIHSEIVNINEVKIEDQILDEEKSIDQEKNVKIKSEITEE
NVKNITELHGEVIQKNDVELYPVKQEIENKIVYSNEDKDINKEISNAIKLDETKRFSCLT
CFEVFPNQLELLRHYQNVELEKYNKNNTDVESKPVKYNVFTDDNGLYYKCERCYKKYRQK
SYINRHVLSHIERRPFLCKLCGKTYQTASIIVSHGKIHTGDIYACTYNCGYRSVHKHVVK
NHEKRHKGEFKYKCQTCGKGFQVRSWYQQHQNIHDGVKPFKCDICGMSFHLHRYLTTHRS
NVHPQSSVRKPWVCKQCEYPCDSKNSLNLHLKDKHGLVIKKSNLCDVCGKVLKDSQQLKV
HKRAVHLNIKPYVCGTCNKSFPKKYTLKNHEQTHKGKTFLCSMCDKMFAKDASLQKHVQR
CHITSKYRCHECDKSFSSKMTLTVHVKNCNRK