DPGLEAN12852 in OGS1.0

New model in OGS2.0DPOGS205148 
Genomic Positionscaffold698:- 21070-24593
See gene structure
CDS Length2049
Paired RNAseq reads  894
Single RNAseq reads  1991
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008173 (2e-68)
Best Drosophila hit  CG11247, isoform C (3e-19)
Best Human hitzinc finger protein 573 isoform 4 (9e-20)
Best NR hit (blastp)  PREDICTED: similar to zinc finger protein [Tribolium castaneum] (3e-27)
Best NR hit (blastx)  krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis] (1e-35)
GeneOntology terms




  
GO:0006355 regulation of transcription, DNA-dependent
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0005622 intracellular
GO:0046872 metal ion binding
GO:0008270 zinc ion binding
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL40234

Nucleotide sequence:

ATGGTAGCAAGCTTCCATCCTTTCAAGGTTACCGATGATAAAAATTTTCCAAACAAGATA
TGTTCAGATTGTTTGAACCGAACAATGAATTGCTACCTGTTTGCCCAGCAATGTGAACGA
TCCGAAAGATCATTACGGAATTGTTTCGAGGATATGTACGATAAATTTGAGAAACTTGAT
CCGATTGAACCCGTAAAACGGAGAGGCAAACCTAAATTGAATCCCAACTGTAACATTTTA
TACACAGAACACAACGAAGTCATGAATTACGCTGAACCCATAATAAATATTATCAACACT
GATACTGTTGCCATAGAAGATAATTTCATCACTGAGTTGGAATGCCAGAAATGTTGGCAA
GTTTTACCGACGGCAGAATCACTATTAAATCATGAGAAAATACATCCGAAATCCATGTGG
TACAACTGTAGGTTGTGTGGGAAGTCGTTTGTTAAACGATGCCATTTGAAGAAACATTTG
AAATCTCACAGACAACTCAGTAATTCGGATAACACTGAATCTTCATACAAATGCAATGAC
TGCGGCATCAGCAGCGATACATTATCAGATCACTATCAGCACATAGAGAAACACAAGTTC
AAAGATATGTTTGAGCATTTAATTGAGGGGAATATTGGTAGCTTTTGTGCTATCTGTATG
GATAAAGGTTTGAAAATGGTCGAGTTGTCCGAAATGGTACATTTACACGGAGGTTATCCC
GCTTTAAGCAGGGACTTGACTATTCAGAATGTATTGACAGCGACTATTCCTGGTATTCAA
CTCGATAGCTACATGGGGAATAAAATCTGCGACAGCTGTGTCAATCACGCTCTGAACACA
TATATATTTATAAGCAAAGTACAATATGTCCGGAACAGATTGAACACTTCGGTAACACTT
ATGTTGGATAATCTTAATATAGATGATCCAGAGAGTAATGTTATGGTCGAAATATCTAAG
GAACTGATACTACCATCTGTAAAAGATGCTGATGATACAGATGATTCAGATGAATCCGAC
TACAAAGTAGAGGTTTTAGAAGACGAATTCCGGGTAGAATACAGCGAGAGCGGATCGGAA
TCGGATTTTAAAGAGGATTCTATAGTCTTGAGCAACGATATCCAAGATCAACTCAAAAAT
GTCACTAAGACCTACAGCAAGAAAGTTTTCAATGGATTTCAAATGAAGAATGATGAAGAC
AGTCTAGACGTTTGCAGCGAATTCCTGACTTTCAAGAAGAGAGTGAAAACCAGAAAACGC
GCCCCAAAAATACGTTACACCTGCCCGTTCTGTAATAAAAACTTCATATCAGATTACTTC
CTCAAGAAGCACGCTCTGAAACACATCAACCGGGCGGTCGAATGCGACTTATGTTACGAC
CAGTTCAAATCGAAGTTCCATCTCTTCGAACACAAGAAGATGATGCATCTGTTGCAATCA
CAGAACTATATGACATGCAACATCTGCGGTAGAACCTTCGAAAGTGCGAACAAAATGAAA
ATACATCAGAAATGCCACAGATACAAGGAGTGCCACCTGTGTAACAAATATTTCATCAGC
CAGAAGTACTATGACATTCATATGCAACGGCACGCGGCCAGGTTCAACACATACAGGAAC
AGGGACGAACAGACATGTAGTTTTTGCGAGAAGGCCTGTTCCAACGAAAACGAACTCTCT
CTTCATGTCAATAAAGTCCATTTACAAATTAAACCGTACAGTTGCGATATGTGCGAAAAG
CAGTATTACACGGAATACAACTTGCTGAGTCACAAAAAACTTCACAGTCTACCATGCAAG
GAGATTTGTGAATTTTGTAACAGAGTATTCAAATGTAGGAAGAATTTGGTCATACACGTC
AGAAAGCATATAGGCATAAAACCACATAACTGTCCTGTTTGCAGACAAGCATTCTATTCA
GACAGCATAATGAAGAATCACATGAAAAATTACCACGGAGGTAAATTCTGTTGCAGATTA
TGCCGGACTGTTCTCCAGAGCCAGTTCGATTTGAAAACTCATATAAACGTTGCCCACAGT
TCCATGTAG

Protein sequence:

MVASFHPFKVTDDKNFPNKICSDCLNRTMNCYLFAQQCERSERSLRNCFEDMYDKFEKLD
PIEPVKRRGKPKLNPNCNILYTEHNEVMNYAEPIINIINTDTVAIEDNFITELECQKCWQ
VLPTAESLLNHEKIHPKSMWYNCRLCGKSFVKRCHLKKHLKSHRQLSNSDNTESSYKCND
CGISSDTLSDHYQHIEKHKFKDMFEHLIEGNIGSFCAICMDKGLKMVELSEMVHLHGGYP
ALSRDLTIQNVLTATIPGIQLDSYMGNKICDSCVNHALNTYIFISKVQYVRNRLNTSVTL
MLDNLNIDDPESNVMVEISKELILPSVKDADDTDDSDESDYKVEVLEDEFRVEYSESGSE
SDFKEDSIVLSNDIQDQLKNVTKTYSKKVFNGFQMKNDEDSLDVCSEFLTFKKRVKTRKR
APKIRYTCPFCNKNFISDYFLKKHALKHINRAVECDLCYDQFKSKFHLFEHKKMMHLLQS
QNYMTCNICGRTFESANKMKIHQKCHRYKECHLCNKYFISQKYYDIHMQRHAARFNTYRN
RDEQTCSFCEKACSNENELSLHVNKVHLQIKPYSCDMCEKQYYTEYNLLSHKKLHSLPCK
EICEFCNRVFKCRKNLVIHVRKHIGIKPHNCPVCRQAFYSDSIMKNHMKNYHGGKFCCRL
CRTVLQSQFDLKTHINVAHSSM