DPGLEAN00012 in OGS1.0

New model in OGS2.0DPOGS213025 
Genomic Positionscaffold46:+ 65107-79829
See gene structure
CDS Length1359
Paired RNAseq reads  1537
Single RNAseq reads  3473
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006943 (8e-145)
Best Drosophila hit  Sp1, isoform D (2e-74)
Best Human hittranscription factor Sp9 (5e-64)
Best NR hit (blastp)  Sp-like zinc finger transcription factor [Tribolium castaneum] (1e-130)
Best NR hit (blastx)  Sp-like zinc finger transcription factor [Tribolium castaneum] (3e-114)
GeneOntology terms



  
GO:0005634 nucleus
GO:0003702 RNA polymerase II transcription factor activity
GO:0045449 regulation of transcription
GO:0003676 nucleic acid binding
GO:0008270 zinc ion binding
InterPro families

  
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL11976

Nucleotide sequence:

ATGGAGTCGCGCGTGCGATCGATAGAGCCTTATGGGCGGGGGCCGGTGTGGGCGGGGAGC
AATAAGAGGCTGGTATTGGTTCGGACCGCCCATCCCGCCCCCCACGTCGCGCATCAAACC
CGCACCCGTAAACGTATGCCCGCGCGCCGCGACAAGAGACGTCATGCAAATGTTCCGAGA
CGTCCCGCGGCGGGCGGTAGCGGCCGGGAGCTGTCACCGCTCCGACTAATTGAGCATCCG
AACTTGCGCGGCACGCCATTGGCGATGCTCGCTGCGCAGTGCAGCAAGCTGTCCAGCAAG
TCGCCACCACCACTGGCTGATGCAGCGGTCGGCAAAGGTTTTCATCCGTGGAAGAAAAGC
CCTGGAACACATTCTCCACCGGGAGCTGGTTTGGTGCCTCGATCGCAGGCGTCGGCTTGC
ACACCATATGCACGAGCCCCTACCTCATGTGCTGCGGCGCCTTCATACGGAAACGAGCTC
TACTTTCCTTCATCGGGTGATCAGTTGCTAGGGAAAAGTGAATCGAGCGCCAGTCTAGGC
TCCATGTACTCAAGACACCCTTACGAGTCCTGGCCTTTTAATGTTGGAGGTGGCGGTGGT
AGTGGTGCTTTGAAAGCAGCTGAAATGGGCGGTGTAAGCGCTGTAGGTAGTACTTGGTGG
GATGTCCACAGTGGGTGGTTAGACGTTGGAGGTCAAATGGCAAACTACGCTGGGCAAGAT
TATTCTCAATTGACGCACTCTCTTTCTGGAGGAGCTCATTTGCTTCCTCCAGCGCCCCAC
CTCCTACAAGATGCATATAAATCTGTGTTGCCTACACAGGGATCTTTCGGTCTTCATGCA
CCAGGATCCCCAGCACCACCAGCTCAGGCTCCGTCACCGCGATCTCAGCGACGATACGCC
GGCCGCGCTACTTGTGACTGTCCTAACTGTCAAGAGGCCGAAAGACTCGGACCGGCTGGA
GCTCATCTTCGTAAGAAAAATATACATAGTTGTCATATACCTGGATGTGGAAAAGTATAC
GGGAAAACATCCCACCTTAAGGCTCATCTACGCTGGCACACTGGCGAGAGGCCTTTCGTG
TGCAACTGGCTGTTCTGTGGAAAACGTTTCACACGCTCCGATGAACTACAGAGGCATCTG
AGAACGCACACAGGCGAAAAAAGATTTGCATGTCCTGTGTGCAACAAACGTTTCATGAGG
TCGGATCATCTCGCTAAACACGTCAAGACTCATAATGGAGGAAAGAAGGGCAGTTCGGAA
TCTTGCTCGGATTCCGAAGAGAATAGCCAAGGGGAGAGTCATGCTGGTGGAAGGTCGCCA
GAGCATCACTTGGATGTGAAACCAGGTGCACTCGTGTGA

Protein sequence:

MESRVRSIEPYGRGPVWAGSNKRLVLVRTAHPAPHVAHQTRTRKRMPARRDKRRHANVPR
RPAAGGSGRELSPLRLIEHPNLRGTPLAMLAAQCSKLSSKSPPPLADAAVGKGFHPWKKS
PGTHSPPGAGLVPRSQASACTPYARAPTSCAAAPSYGNELYFPSSGDQLLGKSESSASLG
SMYSRHPYESWPFNVGGGGGSGALKAAEMGGVSAVGSTWWDVHSGWLDVGGQMANYAGQD
YSQLTHSLSGGAHLLPPAPHLLQDAYKSVLPTQGSFGLHAPGSPAPPAQAPSPRSQRRYA
GRATCDCPNCQEAERLGPAGAHLRKKNIHSCHIPGCGKVYGKTSHLKAHLRWHTGERPFV
CNWLFCGKRFTRSDELQRHLRTHTGEKRFACPVCNKRFMRSDHLAKHVKTHNGGKKGSSE
SCSDSEENSQGESHAGGRSPEHHLDVKPGALV