DPGLEAN12186 in OGS1.0

New model in OGS2.0  DPOGS205627
Genomic Position  scaffold175:- 168986-173686
CDS Length  1404
Paired RNAseq reads  653
Single RNAseq reads  2042
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001139 (0.0)
Best Drosophila hit  CG12769, isoform A (7e-78)
Best Human hit  early growth response protein 1 (2e-14)
Best NR hit (blastp)  PREDICTED: similar to zinc finger protein [Tribolium castaneum] (3e-145)
Best NR hit (blastx)  PREDICTED: similar to zinc finger protein [Tribolium castaneum] (9e-132)
Gene Ontology terms
GO:0008270 zinc ion binding
GO:0005622 intracellular
GO:0003676 nucleic acid binding
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology group  MCL16254
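
The C2H2-type zinc finger annotations listed above can be illustrated with a simple motif scan. The following is a minimal Python sketch, assuming the PROSITE-style consensus C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H. The regular expression and the function name find_c2h2 are illustrative assumptions. This is not how the InterPro families on this record were assigned, and a regex scan of this kind can both miss true fingers and report false ones.

```python
import re

# Assumed PROSITE-style C2H2 consensus (an illustration, not the InterPro method):
# C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")


def find_c2h2(protein):
    """Return (0-based start, matched text) for each non-overlapping candidate."""
    protein = "".join(protein.split()).upper()  # drop line breaks and spaces
    return [(m.start(), m.group(0)) for m in C2H2_PATTERN.finditer(protein)]


# Fragment copied from the protein sequence of this record
fragment = "LFECDVCNMKFSNGANMRRHKMRHTG"
for start, motif in find_c2h2(fragment):
    print(start, motif)
```

Passing the full protein sequence instead of the fragment lists every candidate finger. Overlapping candidates are not reported, because re.finditer only returns non-overlapping matches.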

Nucleotide sequence:

ATGTCATCAAAAGGGGAAACAGAGGCCGATTCTGGCACCGTGTCGCCAAATCTACCTGCC
GTCAAAAATGAACCGGCACCAACAGCGAGCAAGGAAACACCTAAAAGGGGCAGTGGTACT
CTATTGAAGTGTACTACATGTAACAGTTTTAGTACGCTAAGCAGTCGAGCATTAACCACA
CACATGGCACAGTGTTCACCTGACAATAACAATGTGGCGGCCGCTCAGAACACAGACGCT
AGGCCACACAGAAAGCTTTTCGAGTGCGATGTGTGTAACATGAAGTTTTCCAATGGCGCC
AACATGCGACGCCACAAAATGCGCCACACCGGCGTCAAGCCTTACGAGTGTCGTGTGTGT
CAGAAGAGGTTCTTCAGGAAGGACCATTTAGCGGAGCACTTCACGACCCACACAAAGAGC
TTGCCGTACCATTGCCCCATATGTAATCGTGGCTTCCAGCGACAGATAGCCATGCGTGCT
CACTTCCAGAACGAGCACGTTGGCCAGCATGATCTCGTCAAAACTTGCCCGCTCTGCAGT
TACCGAGCGCCGACAATGAAGAGTCTCCGGGTCCATTTTTTCAATAGACACGGTATTGAT
CTGGACAACCCAGGGCCCGGTAACAACTCTGTTTCCCTTCTCGCGGCCGGCATAGCCAAC
GCCGCTTATGCTGAAGGCACAATGGACCCCAGCGCCATATCAGCATTAGGGGCATTGGGC
TCAGGCCTCTCAGTGTCTGTAGCGGCTGCTTACTCAGATAGTGGTGACAGCAATGGAGAA
CGTTCAGTGGACAACGCAACCCCACCTATGCATTATTTGACCCCACATGTAGAAATATCT
ATGGCCGATAACAACGAAACATTCTCACCTGGCCAGAATAACCATCAAGAGTCCCGTATG
AATGGTTCAGGGTCACCTCATTCAGAGAGCGCTGCAAGTAGTTCCCTGGCGTTGCCTGCT
ATAACACCATCAATCACGCTCATACCCATTAAACAGGAACCTAATGCTCAGGAGGAAGGT
TCAGGAGATGCTGGTGAAGGGGAAGACAAACGTGACGTCTCATCATCGCTGTCTTCACTC
ATACAAGTGTCGCCATTGAAGAGTTTGTTGCGAGAGGACCTGCGCAGACGCATCTCAGCC
AGGGGCCGGTCTAGGGCTAATAATGCGTCCCGAGCCTCCCCTTCGGAGGGGGGAGTGACT
ACTTCGACCCAAGGGGACGCAGCCCTACTGCCTTCGTCGCTTGTTTGCTCGTTCTGCTGC
ATCACGTTCCCCGACTCCACACTATACTTCCTGCACAAGGGCTGCCACTGCGACGCCAAC
CCCTGGAAATGCAACATCTGCGGCGAGCAGTGCTGCAACGTTTACGAGTTCAATTCCCAT
CTGCTGTCGAAGAGCCACCAATGA
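
The reported CDS length of 1404 nt corresponds to 468 codons. Because the sequence above ends in the stop codon TGA, the expected protein length is 467 residues. The following is a minimal Python sketch of the basic integrity checks one might run on such a coding sequence. The function name cds_summary and the returned keys are hypothetical, chosen only for this illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(cds):
    """Basic integrity checks for a coding sequence given as a DNA string."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    has_stop = cds[-3:] in STOP_CODONS
    codons = len(cds) // 3
    return {
        "length": len(cds),
        "in_frame": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": has_stop,
        # The stop codon does not encode a residue
        "expected_protein_length": codons - 1 if has_stop else codons,
    }


# Toy input; paste the nucleotide sequence above to check the real record
print(cds_summary("ATGAAATGA"))
```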

Protein sequence:

MSSKGETEADSGTVSPNLPAVKNEPAPTASKETPKRGSGTLLKCTTCNSFSTLSSRALTT
HMAQCSPDNNNVAAAQNTDARPHRKLFECDVCNMKFSNGANMRRHKMRHTGVKPYECRVC
QKRFFRKDHLAEHFTTHTKSLPYHCPICNRGFQRQIAMRAHFQNEHVGQHDLVKTCPLCS
YRAPTMKSLRVHFFNRHGIDLDNPGPGNNSVSLLAAGIANAAYAEGTMDPSAISALGALG
SGLSVSVAAAYSDSGDSNGERSVDNATPPMHYLTPHVEISMADNNETFSPGQNNHQESRM
NGSGSPHSESAASSSLALPAITPSITLIPIKQEPNAQEEGSGDAGEGEDKRDVSSSLSSL
IQVSPLKSLLREDLRRRISARGRSRANNASRASPSEGGVTTSTQGDAALLPSSLVCSFCC
ITFPDSTLYFLHKGCHCDANPWKCNICGEQCCNVYEFNSHLLSKSHQ
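
The protein sequence can be checked against the nucleotide sequence by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard nuclear genetic code. The names CODON_TABLE and translate are illustrative, and unknown or ambiguous codons are rendered as X by assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence of this record
print(translate("ATGTCATCAAAAGGGGAAACAGAGGCCGATTCTGGCACCGTGTCGCCAAATCTACCTGCC"))
print(translate("CTGCTGTCGAAGAGCCACCAATGA"))
```

The two printed fragments should correspond to the start (MSSKGETEADSGTVSPNLPA) and the end (LLSKSHQ) of the protein sequence listed above.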