DPGLEAN12518 in OGS1.0

New model in OGS2.0DPOGS205425 
Genomic Positionscaffold2492:- 20353-22449
See gene structure
CDS Length2097
Paired RNAseq reads  284
Single RNAseq reads  687
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001689 (8e-23)
Best Drosophila hit  crooked legs, isoform A (5e-26)
Best Human hitzinc finger protein 845 (8e-33)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_65421 [Branchiostoma floridae] (1e-37)
Best NR hit (blastx)  PREDICTED: zinc finger protein 347-like [Saccoglossus kowalevskii] (3e-53)
GeneOntology terms




  
GO:0005622 intracellular
GO:0005634 nucleus
GO:0046872 metal ion binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0003677 DNA binding
GO:0008270 zinc ion binding
InterPro families

  
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL39572

Nucleotide sequence:

ATGTCACGAGGCCACAACAAATTGGGGAAACCTCAGCCCGAGTGCGACTTGTGCGGACGG
ATATTCACAAGGAAACATAATCTTGTCTCTCACATGATAGTCGTACACTTGCAGGCGGGC
AAACAAAACATAACTTGTAATTTGTGTGATCGTAAATTTAATATCGAGAGGAATTTGAAG
AGGCATATGAAGCACAAGCACACAGTTGTAGATTATTCGACCTGTGATATATGTAAAAAG
AACTTCAAAGACAAGGGATTGCTTGCATCTCATATAGAAACGGTGCATCTGACAAATTTT
TCAGTGGCAACCGAGAATTATAAAGAAAATAAAAGCGACAAATTGTTAAAAAGTTGGTCG
GAAATACATAAGGACAAATCCGTTTTTAAGTGTAATATATGCTCGAAAATATATCTATCA
AGTCAAAGTCTGAAACGGCATACAAGAACTTTACACGGCGATAAGAATTACTGTAAGTAT
TGTTCGAAATTCATCAAAGACGATATTGAGAAGCATATTAATAATTGTCACAAAAACAAT
GAGGATAAGTATATATTTAAATGTGAGGTTTGTAATGCGATATTTGAATATGAGCACTCA
CTGAGAGGACATATCAGGGAAGAGCACAGTTTCCAACAGTTTTACGACCACTGCAAGAAA
TCTCTACTAAACATAACACCTTGGAAATTACCTAAGCGGGAAAATAATTGGCATACCTGT
GAATTTTGTTGTAACACATTCGCATCTGTATACGATTTAAAAGATCACATGAAATCACAT
CACGACATTGAATACAACTTGTCCACTTGCAACGTTTGTTTTAATAAATTTTACAGCAAA
GAAACCATGTTTGCGCATAAAAAGAACTGTTTCCCGCCAAAAAATGCGAACGCCTGCCGT
CATTGTGACAAACTGTTCACTGACATATCGAGTTTGAATTTCCACGTGAGAATATTCCAT
CCGCAAGCTCAAATCGCTGATTCGAAATTATTATCGAACCGTGAGGATCTCGGATCCTTT
AAATGTGATCACTGCGACAGGATATATTACAGCGACAGGTCTTTGAAACACCATGTGAAA
TTAAAACATAGCAGCGACGAGGCCGCTGAATGTCAATATTGTGGAAAAATATGCAATAAT
AAATATTATCTGGCATCCCACATCAAAATCGTTCACAATAACGACTACTGGGCCAAATGC
GATTTCTGTGATAAACAATTCAAATCGAAAAGGAACATACGCCGGCACATTGAATACACA
CATTTAGGCATGCAGAGGTATAAGTGCATAGAGTGCGGAACTCTATTTAAAGAGAAAAGA
AGTTTGAGGAAGCATGTTAGGATCAAACATCCAGATTCGACTGCGTTCCCACAGTGCCAT
ATATGTAAAAAGCGTTTCGAATCAGCTAAATCCTGTAAAATACATTTAAAGTTACTCCAT
TCCTTCAACATGAATACCCATCCTTGTGATTTATGTTCACTGTCTTTCGATTCCCTCGAT
GCTCTGAATATTCATCTATCGACCAAACATCTCGCTAAAGATGAAATATATAAATGCGAG
GAGTGCAATTTAGTTTTTAAAGGACAAATAACATTCGACTGTCACAACGATAATTACCAC
GCGTGTGAAAGCAAAGAGAAGAATCTGCCGCGTTGTATAATATGCGCCAAAGATTTCAGA
ACTCGTAAGACTTTAAAGCGTCACATAAAACGATTCCACGAGGAGTTTAACGTGGAGGAA
CTGGCTACTTACGGTACAAAGAAACGTATGTTTAACGTAAATTGCACCGAGTGTATAAAA
AATTTTAATAATGATTTTTATTTCATGGTGTATTCAAAAATGAAGCATTTGAGCGATTCG
TTTATTTTCAAATGCGAATTGTGCTCTTATTCCTACAATTGTCTTGAATACGCGATCCAG
AGGTATAAACAGAGCGTTGACGTGAAGGGGAAGTTGTATTTGAGCGAGTTGTGTACGACT
GAAATGAGTGAAAATGATTCGGACAGCGGAAAACTTGCGCCTGAGAGCAATATTGATTCA
AAAGATGATGTAGAATATAAACATTTTAATATAAAAATCGAGCCCAGCTCTCCATGA

Protein sequence:

MSRGHNKLGKPQPECDLCGRIFTRKHNLVSHMIVVHLQAGKQNITCNLCDRKFNIERNLK
RHMKHKHTVVDYSTCDICKKNFKDKGLLASHIETVHLTNFSVATENYKENKSDKLLKSWS
EIHKDKSVFKCNICSKIYLSSQSLKRHTRTLHGDKNYCKYCSKFIKDDIEKHINNCHKNN
EDKYIFKCEVCNAIFEYEHSLRGHIREEHSFQQFYDHCKKSLLNITPWKLPKRENNWHTC
EFCCNTFASVYDLKDHMKSHHDIEYNLSTCNVCFNKFYSKETMFAHKKNCFPPKNANACR
HCDKLFTDISSLNFHVRIFHPQAQIADSKLLSNREDLGSFKCDHCDRIYYSDRSLKHHVK
LKHSSDEAAECQYCGKICNNKYYLASHIKIVHNNDYWAKCDFCDKQFKSKRNIRRHIEYT
HLGMQRYKCIECGTLFKEKRSLRKHVRIKHPDSTAFPQCHICKKRFESAKSCKIHLKLLH
SFNMNTHPCDLCSLSFDSLDALNIHLSTKHLAKDEIYKCEECNLVFKGQITFDCHNDNYH
ACESKEKNLPRCIICAKDFRTRKTLKRHIKRFHEEFNVEELATYGTKKRMFNVNCTECIK
NFNNDFYFMVYSKMKHLSDSFIFKCELCSYSYNCLEYAIQRYKQSVDVKGKLYLSELCTT
EMSENDSDSGKLAPESNIDSKDDVEYKHFNIKIEPSSP