DPGLEAN16717 in OGS1.0

New model in OGS2.0  DPOGS205247
Genomic Position  scaffold1389:+ 40223-42719
CDS Length  1311
Paired RNAseq reads  58
Single RNAseq reads  146
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008785 (5e-120)
Best Drosophila hit  CG11247, isoform C (4e-26)
Best Human hit  zinc finger protein 555 isoform 1 (1e-24)
Best NR hit (blastp)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (7e-31)
Best NR hit (blastx)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (2e-37)
Gene Ontology terms
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  ND

Nucleotide sequence:

ATGTCATTATATAATAATTTATATCACGAGGGAATATGCCATGAAGATTATGTATCTGGC
GAACTGGAGCTGCTGTCCCGGCATAACGACCTTAAACCTTTGCCATCTAGGGAGAGCCTT
AAATATATCTGCAAAAAATATTATAAGGAATTGGAGCTTTTGAAAGGACAGATAATAACA
GCTAAATCGATTAAAATCGATAACACAGATAGTGTTTCAAACAGAATCGACCCCCAATCA
TTTGTGACAGAGAAGCTGGCTCATATAAACAACGTGACGACCATACTAGAGAATTGCAAT
CTAACGGCCTTTAAGACTAAACGGAGATATGGGTATATTTGTTTCTACTGCAGCCGAAAA
TTCGACAAGATAGAACTCCTCGCTGATCACCAGGTGGAAGGTCGTTGCAAGGACGGGATC
AAGTCAATCATCAGCAAGCACCCATCAGACAGTCTAGCTGTGTACGTCCACGTGGCGGAC
ATGAAGTGCTCCATATGCGACAAAAAACTACCAAACATGAACGAACTGAAACATCACCTG
ATGGCAACACACAAGAAGAAGATATACACCGGATACGGCGACAGGATAATACCGTTTGAA
CTCGGCAAGAACAAATACGACTGCCAAATATGCGGCTCCAGTTACGAGACATTCGGTGCT
GTAGAGAGGCACATGAATGTTCACTACCGGAACTACATCTGCGATCAGTGCGGCGTTGGT
TTCGTAACGAAGAACAGACTGAGGGTGCACATAAGATCGGCCCACGTCACCGGCAGCTAT
CCGTGCGACGTGTGCGACAAAATATTCCAAGCTCAGCACAAATACAAGAACCACGTTGAC
GTCACCCATAGAATGGTCAAGAAGAACAAATGCCCGAAGTGCCCCGAGCGCTTCGCGGAC
TATTTCCACCGGCACAAGCACATGGTGGACGCGCACGGCGAAACGCCGTTACGATACAAA
TGCAACGTATGCGAAGCGCTGTTCAAACGCCGCTACGCCCTCTCCTGCCACACGAAGAGA
CGGCACCTGGATATGAGGGACGTCAATTGCGATGTCTGCCCCTACAAGTGCTATACGATT
ACCGAACTCAAAGCCCACATGATAAAACACAACGGCCAAAGGACTTATGAGTGTAATGTG
TGCAAAAAGTCCTATGCTAGAAAGAAAACTCTGAAGGAGCACATGAGGATACATAATAAC
GACAGGAGGTACGTCTGCGCCGTCTGCGGACAGGGATTCGTACAGAACTGCAGTCTGAAG
GGACATATGAAGACTCATCATACGGAATATCTGAATAATCTGCCAAGATAG

Protein sequence:

MSLYNNLYHEGICHEDYVSGELELLSRHNDLKPLPSRESLKYICKKYYKELELLKGQIIT
AKSIKIDNTDSVSNRIDPQSFVTEKLAHINNVTTILENCNLTAFKTKRRYGYICFYCSRK
FDKIELLADHQVEGRCKDGIKSIISKHPSDSLAVYVHVADMKCSICDKKLPNMNELKHHL
MATHKKKIYTGYGDRIIPFELGKNKYDCQICGSSYETFGAVERHMNVHYRNYICDQCGVG
FVTKNRLRVHIRSAHVTGSYPCDVCDKIFQAQHKYKNHVDVTHRMVKKNKCPKCPERFAD
YFHRHKHMVDAHGETPLRYKCNVCEALFKRRYALSCHTKRRHLDMRDVNCDVCPYKCYTI
TELKAHMIKHNGQRTYECNVCKKSYARKKTLKEHMRIHNNDRRYVCAVCGQGFVQNCSLK
GHMKTHHTEYLNNLPR
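
The CDS length and the two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model; the helper name `translate` and the variable names are illustrative and not part of the record. It translates the first and last stretches of the listed nucleotide sequence and derives the protein length implied by a 1311 nt CDS that ends in a single stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the gene model uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 60 nt and last 12 nt of the nucleotide sequence, copied from the record.
cds_start = "ATGTCATTATATAATAATTTATATCACGAGGGAATATGCCATGAAGATTATGTATCTGGC"
cds_end = "CTGCCAAGATAG"

start_peptide = translate(cds_start)  # N-terminal residues
end_peptide = translate(cds_end)      # C-terminal residues before the stop

# A 1311 nt CDS holds 437 codons; one is the stop codon, leaving 436 residues.
cds_length = 1311
expected_residues = cds_length // 3 - 1

print(start_peptide, end_peptide, expected_residues)
```

The translated fragments can be compared by eye with the beginning and end of the protein sequence, and the derived residue count with the length of the listed protein.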