DPGLEAN19023 in OGS1.0

New model in OGS2.0  DPOGS216089
Genomic Position  scaffold375:+ 73136-74854
CDS Length  1650
Paired RNAseq reads  126
Single RNAseq reads  367
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007797 (0.0)
Best Drosophila hit  CG11966 (2e-17)
Best Human hit  sal-like protein 1 isoform b (4e-12)
Best NR hit (blastp)  PREDICTED: similar to CG4374 CG4374-PA [Tribolium castaneum] (5e-103)
Best NR hit (blastx)  PREDICTED: similar to CG4374 CG4374-PA [Tribolium castaneum] (9e-102)
GeneOntology terms
GO:0003676 nucleic acid binding
GO:0005622 intracellular
GO:0008270 zinc ion binding
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology group  MCL16376
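
The Genomic Position and CDS Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_position` and `span_length` are hypothetical helper names used only for illustration. Under that assumption the genomic span is 1719 bp, 69 bp longer than the 1650 bp CDS, which would be consistent with a small amount of non-coding sequence (for example an intron) inside the gene model.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

scaffold, strand, start, end = parse_position("scaffold375:+ 73136-74854")
span = span_length(start, end)
cds_length = 1650  # value of the CDS Length field in this record

# Print the span and how much longer it is than the CDS.
print(scaffold, strand, span, span - cds_length)
```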

Nucleotide sequence:

ATGACGGCCACTGCTCCTGAACTCTCAAAATCGGATTTTTTTGATTTTGTGACATCTAAT
GAAGTGACGGACGCTCAATATAAACAGCAAATGCGATCAGTCAACGTGTTCATGGAATCA
CCTGATTCAAGATCCAACCCTTTGTTGTCAGAGGAGCCCAAGGAGCAAAACAACAACAAC
ATACTTGCCGTCAGCGGTGGTCAGTCGACCAGCGCAGGTATGACGACCGCCACCAGTTCG
AATCCACTTCAGAGTTTCGACTCTATTTGGAACGTAGATCGGGATCGCGACCGCATCGAG
ACAGCAATGTTGGAGGATCTCAACAAATACTATTGGAATCAAGAAAACGATATTAACGGA
ACCCATCCATGTTCTGATACAGCAATATCAAATAAATTAATAAGCAATAATACGGATGGA
CAAATATACACACTGACAGTTTTAAATCAAGACATTAATGAAACAAACACTAACCGCTAT
TGGGTAAAAGAGGAAGATGTTTCAATGTCAAGTCCGATAGACGTTGAACAAAATCCCTCC
TTAGACCTGGAATCTATACTTAACATGAATGGATTTCCAAATGATTTCAGTCAAGATACA
CTTAAATTAAGTTCAAACAACTTGGTCAAAATTGAACCATTCAATTATGATGACAGTGAA
TTTCAAAGTAGTGACAAAAAAGATGATTCCATAATAAGCAACCCCACATTATTAGAAGTG
GAATATAATAACAATAACAACGACTGGAAATTGACTGACCAAAACACAGAATCAAACGAA
TCATTACTACGGAGTGCCTTACAAGGAAAAGCTTTTATTAGATATAGTACAATTCAAAAA
AATACTGTCAATAAATTTGACGCAGAATCAGAGTTAAAGAGGGCTATTATTAATAATAAC
AATAAACCAGATGTACCATCTATGTATCAAGATAACAAAGATACCGATTCTAGCCTTTTG
ATGGCCATAGCACCACCAAACGGTAATGTAAATATATCAATATTATTAGAAGAACCTTCA
GCGACTGTCTTATCGAATGGAGATAATCCTACATCAACACAAAGCATGGACGACATTTTA
CTTTCCCAACTTGATGCTAGTTATCCCGACGACTATGAAAAATTGAAGCGGATAGCTACT
GAACTAGGTGAATCAGTGCAACCATTTTGTACTGTAGAGCCTATTGATTCCGTTCGTAAT
GTATACAATATTCATCATGTAAATGGCGAATTAGTAACTATGATTCCAGCAGGAGAAGTT
CAGTTGCCTCAACACTTACAGATAGTAACGGCATCACCCACGGTGACGAGTACTAAGCCG
GGAGGTAAGAAGATCAGAAGAATCCAAAATAAAAATTCACCCCCAACAACACAAACACAG
TCGGGGACGGCTGTTCAAGCGGCGACATCTACTTCAAATGGTGTCCGAAAAGAACGGTCG
CTACACTACTGCTCTATTTGTTCTAAAGGTTTTAAGGACAAATACTCTGTTAATGTGCAT
GTGAGAACTCACACCGGAGAGAAACCCTTTACATGTTCTTTATGCGGGAAAAGCTTTCGA
CAGAAGGCGCATCTCGCAAAACATTACCAGACACACATCGCACAAAAGAGCGCAGCTGCT
AACGGTGGCTCTAAACCTTCGAAGAGGTAA
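
The nucleotide block above is 27 lines of 60 bases plus a final line of 30, which gives the 1650 bases reported as the CDS length, or 550 codons including the stop. As a minimal sketch of how the CDS relates to the protein sequence in this record, the code below translates the first and last lines of the block. It assumes the standard genetic code, which the record does not state explicitly; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide block, copied from this record.
first_line = "ATGACGGCCACTGCTCCTGAACTCTCAAAATCGGATTTTTTTGATTTTGTGACATCTAAT"
last_line = "AACGGTGGCTCTAAACCTTCGAAGAGGTAA"

print(translate(first_line))  # MTATAPELSKSDFFDFVTSN
print(translate(last_line))   # NGGSKPSKR*
```

Both outputs agree with the start and end of the protein sequence listed in this record, with the trailing `*` corresponding to the TAA stop codon.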

Protein sequence:

MTATAPELSKSDFFDFVTSNEVTDAQYKQQMRSVNVFMESPDSRSNPLLSEEPKEQNNNN
ILAVSGGQSTSAGMTTATSSNPLQSFDSIWNVDRDRDRIETAMLEDLNKYYWNQENDING
THPCSDTAISNKLISNNTDGQIYTLTVLNQDINETNTNRYWVKEEDVSMSSPIDVEQNPS
LDLESILNMNGFPNDFSQDTLKLSSNNLVKIEPFNYDDSEFQSSDKKDDSIISNPTLLEV
EYNNNNNDWKLTDQNTESNESLLRSALQGKAFIRYSTIQKNTVNKFDAESELKRAIINNN
NKPDVPSMYQDNKDTDSSLLMAIAPPNGNVNISILLEEPSATVLSNGDNPTSTQSMDDIL
LSQLDASYPDDYEKLKRIATELGESVQPFCTVEPIDSVRNVYNIHHVNGELVTMIPAGEV
QLPQHLQIVTASPTVTSTKPGGKKIRRIQNKNSPPTTQTQSGTAVQAATSTSNGVRKERS
LHYCSICSKGFKDKYSVNVHVRTHTGEKPFTCSLCGKSFRQKAHLAKHYQTHIAQKSAAA
NGGSKPSKR
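
The InterPro annotations in this record point to C2H2-type zinc fingers. As a rough illustration, the sketch below scans the second-to-last line of the protein block with a regular expression modelled on the commonly cited C2H2 consensus, C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H. This is an assumption made for demonstration and is not how InterPro assigns its families; `find_c2h2` is a hypothetical helper name.

```python
import re

# Regex form of a C2H2 zinc finger consensus (illustrative assumption).
C2H2 = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

def find_c2h2(protein):
    """Return (0-based start, matched text) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in C2H2.finditer(protein)]

# Second-to-last line of the protein block, copied from this record.
excerpt = "LHYCSICSKGFKDKYSVNVHVRTHTGEKPFTCSLCGKSFRQKAHLAKHYQTHIAQKSAAA"

hits = find_c2h2(excerpt)
for start, motif in hits:
    print(start, motif)
# 3 CSICSKGFKDKYSVNVHVRTH
# 31 CSLCGKSFRQKAHLAKHYQTH
```

The two hits sit close together near the C-terminus, separated by the short linker TGEKPF. Start positions are relative to the excerpt, not to the full protein.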