DPGLEAN13948 in OGS1.0

New model in OGS2.0DPOGS213643 
Genomic Positionscaffold543:- 50335-52712
See gene structure
CDS Length1542
Paired RNAseq reads  58
Single RNAseq reads  166
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004584 (2e-169)
Best Drosophila hit  lame duck (3e-58)
Best Human hitzinc finger protein GLIS3 isoform b (4e-55)
Best NR hit (blastp)  zinc finger protein transcription factor lame duck, putative [Pediculus humanus corporis] (4e-62)
Best NR hit (blastx)  GF18275 [Drosophila ananassae] (1e-61)
GeneOntology terms


  
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
GO:0008150 biological_process
InterPro families

  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
Orthology groupMCL39870

Nucleotide sequence:

ATGGCGGAATACGACCTCATGGCGTACGACCACGACATGTTCCAGGACCGAGGAGACGTC
ACCGACCTTCACGCTTACTACATGACGCAGTTTAACCTGGAACTAGAAAACGGGAAAGGA
AAGCCGAAAAGTGACGGCTTCAAGTTTGGAATCGAAGAGAGCTTCGACTTCGAAGTTGTG
GACAACTGGAACGGCTGGAACTGCAACACCAAAACAGACTTCGACATCATAGACGAAGTC
ATCGACAACAACGTCAGCGAGCAAAATATCAAAGAGGTACTTCTAGACCTGGACACCATT
GAGTTTGATGATAACAGCTTCAAGTCAGCTTCTTGCGAAGGAAGTAAGAGAAGTGAAGCA
AATTATGAAACGAACGACTACATAGACGACGAGTGTCTCATCGACGAGCTGTGCAGAGAG
GAAGGAGAGTCGTGTCGTCTGACGCCAGACGTGTTTAGCGATGACTGCACCAACATCTTG
AACAACGACACGCAGCTGCCATCCATAGAGACCGCCTTCTCTAAAAGATACGGAGCTTTT
AACAACTTGGACAGTTATAACACACAAAATGCTTACGAAGCGAACCCATCTCAGAACACA
AACACACCGACTGTCAGCAGTTTGGAACATTACAGTTATCCTAATAATATTCTTCACAAC
TTGGATAATCCAAAGAACTACGAGCTCCCGGACACACCGACAAGCTGCCAGGATTTTAAT
TTTGACAGAAATATTAGAAAAGTATCGATATCAGATTCTATCGAGAGTGACGTCCAGAGC
GCCGGTTATTATGACGACAACTCAGAAAACTTAGACGAAGACGACCTGTTCATAAACCTC
GATGACTTCGGAATCGCCTTTGAAACGGAGAATGAAGGAAACAACTGCGAAAAGAGCCAT
CACGCCGAGAAAAGAAATGACAAAGACAAAACTCAAGGCGAAAGGGTTTGTTTATGGGAG
CACTGTTTTGAAAGATATCCAAATCAAAACACACTCGTGGAGCACATAGAGCGCGCACAC
GTCAACACTTACAAAGGTGACGAGTTCAGTTGTTTGTGGCGGGACTGTGCGCGCGGTCGC
CGTCCGTTCAACGCGCGTTACAAGCTACTCATACATATGAGAGTACACTCGGGACACAAA
CCCAATAGATGTCATCATCCCGGCTGCGGCAAAGCGTTCTCTCGCCTGGAAAACCTTAAG
ATCCACGTGAGATCCCACACGGGCGAGCGACCCTACGCCTGTCCCGCTCCTCACTGCAGG
AAGGCCTTCTCTAATTCCTCAGACCGAGCCAAGCATCAGCGAACTCACTTTAATGCCAGA
CCGTACGCGTGCGGCGCTGCAGGCTGTAACAAGCGCTACACGGACCCCTCCTCGCTGCGG
AAGCACGTCAAGTCCCACCCGCATGCTCCCCCTCGCGCGCGCCTCCCCCCGGCCAGACCT
CCTCCTCACGAGGAACAGCTTGTGCCGAGTTCACCCGCCAAACTCGACACACTCAGATGT
ATCAGAGACAAACTCACCGTTCCCCGACTACGGAACATGTAG

Protein sequence:

MAEYDLMAYDHDMFQDRGDVTDLHAYYMTQFNLELENGKGKPKSDGFKFGIEESFDFEVV
DNWNGWNCNTKTDFDIIDEVIDNNVSEQNIKEVLLDLDTIEFDDNSFKSASCEGSKRSEA
NYETNDYIDDECLIDELCREEGESCRLTPDVFSDDCTNILNNDTQLPSIETAFSKRYGAF
NNLDSYNTQNAYEANPSQNTNTPTVSSLEHYSYPNNILHNLDNPKNYELPDTPTSCQDFN
FDRNIRKVSISDSIESDVQSAGYYDDNSENLDEDDLFINLDDFGIAFETENEGNNCEKSH
HAEKRNDKDKTQGERVCLWEHCFERYPNQNTLVEHIERAHVNTYKGDEFSCLWRDCARGR
RPFNARYKLLIHMRVHSGHKPNRCHHPGCGKAFSRLENLKIHVRSHTGERPYACPAPHCR
KAFSNSSDRAKHQRTHFNARPYACGAAGCNKRYTDPSSLRKHVKSHPHAPPRARLPPARP
PPHEEQLVPSSPAKLDTLRCIRDKLTVPRLRNM