DPGLEAN21979 in OGS1.0

New model in OGS2.0DPOGS211825 
Genomic Positionscaffold662:+ 16436-19088
See gene structure
CDS Length1548
Paired RNAseq reads  160
Single RNAseq reads  593
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012183 (3e-07)
Best Drosophila hit  CG6654 (2e-24)
Best Human hitzinc finger protein 222 isoform 1 (6e-30)
Best NR hit (blastp)  PREDICTED: similar to zinc finger protein 709 [Equus caballus] (8e-38)
Best NR hit (blastx)  PREDICTED: similar to zinc finger protein 709 [Equus caballus] (1e-45)
GeneOntology terms



  
GO:0008150 biological_process
GO:0003674 molecular_function
GO:0005634 nucleus
GO:0046872 metal ion binding
GO:0005575 cellular_component
InterPro families

  
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL22734

Nucleotide sequence:

ATGTATTTCAAATACACTGATTATCTTTTTTATGTACTGCTTATAATGATACAAACAGTT
CAAAGAGTTTCAGGAAAAGCGTTCATCGAAAATGAACCAGTGACCACTCCAGAACTGCAA
ATATTTCCAATGGGCTTCGATGAAGTTTTTAATGATGCTACTAATGTGCTTGGTAATGTT
GTAAGGAAAATCATATCAAAGAAACGTTTACTTGGACTAGATTCTAATTGGTACTGGTTA
TGTTTTAAAAACCTTACGGAACAATACGTTCGTTTTGATGATGCGGTTTCATTGCACCCA
GAAAGTGGTGTATTTCAACCTTTATCCGAAATATTACTTAAACTATTGGGCGACAATATA
TGTGATGAGATTAAGGGTGTTGAAGCTGTATGTACAGATTGTGTGGAAAATGCATTGTTG
TCAGCCCGCTTTGTAGAGAAGTGCCAGCATTCAACAAAAGCTTTAAATGAAGTGTTCAAT
AATATTAGCAATACTTTAGATGTAGATATTGATAATAAAGATAGCAATAAAACATTGTAT
GTTGTAATAGAAGATCTAGAATCTAAACTGTTAGTAGTAAAGAAGACAGATGAAAGAAAC
AGTCTTCAGGGGACATTTGAATGTGAAGTGTGCACAGATAGTTTTGATACATTTACAGAT
TTAAAAGTCCATAATTTGACCAATCATGGTACTTTGACATGTGACAAATGCTATGATACA
TTTGATAGTAATACTGAATTTTCCCTTCATGAGAGTCAGCATCATGTTTATAAATGTCCT
GAATGTCCACAATACAGAAACACAGAGGAGAGTTTAGAAGACCACCAAAACAGACTTCAC
AATGTTTTTGTATGTAAGGAATGTGGAAAACGTTGTCGTGGCCTTTATAAGCTCCAAGTA
CATGAAGAGAAGCATAAGACAAAAAATTCATGCCCTAAGTGTGGAAAGTCTTATACAACA
AAGGAGTTTTTTGATAGGCATGTCAATCTGTGCATCAACAACCTCATAGATCCTCATCCG
ATAAGAAGCAGCATGGTTAAATCATACTCCTGTGAGAAATGTGATAAGGCTTACAGTACG
GCTGGAGGGCTTAGAGTGCATAATAGATTTGCCCATGGAAATGCTAAGCCTCATGAATGC
AAGGAATGCGGGAAACAGTTCACTGCTCCCAGTTATTTGAAAGTTCATATGATAAAACAT
ACAGGGGAGAAGAACTTCAAATGTGATATTTGTCATAGTAAATTTGTATCAAAAGAGGCA
TTGTTGTATCACACTCGACGACACACCGGCGAAAAACCATACAGTTGCAAATACTGCAAT
GAAAGATTCGTCAATGCCTCAACCAGGGCCGAGCATATCAAATTTAAACATGTGGGACCT
ACATTAATGTGTGAAATATGTTCTAGAAAATTTGTTACAAGTCACTTCTTAAAGCAGCAT
ATAAATCGTCATCACGATCCTACAAGTAAGCTCTACTATGGCAGGAATATGATTCCACCT
AACTTGCCGCTCCAACAGAACATGAAGAAGGTTGTTATACACAACTGA

Protein sequence:

MYFKYTDYLFYVLLIMIQTVQRVSGKAFIENEPVTTPELQIFPMGFDEVFNDATNVLGNV
VRKIISKKRLLGLDSNWYWLCFKNLTEQYVRFDDAVSLHPESGVFQPLSEILLKLLGDNI
CDEIKGVEAVCTDCVENALLSARFVEKCQHSTKALNEVFNNISNTLDVDIDNKDSNKTLY
VVIEDLESKLLVVKKTDERNSLQGTFECEVCTDSFDTFTDLKVHNLTNHGTLTCDKCYDT
FDSNTEFSLHESQHHVYKCPECPQYRNTEESLEDHQNRLHNVFVCKECGKRCRGLYKLQV
HEEKHKTKNSCPKCGKSYTTKEFFDRHVNLCINNLIDPHPIRSSMVKSYSCEKCDKAYST
AGGLRVHNRFAHGNAKPHECKECGKQFTAPSYLKVHMIKHTGEKNFKCDICHSKFVSKEA
LLYHTRRHTGEKPYSCKYCNERFVNASTRAEHIKFKHVGPTLMCEICSRKFVTSHFLKQH
INRHHDPTSKLYYGRNMIPPNLPLQQNMKKVVIHN