DPGLEAN18503 in OGS1.0

New model in OGS2.0DPOGS201417 
Genomic Positionscaffold570:- 11208-14529
See gene structure
CDS Length1338
Paired RNAseq reads  219
Single RNAseq reads  651
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002571 (7e-12)
Best Drosophila hit  CG12299 (5e-42)
Best Human hitzinc finger protein 510 (2e-50)
Best NR hit (blastp)  PREDICTED: similar to Zinc finger protein 271 (Zinc finger protein 7) (HZF7) (Zinc finger protein ZNFphex133) (Epstein-Barr virus-induced zinc finger protein) (ZNF-EB) (CT-ZFP48) (Zinc finger protein dp) (ZNF-dp) [Acyrthosiphon pisum] (9e-56)
Best NR hit (blastx)  hypothetical protein LOC77117 isoform 1 [Mus musculus] (4e-59)
GeneOntology terms




  
GO:0005622 intracellular
GO:0005634 nucleus
GO:0008270 zinc ion binding
GO:0003677 DNA binding
GO:0046872 metal ion binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families


  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
Orthology groupMCL24031

Nucleotide sequence:

ATGGCTCACATCTTGGATTTCAAGAAAATATGTCGCGCCTGTTTATCTGATGCTGGACCT
CTAAAGGATTTGTTTACGGCTTGTTCTGCTGGAGTCTTTAAATACTGCACTTCTGTGGAA
ATCGCAGATTCGGATGCCCTCCCAAAATTAATATGTCAAACATGTTTGGATTTACTGAAC
AAACTGTACTACTTCAAGCAAGTCGTTGTGAGATCCAACGTTATACTGAAACAGCAATGC
AGATTACTGAATTTGCAGACCAAACCTGATCAGACAAGCGAAGGGAATGATATAGTAGAG
GTAAATATAACAGAACTGAATGAAGAAGTCACGATGCATGAGAACAACATGAACGAAAGT
ATGGATGGAACAGAGAAGACAGAAAAACCATCAGCTGATGCAATATTAATTAGTCAAATC
CTGCAACGTCGCCGGAGACGCGGCCCTGGTCGTCCGCCGAAGGATCCAGACGGGCCCAAG
CGGAGACGGGAACGGATGAAGTGTATGAAATGCGGCAAGAGCTTCCAGAAGTACGAGAAC
TTCGAAGCTCACATGCGCGGACACTTCGGGAAGAAGCCAGATATAAAGTGCAAGCATTGC
GACAAGGCGTTCCTGTCCCTCCGCAGTCTTAGCAGCCACGTGAGGATTCATACAGCGGTA
CGCAAATATCAATGCCTGAGCTGCGGCAAGAGCTTCGCATATTTGAATGTGCTCAAAAAT
CACGAGCTGATACATGCCGGTATCAAGAAACATCAGTGTCACATATGTGACGCTAAGTTC
GTGCAGGCTTACAATCTCAAGATGCATCTAGAAACTCACAATAATCAGAAGAACTATAGC
TGTTCACAGTGCGGAAAGAAGTTTGCTCAGCCGGGGAACCTCAAGATACACCTCATAAGG
CACACTGGCATCAAGAACTATGCATGTACCATGTGTGAGATGAGGTTCTATATAAAGGCT
GATCTGGTGAAGCACATGCGTTCACACTCCGCCGAGAAACCTTTCTCCTGTCAACTTTGT
GATAAAACTTTCAAAAGCAGAAGCTTTCAAGCAATACATATGAGGACGCATACAGGAGAG
CGTCCGTATGCCTGCGACCTGTGCCCCAAAAAATTCATGGCTAGAAAAGACTTGAGGAAC
CATCGGATGATCCACACGGGGGAGAAACCGCACAAATGTCAGCTGTGCAACCAAGCTTTC
ATACAGAAATGTGCACTGAACAGACACATGAAGGGTCACGGGAAGGCCAATGAAGATGCA
CAGAATCTCATCAGAGCACAACTACCGCCTGTTAATAATACACCTCTTCCAATGTCATAC
ACACAGTGGCATAACTGA

Protein sequence:

MAHILDFKKICRACLSDAGPLKDLFTACSAGVFKYCTSVEIADSDALPKLICQTCLDLLN
KLYYFKQVVVRSNVILKQQCRLLNLQTKPDQTSEGNDIVEVNITELNEEVTMHENNMNES
MDGTEKTEKPSADAILISQILQRRRRRGPGRPPKDPDGPKRRRERMKCMKCGKSFQKYEN
FEAHMRGHFGKKPDIKCKHCDKAFLSLRSLSSHVRIHTAVRKYQCLSCGKSFAYLNVLKN
HELIHAGIKKHQCHICDAKFVQAYNLKMHLETHNNQKNYSCSQCGKKFAQPGNLKIHLIR
HTGIKNYACTMCEMRFYIKADLVKHMRSHSAEKPFSCQLCDKTFKSRSFQAIHMRTHTGE
RPYACDLCPKKFMARKDLRNHRMIHTGEKPHKCQLCNQAFIQKCALNRHMKGHGKANEDA
QNLIRAQLPPVNNTPLPMSYTQWHN