DPGLEAN16713 in OGS1.0

New model in OGS2.0  DPOGS205244
Genomic Position  scaffold1389:+ 7905-11933
CDS Length  3276
Paired RNAseq reads  179
Single RNAseq reads  531
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008745 (3e-68)
Best Drosophila hit  crooked legs, isoform A (2e-35)
Best Human hit  zinc finger protein 208 (5e-68)
Best NR hit (blastp)  PREDICTED: similar to mCG7830 [Acyrthosiphon pisum] (2e-72)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_71028 [Branchiostoma floridae] (6e-90)
Gene Ontology terms
GO:0008270 zinc ion binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0005622 intracellular
GO:0046872 metal ion binding
GO:0003677 DNA binding
GO:0005634 nucleus
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL20861

Nucleotide sequence:

ATGCCTCTTAGAAAGTTACCGTTTCCACATACGGATTTGCAAAAACATCGCTATAATATT
AGAGAAATATTGCACAGCTCTAATGCGACGCCCATACGCGGCCACAAAGACCGCGGCTAC
ACTTGTAATTTTTGCGAAAATGAATTCGAGAATCCCGCCGATTTAAAACAGCACAATTTA
GATTGGCACACTTATAACTTAAGCACCGACGACATTGAAGTTTTGGTCCCATCCAACCTA
CCCCAGGTCTGTGTAAAATTGGACATCACAGGTCTCACTTGCAATTTATGCCAAGAAAGC
GTAGACTCTGTCGAAGAGCTATGCGACCACCTGCAACAAGCTCATGATAGGACAATTTTT
GACGACATAATACATCAAATACTACCATTTAAATTCAATGATCGAATCCTCCAATGCCAC
ATATGTAATAATATATACAATAAATTCAAAAAACTCATGGAACATATGAACAATCATTAC
AGTAACTATGTGTGTGACGTGTGCGAAGCTGGTTTCGTCAATAAGAGAAAACTTCATTAT
CACAAGATCGGACACAACGACGGTGATTTCAAGTGTGACGTGTGCGTAAAACGCTTCCGC
ACTTTCGGCAGCAAAAGATATCACGAACGGCGATACCATGACAAGAACAGGCTGTTTAAC
AAATGCGGATACTGCAATGCGCTGTTCAAGGGCATGAGGGAAAAGGATCTTCATCTGGCT
AAAGAGCACGGTGTGGCGTCATTTGCGAGAAAATGTCAAGCCTGTGACAAAATTTTCTCT
CATCAGAACGCTTTGCACACGCACATGAGACGATATCACTTGGTGGAAAGGAAGCACAAG
TGCCCAGAATGTGACAAAACATTCTTCTCTACATCCGACGTCAAGATGCACATGGTGAAA
CATACGGGCGAGCGTGAATACAAGTGTGAAATATGCCTTAAATCTTATGCGAGGAAGTGG
ACTCTGAGCGAGCATATGCGCATTCACTCGGACGATAGGCGGTTCAAGTGCGAGCATTGT
GGACAATCGTTCATACAGAAATGTATACTACGTAAACTCAAAAAGGAAAAGTTAAATAAG
GACGACGACTTCGTTCAGAGAACAGATCTGCCGAGAGGTAGACAAGGAGAAATTTTAGAC
AAACATCGCACTAATATACGCGAAATTATCAAATGGTCGAACGCTACGCCGATACGATGC
CGAGGCGGGATCGGATATGCGTGCTGTTTCTGTTCAGACCAATTTCCAAATCCTGCTGAC
TTAAAACATCACACCATCAAAGCTCACGATGATATCACCAAGTCCAAGTTCATGAAGGGT
AGGGATATGTACGGATATTTCGTCAAACTAGATATAACCTCTCTACAATGCAACATTTGT
GGGCTGGAAAATGACACGTTAGAACAAATTATGTTGCATTTGAAAGACGAACATGGTAAA
AGCATCAGTACTGACATATCGAATCACATATTACCTTTCAAATTTGACAGCGAGATTTTG
CGCTGTTTCATGTGCCACAACGTGTTCAATAGGTTCAAGGCATTGCAAGAACACATGAAC
TTACACTATAGGAATTACGTCTGTGAAGTTTGCGACGCTGGTTTCGTCAATCGCCACCTT
CTTTTATGTCACAACGAGGGTCACAAAACGGGTACATTTGCGTGCGATCAATGCGGAAAA
ATCTTCGACACGCTGAGGAAGAGAAAGTTGCACGAAAGGAAAATTCACAACGGTCTGAAC
ATGCCGCATAAATGCGGTTATTGCAACGAGAGGTTTAAGGAGAACTGTTACAAAAATGAG
CATCTCGCCAAAGTTCACGGCATAATCGGCCCTTCTATCAAATGTCAAGCCTGCGAGAAG
ACGTTTTCGACGCAGCAGACCTGGCTCTTGCACATGAAGAAGTACCATTTGATGCAGAGA
CAGCACAAATGCACAAAATGCGAAATGGATTTCTTCTCAAAGAGGGAACTCACCGACCAC
ATGGTGAAACACACCGGAACTAGGGACTACAGATGTGAAATGTGTTTCAAATCGTATGGC
AGATTGAAGACTCTGAAAGAACACATAAGACGGCTCCATCCAGAAGGCAGAGACATCAAA
TGCGCTCATTGTGGTCAAGCTTTTGCGAAGAGATTTGCTTTGAAAAGTATGAAAACGATA
AAAATTGAAAGAAAAACGGACGTCCCACGCGTGCAGTTGAAGCTGGTTAGCCGGAAACAT
CGGAAGCTGTCCGACAACAAGAAAAACCAGGAGAATCTCACGCACATATTACTCAATTCA
AACGCGAGCCCCATAAGGAATAAGGACAGCCTCGGCTACGGTTGCGCCTTTTGTTCTGAA
CAGTTCGTTGATCCTAAAAATCTAAAGAGGCATTTTTTGAGCGAACATCAGAGCGACAGG
CTGATAAAATTAATGTCCAGCAAACTTTTTGAACATGTCGTCAAATTGGATATCACGTAT
TTGCATTGTTCGCTGTGCGATAAAAAGATACCTCACTTAGATGAACTAATGGCGCATCTG
AAAACGGAACACAGAAAGGACTTGCACACGGACATCAAGAGTTCAATAGTACCTTTCAGT
TTTGATACTCCGCAGTTGCAATGCGCTGTTTGCAAGATAGAGTATTCAAACTTCAAGCTA
TTACAAGAGCACATGAATTCGCATTTCGGTAATCACATCTGTGCTATGTGCGGCGGCGGC
TTCGTTACGGAACGTCTGCTGACAACGCACATGAAGAGGCACAAAACCGGCGAGTACAAA
TGCGAAGATTGCGACAAGGTTTTCGATAATGAAGAGAAAATGAAGGAGCACCAGAAGCGT
TCACACCTCGGTCATAATAAAAGGAACAAGTGTCTTATATGTGAGGAGAGATTTGTGGAT
TACTGGAAGAAAGTCGAGCACATGGTCGAAATACACGGCGCTCCGCCGGTCGTGCTGAAA
TGTCAGGCTTGTGACCGTACATTCAGAAATCAGAGAGCTCTGTCCAGACACACGAAAAAG
GACCACCTGATGGAGAGGAAAAACAAATGTCCGGAATGCGATATGAGATTCTTCAGCAAG
AGCAGTCTGCAGCGGCACATGGCGAAGCACACGGGGTTACGGCAGTTTTCGTGCGACGTT
TGTTTTAAATCCTACGGAAGAAAGAACACTCTGAGGGAACATATGAGGATTCACGCCGAC
GACAGGAGGTTTGCGTGCATCCATTGCGGACAGGCCTTCGTTCAGAAGTGCAGTTGGCGG
AGTCACATGAGGTCCAAGCATGGCGATGACGTTTAG

Protein sequence:

MPLRKLPFPHTDLQKHRYNIREILHSSNATPIRGHKDRGYTCNFCENEFENPADLKQHNL
DWHTYNLSTDDIEVLVPSNLPQVCVKLDITGLTCNLCQESVDSVEELCDHLQQAHDRTIF
DDIIHQILPFKFNDRILQCHICNNIYNKFKKLMEHMNNHYSNYVCDVCEAGFVNKRKLHY
HKIGHNDGDFKCDVCVKRFRTFGSKRYHERRYHDKNRLFNKCGYCNALFKGMREKDLHLA
KEHGVASFARKCQACDKIFSHQNALHTHMRRYHLVERKHKCPECDKTFFSTSDVKMHMVK
HTGEREYKCEICLKSYARKWTLSEHMRIHSDDRRFKCEHCGQSFIQKCILRKLKKEKLNK
DDDFVQRTDLPRGRQGEILDKHRTNIREIIKWSNATPIRCRGGIGYACCFCSDQFPNPAD
LKHHTIKAHDDITKSKFMKGRDMYGYFVKLDITSLQCNICGLENDTLEQIMLHLKDEHGK
SISTDISNHILPFKFDSEILRCFMCHNVFNRFKALQEHMNLHYRNYVCEVCDAGFVNRHL
LLCHNEGHKTGTFACDQCGKIFDTLRKRKLHERKIHNGLNMPHKCGYCNERFKENCYKNE
HLAKVHGIIGPSIKCQACEKTFSTQQTWLLHMKKYHLMQRQHKCTKCEMDFFSKRELTDH
MVKHTGTRDYRCEMCFKSYGRLKTLKEHIRRLHPEGRDIKCAHCGQAFAKRFALKSMKTI
KIERKTDVPRVQLKLVSRKHRKLSDNKKNQENLTHILLNSNASPIRNKDSLGYGCAFCSE
QFVDPKNLKRHFLSEHQSDRLIKLMSSKLFEHVVKLDITYLHCSLCDKKIPHLDELMAHL
KTEHRKDLHTDIKSSIVPFSFDTPQLQCAVCKIEYSNFKLLQEHMNSHFGNHICAMCGGG
FVTERLLTTHMKRHKTGEYKCEDCDKVFDNEEKMKEHQKRSHLGHNKRNKCLICEERFVD
YWKKVEHMVEIHGAPPVVLKCQACDRTFRNQRALSRHTKKDHLMERKNKCPECDMRFFSK
SSLQRHMAKHTGLRQFSCDVCFKSYGRKNTLREHMRIHADDRRFACIHCGQAFVQKCSWR
SHMRSKHGDDV