DPGLEAN00989 in OGS1.0

New model in OGS2.0DPOGS215723 
Genomic Positionscaffold788:- 58908-63620
See gene structure
CDS Length3453
Paired RNAseq reads  6539
Single RNAseq reads  16574
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005815 (0.0)
Best Drosophila hit  CG9715 (4e-32)
Best Human hitzinc finger CCHC domain-containing protein 7 (3e-09)
Best NR hit (blastp)  AGAP006128-PA [Anopheles gambiae str. PEST] (4e-37)
Best NR hit (blastx)  AGAP006128-PA [Anopheles gambiae str. PEST] (3e-35)
GeneOntology terms
  
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
InterPro families  IPR001878 Zinc finger, CCHC-type
Orthology groupMCL14608

Nucleotide sequence:

ATGGAAGAGGAGCTTAGCGAAAATGAGCTGGAGGAACAAATGTATGCTATGATACACTAT
GTTGACGATACGCAATCAAATGTCAACTCAAACCAAAACGATAACAACATTGTTGAGAAT
GTTCCTCAGAGTACCGTACGTCGCTACTGGCGTACTAATGTAGACCAAAACACACCTTAT
CAGAAAATAAACACACCTAAAGATTCTACTAACAACAAAGAAACAGGTGAAAAGAAAGAT
GACAAAAGTAAATCCTCAGACCAAAACACATCAGATTTGTCTCTTTTTCAACAACCGGTA
CCCTCGAATGTCAAGAAAACTGTAGAAATATTGGAAAATGATGATGACAAAAATATAGTG
GAACTCGAAACAAGCGACGAAGATGAAGTTATTGAAGTGGCACTTCCACCCAAACCCACC
ATCACCATTGAGAGTTCAGACGAAGATGATGTCTGTCCAGTTGATCCAGAGCCTGACATT
AAACACAAGCCAACAAAGCCCACACAGGAAATTAAAAACTCAGTTGACAGAGAAGTCACT
ACCAGTCCAGTACCATCCGTGGTGTCATCTATCTCAGATGACTTCATAAGAGGAGACTGC
ATCGCACTCAATATATCATCAAAACATCCAAATAACCAAAGCTTTGATTTCAGTCTTCAT
GGCTCCGATCTTCTCGACCAAACACCATCGAAGAAAAAAAAGAAAAAGAAAAGCAAAGAA
AAAAATACACCATCATCAGTAACAACGCCTCTTTCAGTATCAACTCCCGTGAGCTCAAAG
CAAACGGCTGGTGCAGTTGATGAATGTTTTGCTACTCCCAAGAGCAAGGCCAAGAATAAA
CGCCAAAGGACAAAATCATATCGAGTTTCAGAGAAAAGTTTACCAAATGCTGACGTGTAT
GACTCAGACAGCAACCAATCACTGAATGAGAGTAACAAAAACCAGATGACATATGAGGTT
ACCGACAAAAGTGTACCAAGCACTGATGTCTACGAGTCCGATTCCAATCCATCAGAAAAT
GCTACAGAATCTGTATCTAAAGAGGCTATTAATGACGCTGAAAGCTCAGAAAGCACAACG
GAGAGCCCCGTTGTTGAAATAGTCAAAACATCAAAAAATACTGACATTTCCAATAAGTCC
GTAGTAGACCTCACGGAACCAGCCATGAACACAAGTATAGACGAAAACATAGTGATGGGT
AACGTTACGGGATTCACAAACATGGAAGAATTCAGTGATCATGATATTTCAGTCAAAGAC
ATATCCAAATGCGGCTCAACTAAAATACCAGCCATCCTTAATGAGGATCTCGATTTTGAC
AATCTCAAAGGCAGCAACAAAGTGTGCAAACGACGACGATATTCACTAACTACATTGCGA
GCCGAAATGGAAAAGTTCTACAACGAAAGCTGGGGAGGAGAGGAATTCAACCATCGGGAG
ATACAGAAGAATATGTCACGTGACAAAAGTTTGTGGGTAATTGATGCAAAGGACCGTATG
CCATCGCTGACTAGACGAAAGACCACATGCAACTACTGTAACCGCGCCGGCCACCGCGAC
GACGCGTGTCACTTCAAGCCGCCCGTGTGTTTCATGTGCGGCGACGCGGGACACTACGAA
CCGCGCTGTCCCAGGAAGATATGCGTCAATTGCGGGTCACCCAACTACGTGTACTCCACG
ATGTGTCGGAACTGCTCCACGTGGAAGTGCATCAAGTGTGCGGAGTGCGACCAGAGCGGT
CACCCGGCCAGCCACTGCCCGGACGTGTGGCGCAGATACCATGATACCTTGTCGTTGGAG
ACTCCGTTGGAAGAGAATCGTCAAACGAAGAAGAATCACCAGATGTTCTGCAGTGGTTGC
ACGCGCCGTGGTCATTTAGTCCACACCTGCCGTCTCTCTCTACCGTTCTCAGGCCTGCCG
ATGAACTCCCCATACGTCTCGGTCTACCGACCCGTCTACCAGATGCTGGACACTAACAAC
CAAAGTAACGATATCGGTAACAAGAAATTTAAGAACAGGAATAATTCTGAAAATTCCTCA
ACGATCAGACAGGACAGAATGAAACGACAGTCCAAGTCGCCGACCACCCACGATTCACAT
CTCAACAAGAAACGTAATATGGGTACCATTGAAGTTGAAATAAGCTCAGGAAACAAGTTT
CCTACGGGGAATCAAAGGAAAGTTATAATCTCTGAAGAAAATCCCAATAACAGCAGCAAA
ATTATCACAGAAACAAACATCAATAAAAAATCCACCGAAGTTCAAAGTACAGAGAGGGCT
CCAGACTTTATACCGATAACATCATCAGACAATCGAGACAAGAGGGGACAAATAATACAA
GACAATGAAGTGTCGGACACGAGCGAGGTCATCACATCCGCGAGGGTCTACATCACCAAG
GAGATAGCGGATCTCTTAATGACAGATGAAGGAAGCCTGTGGCTCAACACGACCATCAAA
AACAACGATCTGATATTGGAGAATGACACCATAACATTCTACCTGAGCATCAACGGAACA
GTCGGCAACCAGGAGGCCTTCCAAGCTGAACTGGGAGAGTGGATCAAGAAGAAACAAGCC
GGCAGAGAGAAAGAACGATTTGTGTCCGAGAGTGAAACCGACGTTACCCAGGAAGGTACA
AACGATCAGCAATCGTTGACGAATAACATACCCAAGAACAGAAACAACGCTTTGCGCAAA
CTGAACAAAGCCTTCGATTCATTAAAGAAAGATCTGGGAGATCCGAAGACCATTTATAAG
GAGCTGACGTATTTGCAAAATAAACATCAGCAACTTATAAACCAGAAAGTCATAAGCCCC
AAAAAACTGTCCAACAACAGGGACAATATTAATCTGATGCTGAGAAAACTTAATATGGTA
CTTCTCGGGCAAGCTGGTCTAGCGGACGGCTCCACACATTTAAAGGAACTGTACTCCTTA
CAAGAGAAACTAAGCAATTTCAGGCAGAAAAATATACCGACGTCGCTGCGCGAGGAAATC
GGTGAGCACTTTCATTGCATCTTCGCTGCGATACCCAGGGATGATTACATAGAACTCTTA
AGTAAATTTTACAATAAACCGGTCATAACGTTCAAGAAGAAAAATGATAGGTCCTTCAAA
GTCAGTCCGAAGCCAAACCAGAAGACGTTGAATCCGATCCAGAACATACAACGCAACGTG
AGCGGCGTGAAGGATGACACGAAGGAAAACAACGTGGCCAACGACACGTCCCAACTGACC
GCGGCGACCAAGAACAAGCTGGTGTTCTATCACAGGCGGTTGCTGCGCTCGCGACCCATG
GACGCGGTTCTCAAGAAGACAAAAAGCGAACTGCTAAGGAAGCTCCACTTCAATCTCGCC
CTATTAGGCGACAAGGCTCATATATCTTCGAAGGCTCTGAAGAAAATGAGAAAGATTCAA
GAGCAGGCCCAGCTGTTCTTAAATAACTTTTAG

Protein sequence:

MEEELSENELEEQMYAMIHYVDDTQSNVNSNQNDNNIVENVPQSTVRRYWRTNVDQNTPY
QKINTPKDSTNNKETGEKKDDKSKSSDQNTSDLSLFQQPVPSNVKKTVEILENDDDKNIV
ELETSDEDEVIEVALPPKPTITIESSDEDDVCPVDPEPDIKHKPTKPTQEIKNSVDREVT
TSPVPSVVSSISDDFIRGDCIALNISSKHPNNQSFDFSLHGSDLLDQTPSKKKKKKKSKE
KNTPSSVTTPLSVSTPVSSKQTAGAVDECFATPKSKAKNKRQRTKSYRVSEKSLPNADVY
DSDSNQSLNESNKNQMTYEVTDKSVPSTDVYESDSNPSENATESVSKEAINDAESSESTT
ESPVVEIVKTSKNTDISNKSVVDLTEPAMNTSIDENIVMGNVTGFTNMEEFSDHDISVKD
ISKCGSTKIPAILNEDLDFDNLKGSNKVCKRRRYSLTTLRAEMEKFYNESWGGEEFNHRE
IQKNMSRDKSLWVIDAKDRMPSLTRRKTTCNYCNRAGHRDDACHFKPPVCFMCGDAGHYE
PRCPRKICVNCGSPNYVYSTMCRNCSTWKCIKCAECDQSGHPASHCPDVWRRYHDTLSLE
TPLEENRQTKKNHQMFCSGCTRRGHLVHTCRLSLPFSGLPMNSPYVSVYRPVYQMLDTNN
QSNDIGNKKFKNRNNSENSSTIRQDRMKRQSKSPTTHDSHLNKKRNMGTIEVEISSGNKF
PTGNQRKVIISEENPNNSSKIITETNINKKSTEVQSTERAPDFIPITSSDNRDKRGQIIQ
DNEVSDTSEVITSARVYITKEIADLLMTDEGSLWLNTTIKNNDLILENDTITFYLSINGT
VGNQEAFQAELGEWIKKKQAGREKERFVSESETDVTQEGTNDQQSLTNNIPKNRNNALRK
LNKAFDSLKKDLGDPKTIYKELTYLQNKHQQLINQKVISPKKLSNNRDNINLMLRKLNMV
LLGQAGLADGSTHLKELYSLQEKLSNFRQKNIPTSLREEIGEHFHCIFAAIPRDDYIELL
SKFYNKPVITFKKKNDRSFKVSPKPNQKTLNPIQNIQRNVSGVKDDTKENNVANDTSQLT
AATKNKLVFYHRRLLRSRPMDAVLKKTKSELLRKLHFNLALLGDKAHISSKALKKMRKIQ
EQAQLFLNNF