DPGLEAN06922 in OGS1.0

New model in OGS2.0DPOGS205369 
Genomic Positionscaffold100:- 86972-94172
See gene structure
CDS Length3279
Paired RNAseq reads  430
Single RNAseq reads  1102
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008762 (3e-24)
Best Drosophila hit  crooked legs, isoform A (4e-18)
Best Human hitzinc finger protein 208 (3e-24)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_71028 [Branchiostoma floridae] (1e-47)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_71028 [Branchiostoma floridae] (1e-62)
GeneOntology terms

  
GO:0003676 nucleic acid binding
GO:0008270 zinc ion binding
GO:0005622 intracellular
InterPro families

  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL40326

Nucleotide sequence:

ATGCACGAGGAACATCAGACTTTTAAAGTCGAAACAGCGTTCGCACATTGCAATGAGGGA
TACTTGAAAGCTGATTGCACAGACCTCAAGTGCAGAATCTGTTGGGAACCATTTAAAAAG
TTGGATGATGTCGCCAAACATATTAACGATGTCCATAATATTAAAATACTATTTGAATTT
CACATTGGCATACAACCTTTTAAGTTCGATGATGAAAAGCTTTTATGCGGTATATGTGAT
AGGAATTTCCCTTGTCTCCGGCAGCTTAGTCGTCACATGACTTCCCATTACCAGAACTAT
ACCTGCGAAGAGTGCGGGAAGTCTTATACTACAAACAGCTCTCTACAGCAGCATATCAGA
TTTTCGCATATATCAAACAAAAGGATTTGCAGAAGGTGTAAGAAGACATTTAATTCGTTG
CTAGATAAAAGGGAACATGTTAAATCTTCATCAAAATGTTGGGCTTACCAGTGCGCCGTT
TGTGGTGTACGATTTATGACGTGGACTCTGAAAGAACAGCATTTAGAAAATGTTCATGGA
CAAGCTAAAAAGACACATAAGTGTCCCGAATGCGCAACAATATTCTTGACCAGGAACGCC
TACAGAAATCATTTTGCGACAATGCACGCGGGAATTAATTTCGTTTGCTCTTATTGCGGA
CCCACGAAGGATATTGACGGACTCAAAGAGAAAGAGAGAAAACAAAAAAAGGGGTTATTC
GAAAGATCCGTTAAACATAACCCCCAGCGTCGAAACGCTGTTTTGGTTCTTAGACATTCC
ACGGCTATTCCTTTTAAAACACGATTTAATAGAATACTCTGCTCTTATTGCCATGATGAA
TTCCAGCCCATGGAGGCTCTTAGAATACATATTAAGGAGAAACACATTAACGCTGATTTT
AACAGTGCTTTTTATAAGGTGGTCGATGATCTCAAAATTGATATAAGCCATTTCAAATGT
AACATATGCTCGCAAGACATTGAGAACGTTGACACATTTATGAATCATTTATCAGGGGAT
CATGGGAAACCAGTTAATTTTGATGTACCTTTCGGTGTGTTACCGTATAGACAGAATGAG
ACTGGTGCTTGGCTGTGCCTTCACTGTGATAAAATATATCCGGAATTCTCCCAAATAAAC
AGCCATTTACGAACCCACGCCAAAATTTCTACTTGCGATAAGTGCGGGGCGACTTTCCTC
TCGGAGCACGGTCTAAAACAACACGAGCGTAATTTCCAATGCTATAAAGCAACATACAAA
CCTCGCTTCGGTAAAGCCTTGAAGCATAAATACAATACTGAAATTATTTTACAATGTTCA
ACTGCATGTCCTTTCAGAACGTGGGGACAAAATTTTAACTGTGTCTTTTGCAGAGTGCAA
TCAAATGATCCCAATGGGCTACGAGCTCATATGGCATCCAGACATGCCAACTTTGACATA
CAACTAGTATTTAGCAGGAAATTACGAAAGGAATTTTTAAAAGTCGATATAACAGATTTG
CAATGTAAACTTTGCTTCATGCACATTGACACTTTAGATGATTTATTGACACATCTCAAA
AATGATCACAAACAACCGGTGAACATAGACGTCCAACCGGGGGTCTTGCCGTTCAAGTTG
AACGACGGCTCTTGTTGGAAATGTGCTATATGCAAAATACAGTTCTCCGATTTCATATCG
TTAAAAAAACACACAGCGGAACACTATCAGAACTACGTTTGCGACACATGTGGGGAGGGT
TTCATAACAGAAGTCGCATTGCGGGCGCACACGAAAATACCGCATGATAATAAATACACC
TGCAGTAGATGCGTTGCGACGTTCTCCACGTTAGAAGAGAGAAGTGTTCACATAAAAACA
CAACACACGAACCTACCGTACATGTGTACTTATTGCAAGGACAAACCGCGGTTCGCCACC
TGGGAGCTCAGGAAACGGCATTTATATGAGATACACAATTATAAATCAGGGGCGGAAATG
TACGAGTGTACCACCTGTCACATGATGTTCAAGACGCGATCTCAGAAATACCACCACAAC
GTCAAAGTTCATCGGACAAAAAAGGAAATAGATTTCGGTTTCTCTTGCGGCCACTGCGCT
AGAGGAAGAGGCAGAGAAACAGAAAATATCACTGATATAAACATACTCCAACAGTCACAT
CTCGATACAGATAATGACATTAAGGTTGGAGAGAAACGAAAGTATCAAAAAAGTGCTAGA
TCCCAAGCGAGATTCATGACAAAGAAGAACGCAAGCTCTATTCTCGAATGCTGGTCTGGA
ATACCATTCAGATGGAAAAAAAATAGATTTAAATGCGCCTATTGTGAAGAAAATTTTAAT
GAGTGTTCGGATTTGAGGGAGCATGTTAGATTATGTGCTACCCAGTACAATGTAGGCAGT
ATATTCAGTAAATTTAAAGAAATGACTCTCATAAATATGGATGTCAGTGAGGCCGCTTGT
CGAATATGTTCTGAGCCGTTCAGAGAACTTGATGGTATGCGAGAGCACGTCATTCGACAC
GGCTACGAATTAGATGTTTCGCATCCGGACGGTGTTATACCGTTTTGTCTCACGAAAGAA
TCCTGGTCGTGTGTCTTATGTCGCGAGACATTCAATAACTTTCTGAAACTTTACGAGCAT
ATGAACACGCATTATCAGTACCACATATGTTCTATATGCGGAAAAGGTTACATGACTGGA
CCGAGGCTAAGGAAACATCTAGAATTACACATAACGGGAACATTTCCTTGCGATAAATGC
AAGAAAGTTTTCACAAAACGCACAGGGAGAGACAATCACAAAGCCTACGCCCACGCCAAA
GGTCCGCGTTATGAATGCCCACAATGTAATATGAGATTTGAAGGTTATTATGATCGAATG
AATCATTTGAAACAAGCTCACAGAGAAAAGGAAGTGAAGTACGGCTGTTCACACTGTGAT
CTGTCATTTAAAACGAGCGGCAAGCGAGCCATCCACGTCAAAACGGTTCACTTTCCTCGT
CAGAGTAACTTTAGTTGCCCTTATTGCAAAACCCTATTCAAAACAGCCTTCGGTATGAAA
CGTCACATGGTAAAACATAATGGAGAAACCTGTACTGTTTGTGGTGAAAGTTTTACTAAA
AGTAAAGCATTGAAGGAACACTTGGCAGGTCACGCTGATGGACTTCGTTGTAAATGGTGC
GGAAATAATTTTAAGGAAGCAGCACTTCTGTCGACACATACGCGTGAGAAGCATCCAGAA
GTAGACGAATTGATGATGTCCGGGCTGGTTAATGTTTAG

Protein sequence:

MHEEHQTFKVETAFAHCNEGYLKADCTDLKCRICWEPFKKLDDVAKHINDVHNIKILFEF
HIGIQPFKFDDEKLLCGICDRNFPCLRQLSRHMTSHYQNYTCEECGKSYTTNSSLQQHIR
FSHISNKRICRRCKKTFNSLLDKREHVKSSSKCWAYQCAVCGVRFMTWTLKEQHLENVHG
QAKKTHKCPECATIFLTRNAYRNHFATMHAGINFVCSYCGPTKDIDGLKEKERKQKKGLF
ERSVKHNPQRRNAVLVLRHSTAIPFKTRFNRILCSYCHDEFQPMEALRIHIKEKHINADF
NSAFYKVVDDLKIDISHFKCNICSQDIENVDTFMNHLSGDHGKPVNFDVPFGVLPYRQNE
TGAWLCLHCDKIYPEFSQINSHLRTHAKISTCDKCGATFLSEHGLKQHERNFQCYKATYK
PRFGKALKHKYNTEIILQCSTACPFRTWGQNFNCVFCRVQSNDPNGLRAHMASRHANFDI
QLVFSRKLRKEFLKVDITDLQCKLCFMHIDTLDDLLTHLKNDHKQPVNIDVQPGVLPFKL
NDGSCWKCAICKIQFSDFISLKKHTAEHYQNYVCDTCGEGFITEVALRAHTKIPHDNKYT
CSRCVATFSTLEERSVHIKTQHTNLPYMCTYCKDKPRFATWELRKRHLYEIHNYKSGAEM
YECTTCHMMFKTRSQKYHHNVKVHRTKKEIDFGFSCGHCARGRGRETENITDINILQQSH
LDTDNDIKVGEKRKYQKSARSQARFMTKKNASSILECWSGIPFRWKKNRFKCAYCEENFN
ECSDLREHVRLCATQYNVGSIFSKFKEMTLINMDVSEAACRICSEPFRELDGMREHVIRH
GYELDVSHPDGVIPFCLTKESWSCVLCRETFNNFLKLYEHMNTHYQYHICSICGKGYMTG
PRLRKHLELHITGTFPCDKCKKVFTKRTGRDNHKAYAHAKGPRYECPQCNMRFEGYYDRM
NHLKQAHREKEVKYGCSHCDLSFKTSGKRAIHVKTVHFPRQSNFSCPYCKTLFKTAFGMK
RHMVKHNGETCTVCGESFTKSKALKEHLAGHADGLRCKWCGNNFKEAALLSTHTREKHPE
VDELMMSGLVNV