DPGLEAN11064 in OGS1.0

New model in OGS2.0DPOGS207860 
Genomic Positionscaffold667:- 25595-31181
See gene structure
CDS Length3105
Paired RNAseq reads  559
Single RNAseq reads  1391
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009945 (0.0)
Best Drosophila hit  crooked legs, isoform A (2e-41)
Best Human hitzinc finger protein 808 (4e-48)
Best NR hit (blastp)  PREDICTED: zinc finger protein 107-like [Saccoglossus kowalevskii] (7e-54)
Best NR hit (blastx)  PREDICTED: zinc finger protein 107-like [Saccoglossus kowalevskii] (1e-64)
GeneOntology terms





  
GO:0005634 nucleus
GO:0046872 metal ion binding
GO:0003677 DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
GO:0005622 intracellular
InterPro families

  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL20621

Nucleotide sequence:

ATGTCATCCGATGAATCAGATGATGAATCCCTAGCATTCTTAGCGGCCTCAAAAAGAATT
AAGGTGGAAGAAGATGAATTGTTAAATAAAGGAAATGAAGACCCTTCCACAAAAAAGGTT
AAAGCCACAAAGAAAACTGATCGTAAACTTAACGTTGGCGCTCCAAATGTTATAGAAAGA
CCTGCGGATGTATGGCTGTACCTTAAAGATTTGAAACCCTCAGGCCCTTACAGCTGCTTA
CTGTGTGATGATTGGTTTATTAATCGATCTAAAATGATTTTGCATTACGCAGTAAATCAC
AAAAAAGATTTTTGTGGTATATGCAGATACTTCGTACCAAATAGACAGGCCTGGTATGCA
CATGAGAAATTTCATTCACCCTGGCCATGTTCACAATGTGTAGAAACTTTTACCTCGGAG
CTAATGTTGAGAGAACATCTTAATTCTGCACATAATCTGGTCCACTGTAGATTGTGCCAC
TTCAGAGTGTCTGCTGATTTCAATTACAACTCACATTTATTTGAAAAGCACAATGTCACT
AATGTATCTTCCAAAAATGAGGATGTTTTATGGAAAGTAGAAGGTGGTACGTTTCAATGT
TTACTCTGCTCTAAATCAGAAAATACACTATCAACTTTCTTTGGACACTTTATGGGTATT
CATCATTTAACTTTAAAGTGCCTCACATCAGTTATAGCCGGCAGAGACACACCTTTCACA
GTGAAAGGGGCTGATGTTAGTGAAAAATTTATTAATGAGCAACTCAAAAGCCATGTTCGA
TTAGGTTATGTAGACTGGGAAACAAAAGATGATAAAACAATGAATGAATTCAAAAAAGAA
AAGGATTTGGAAAATAGTTTGACCAAAGTAGAACAAAGTGCATCTGTGAGAGAAATAAAA
GAAGAAGTAATTAGTGATGAAGAAGAATTAGTTGAGAAAGAAAATGAAGTTAATCAGAAA
GATGAGCCTTCAAGCAGTTATTATAAATGGGCTGAGGATTTTGACATTACATACATGGAA
ATTATAATAGTCCATAAATCATATTATGACTATGTCGACACGTCCCTCAGGGATATCAAT
TCAAATTTAATGCCAGAGAAATCGTATTTGAATTACGAAAGAATGAAAGCAGAAATATAT
ATGGACGTTGAATGCGGATTTTGTAAAACGAACTTTGACACAGCACAGTCGTTTGTTGAA
CATATGAATAAAATTCACAGTGTTAAATCAGTACCCTTATATTCTTGTAGAGTGTGTTGC
GAGACGTTTGATAATTATTTAGATTTATGTACACATGTTACCGAGGAGCTGGCAGACTTT
GAGGATCTTTGGATTTGTCAGTTTTGTGACAAGGAGTTTGATAACCGTGAAGAGACAAGA
CATCATCTGACCGAGCATTGGACTGCTTTGGACTATGATAACTGTTTTAGTCCGCATTTA
GGTTTTAAATGCAAATACTGTCCGACATTATTCTGGAACGAACCGGACAGAGAGACTCAT
CAACTTAGAGTGCATTTAGATAAATATAAACATCAGTTCTACAAGTGCGAAAAATGTGAT
ATGGAATTCGGCGATAAGGTCTGGTATGTATACCATCATTTAGAAAACCATCAAAACCCA
AATGCAGTAACTAACTATATCCTAAAGTGTAACATCTGTTGTTCGGTGATGGCAACCATT
GAGGAGATGAGGAATCACTTTGCAAGAAACCATCTCGAGTTCAAGAAAGTCTACTGTAAC
ATAGATCCTTGCTGTTATAAGCCATTGAATCACCAGCGGTCTCTAAAAATACATATTAAG
ATGGCGCACAGGATAACAGATTTACCCAAAACTCCGAGAGTGCCAAAAACTAAAAAGAAG
GTGTCATGCAACATGTGCAATCGTAAGTTCAACAACGCTCGTGCTTGCAGCACACACATG
GCACAGGTGCATGGACCTGGGAAGTTCAAATGCAAACTGTGCCGCGAGGTGCTGCAGACT
GCTGATGAAAGGAAGCTCCACTACCTCCTGTGCCACCCTGGTCGGCATCCATTCGAGTGT
ACTGAGTGCGGAAAATCATTTCAATACAAATCATCACTGTACATGCACAAACAGGAACAC
ATGCCGAATAAACAGAGCTACACCTGCAGTTACTGCAGTAAGGTTTTCGCGAAGAAGGAT
TCGTATCGTGAACACGTCCAGATACACGAAGGTCCTCGCCACGCGTGCTCGTACTGTCCG
ATGAGGTTCGTCCAACGTTCCAACATGTTGAGACACGAACGACGGCACACAGGCGAGAGA
CCTTACAGGTGTCCTCATTGTACGAGGACCTTCGCTGATAAAGGGGCCTGCACTTCACAT
GCTAGGACACATTCGAAAGACTCGTCCTATGCCTGCGTGTACTGCGGTCAGACGTTCGTA
CAGAAGTCGAAACTCACGTACCATATCAGGAAACACACGGGAGAAAATTTGGAGTCGTGT
TCCGTCTGTTCGAAGCTGTTCACCAGCGCGTGCTCGCTGCGGGAACACATGAAAATACAC
GTGGAGAAGAAGAAGATCGTCAAGTGTCCTCTATGCGACAAGGGCTATCAGGACGAGCGT
TATATGCTGCGTCACCTCCGCACGCTACATTCCCGTTCACAGTTCTCATGTCCGTTGTGC
CACAAGCTCCTCTCCAGCGCTGCAGGTCTCCGTCACCACGTCATAACACACAGCTGCGTC
AACACTTTCCAGTGTAAATCCTGCACAAAATCCTACGCAGTGAAAAGGACCATGTTGAAG
CATTTAAGGAAGCGGCACGGCTTAACGGGCAACGAGTTAAATATAAAGGATTACTACACT
AGATTAGAGCCACGCGAGTGTCAATTGGATCTAGACGAGACAACGATGACCAGTATATTC
GGACCTCCCAAGAAGAAATCGACGGACATATTGTTCGGGGATTTCGTAACTTTGGCTAAG
AAAATCAATGGACCAGAAGAAAAGAAACGAGATGGAGATAGTAGTAGCGATGAGCCGGTT
ACAAGGATTAAGCAAGAGGTCCAGAACCAAACTGAAATAGAAATAGAACCAACAGATTTC
GTCAGTGTTAAGATTGAAAGTGTGGACGCTGGTTATACAGAGTGA

Protein sequence:

MSSDESDDESLAFLAASKRIKVEEDELLNKGNEDPSTKKVKATKKTDRKLNVGAPNVIER
PADVWLYLKDLKPSGPYSCLLCDDWFINRSKMILHYAVNHKKDFCGICRYFVPNRQAWYA
HEKFHSPWPCSQCVETFTSELMLREHLNSAHNLVHCRLCHFRVSADFNYNSHLFEKHNVT
NVSSKNEDVLWKVEGGTFQCLLCSKSENTLSTFFGHFMGIHHLTLKCLTSVIAGRDTPFT
VKGADVSEKFINEQLKSHVRLGYVDWETKDDKTMNEFKKEKDLENSLTKVEQSASVREIK
EEVISDEEELVEKENEVNQKDEPSSSYYKWAEDFDITYMEIIIVHKSYYDYVDTSLRDIN
SNLMPEKSYLNYERMKAEIYMDVECGFCKTNFDTAQSFVEHMNKIHSVKSVPLYSCRVCC
ETFDNYLDLCTHVTEELADFEDLWICQFCDKEFDNREETRHHLTEHWTALDYDNCFSPHL
GFKCKYCPTLFWNEPDRETHQLRVHLDKYKHQFYKCEKCDMEFGDKVWYVYHHLENHQNP
NAVTNYILKCNICCSVMATIEEMRNHFARNHLEFKKVYCNIDPCCYKPLNHQRSLKIHIK
MAHRITDLPKTPRVPKTKKKVSCNMCNRKFNNARACSTHMAQVHGPGKFKCKLCREVLQT
ADERKLHYLLCHPGRHPFECTECGKSFQYKSSLYMHKQEHMPNKQSYTCSYCSKVFAKKD
SYREHVQIHEGPRHACSYCPMRFVQRSNMLRHERRHTGERPYRCPHCTRTFADKGACTSH
ARTHSKDSSYACVYCGQTFVQKSKLTYHIRKHTGENLESCSVCSKLFTSACSLREHMKIH
VEKKKIVKCPLCDKGYQDERYMLRHLRTLHSRSQFSCPLCHKLLSSAAGLRHHVITHSCV
NTFQCKSCTKSYAVKRTMLKHLRKRHGLTGNELNIKDYYTRLEPRECQLDLDETTMTSIF
GPPKKKSTDILFGDFVTLAKKINGPEEKKRDGDSSSDEPVTRIKQEVQNQTEIEIEPTDF
VSVKIESVDAGYTE