DPGLEAN06923 in OGS1.0

Genomic Positionscaffold100:- 82913-86760
See gene structure
CDS Length2235
Paired RNAseq reads  133
Single RNAseq reads  371
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008765 (2e-36)
Best Drosophila hit  crooked legs, isoform A (2e-22)
Best Human hitzinc finger protein 208 (1e-43)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_61483 [Branchiostoma floridae] (6e-52)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_61483 [Branchiostoma floridae] (4e-68)
GeneOntology terms
  
GO:0005622 intracellular
GO:0006355 regulation of transcription, DNA-dependent
InterPro families

  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL40327

Nucleotide sequence:

ATGCGCCGCCGGAGACGCGCTAACAACGAACTCCCCGAAGAATCTGAGAAACGTATCGCG
AAAACTATGATGCGCAGAAATGCTTTAACTATACTAGAAAGTTCTACAGCTTGGGCCTTT
CGAACCGTCAAGAAAGTAATATGGAAACGAAGAAGACGGCCGTATAACGATCATAGAGAT
AACGCGGCCATTATTATTGAGTTCTCAAACGTGTGTCCTTTCCGTTGGAAACGCGGTGCG
TTTGCGTGCTCGTACTGTCCGAGTACCTTCGGTCATTTCAGAGACGTTTGTACCCACACT
CGGGAACACACCAGTCGAATAGAGGCAATGAGATACGCTCGGCCTTATGATAACATTAAG
ATTGAAGTCAGCAATCTTAAATGTGAAATTTGTGCTGAAAGCTTGAGTACATTGGAAGAT
TTGAAATATCATTTGATTGACGTACATCAGAAGCCAATTTTAAAACATCTGGGTTTGGGT
GTTACGCCGTTTGTGCTTCAAGACAAAGAGATGCTCTGTACGCATTGCGGAGAACGTTTC
TCGTTATTTTCCAAATTAAATAGCCACATGAACGAGCACTACCCCAACAATATATGTTTT
CATTGCGGCAAATCTTTCACAGCTGTACATAGAATGAAAGCCCACCAGCTCACCCACGAA
ACGAACGACCAGGAGCACAAATGTTCTAAATGCAATGAAGTCTTTGAAACTAGGGTGTTG
AGAAACTATCATATGAACACAAAGCACAGTCCGGTAAACCGATATAGATGCCCTTACTGC
AATTTGTCGTTTAAGCGCTACTCTGATAGATTAAGACATTTGAAGGAACTTCACGGCCAA
AACATTGAGTATCCGTGCCATTTGTGCTCAGCAGTGTTTGCAATGTCCGATCAGCGAACC
AAACACATTAAGAATGTCCATATAAAACATAAACAATTCCAATGTGATTTTTGCCCTAGC
AAATTCGTGACTGCCGGGCAATTGAAGAATCACATGGTCAAGCACACGGGAATACGTGAA
TTTCAATGCACCGTCTGTAAAAAGAGCTACGCGCGTGCTAGGACTCTCAAAGAACATATG
CGAAGAAACTCGGCAGTTGTTGACATAAAACAGGAAATGGATATAGATATGGAAAATTTT
GTAACTTACGACAGCATGGCTAGAGCTGAAATTGTGAAACGGAGAGCTGAAGGCAATGAA
AGAAGAGCCGCGTTTAGGAAAAATATAAAAATTATTATGACGTCATCCACCGCATATCCG
TTCAAATATATAAAGGGGACATACTTGTGTTTCTTTTGTGAGAATTCATTTCTAGAGCCG
GAAAAATTAAGGGAACACACTCAAAAAATACACACAGACCGAGCCTTTAAGCTAAAGAAA
TATGAGCCCTTGAAAATGGATTTTTCGGCTTCGATTTGTAAACTATGCGGCACAGGTGTG
GATGACTATTTAAATTTAAAAGCACATTTGCGTGATCACGGCAAAATATTGGATAGCACC
CACGGGGAATCAATTCTACCGTACAAATTGACTAAGGATGACCATTGCTGCCAAATATGC
GGAAAGCGGTATGAGATGTTTTTAAGTCTTCACCGCCACATGAATGATCACTACGAACAC
TTTATCTGCGAAACATGTGGCAAAAGGTTTGCCACTACGCAAAGACTACTGAACCACTCA
CGGACTCACGAAAGAAGAGATTTTCCTTGCAAACATTGTGGTGAGACGTCCTCGTCATAC
GCTGCATTATACGCGCACATTGCAAAAGTACACAGGCTAAATAAGAGATACAAATGTCCT
ATTTGTGACGAAAAGTTTGCTTCATATAAATATAGGATAAAACATCTGAATACCGTCCAC
GGTGAAAAAACTACACTATTCCCCTGTCCGTCATGTCCAAAGGTATTCGATCTCTGCAGC
CGCAGAACCGCTCACGTCAGATCTCATCATCTGCAGGAGAGGAACCACACTTGTTCCGTC
TGTGGGATGAAGTTCTTCAGTAACTATGAGCTGCAGGAGCACAGCGTGAAGCACGTCGGT
GCTAGAATATACCAGTGCGACGTTTGTAAGAAATCGTACGCCAGGTTGAAAACATTGAGG
GAACATATGCGTATACATAACAACGACAGACGTTTCGTGTGTCCGGCCTGCGGGGCCTCG
TTTATACAGAAATGTAGTTTAAAACAACACGTGAGAGTGCATCATCCCATTCAGGTCAAG
ACTGATATGTTCTAG

Protein sequence:

MRRRRRANNELPEESEKRIAKTMMRRNALTILESSTAWAFRTVKKVIWKRRRRPYNDHRD
NAAIIIEFSNVCPFRWKRGAFACSYCPSTFGHFRDVCTHTREHTSRIEAMRYARPYDNIK
IEVSNLKCEICAESLSTLEDLKYHLIDVHQKPILKHLGLGVTPFVLQDKEMLCTHCGERF
SLFSKLNSHMNEHYPNNICFHCGKSFTAVHRMKAHQLTHETNDQEHKCSKCNEVFETRVL
RNYHMNTKHSPVNRYRCPYCNLSFKRYSDRLRHLKELHGQNIEYPCHLCSAVFAMSDQRT
KHIKNVHIKHKQFQCDFCPSKFVTAGQLKNHMVKHTGIREFQCTVCKKSYARARTLKEHM
RRNSAVVDIKQEMDIDMENFVTYDSMARAEIVKRRAEGNERRAAFRKNIKIIMTSSTAYP
FKYIKGTYLCFFCENSFLEPEKLREHTQKIHTDRAFKLKKYEPLKMDFSASICKLCGTGV
DDYLNLKAHLRDHGKILDSTHGESILPYKLTKDDHCCQICGKRYEMFLSLHRHMNDHYEH
FICETCGKRFATTQRLLNHSRTHERRDFPCKHCGETSSSYAALYAHIAKVHRLNKRYKCP
ICDEKFASYKYRIKHLNTVHGEKTTLFPCPSCPKVFDLCSRRTAHVRSHHLQERNHTCSV
CGMKFFSNYELQEHSVKHVGARIYQCDVCKKSYARLKTLREHMRIHNNDRRFVCPACGAS
FIQKCSLKQHVRVHHPIQVKTDMF