DPGLEAN07502 in OGS1.0

New model in OGS2.0DPOGS210200 
Genomic Positionscaffold2271:+ 737-2611
See gene structure
CDS Length1782
Paired RNAseq reads  93
Single RNAseq reads  257
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003260 (6e-104)
Best Drosophila hit  CG6654 (8e-62)
Best Human hitzinc finger protein 569 (2e-61)
Best NR hit (blastp)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (3e-77)
Best NR hit (blastx)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (5e-83)
GeneOntology terms



  
GO:0046872 metal ion binding
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
GO:0005634 nucleus
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL20304

Nucleotide sequence:

ATGGAGAACGACGATAGGACTGTGTGCGAAGTTTTGAGTTTCATTACGAGCTTTGATATA
ATTATAAGTGAAAAATATCCCAAACAGATATGTAACGAGTGCTTTATTATGATACGGAAA
ACTGAAGAGTTCAAAACGCGTTGTATTCAGTCAGAAACATTACTAAAAAATGAATTTTTA
AACTGCTCCGTGTTTGTTGACGATTTTGTAAAGCAAGATCTAATCGCGAATAACAATTTA
TCTCCGTCTTTCAATAATGATGATAGTTTAAGTAACGAGAATGATACACTTAAGCTTGAT
GTTTTGAACGTGATAAAACAGGAGAGAGATGATTTCTTACAAGGAGACATAGAAGATTTA
TCCAATACTAATCTCATTGATAAAATTGAAGTACCAGAAACAAGTCCTTTGATTTGTCTT
CCCAAAACTATAGCAAGCGAGTATGACAGGGAGTACGAAGATATAAAACAGTTCAAGTCA
CATTTGGAAAAATGTAATCATGAAGAAAGCACCAAGAAGTGTGAGAATTTATTTATTGTT
TGCAAAAAAGAAACGGCATGTTCGGAATTATTAAAAACAGGTTCAAATACACACGAAAAT
GCACAAAATGTGTGTAACTTCGACCAAAACTTTACAAATGGTTGTAATATTAGCAATAAG
TATGACTTTAGTCAAGTAGATAGCACATGTAGCGATGAGATTTCTTGTACAACAAAAATC
TTTAACCAAGAAACTCCTGTTACAAAACAAATAAGTAATGAAAAAAAATCAAACAGTTTG
GCAAAGAGACGGGAAACACTAAAACCAGGTGCAGTTGATGAGTCATCAACAATATCCTGT
CCTCACTGTACTCGGGTACTGCCAGATGTGAAAGCATTTGAAAAGCATCTAGAAAAGCAC
AGAAAATGTGCAAAAAAGAAAGAATTCCAATGCTCAACGTGTTCTAGAAAATTCCTAAGT
AGAAAACTATTGACGGAACACATCAACTCGCATCTAGATAAAAATGATCAGAAATTTGTT
TGTACGACATGCAAAAGAGAGTTCAAACATCAGGCCCATTTAGAGAACCATATAGAAAAT
GCGCACACCAAGATAAAAGGCTTCAAATGTGACACATGCACTAAGAGCTTCTCCAACCAG
GAAAGTCTTGAATTCCACAAGAAACAACACACAAACACTAGGAAATATCAATGCAATGTG
TGCAGCAAGACGTTCGCCGTACATTCAGTGCTCAACGAACACTTGCGGACTCACACAGGC
GAAAAGCCGTTCCTCTGTTCGATTTGCGGAAGAGGTTTCACACAGAAGACGAATTTAGCC
CAACACATGAGGCGGCATCAGGGTTTGAAACCTTACAAGTGTAGTAACTGTGATAGAAGT
TTTGTATCTAAAGGCGAGCTCGATGCTCACACTCGTAAGCACACCGGCAAGCATCCGTTT
GTATGTGATGAGTGTGGGAACGGCTTCACAACGTCTAGTTCGTTAACAAAACACAGAAGA
ATACACACCGGTGAGAAAAGATATGCTTGTGATCTGTGCTCAATGAGATTCACAGCTTTA
GGGACATTGAAGAATCACAGACGGACACACACCGGGGAGAAGCCGTACCAGTGTATGTTG
TGTGAGAAGGCGTTCATTCAGCGGCAGGATCTCGTGGGTCATATCAGATGTCACACCGGA
GAGAGGCCCTTCACCTGCACGAGCTGCGGGCAAGGGTTCAGGAAGTCGTCAGCTTTGAAA
GTGCATTTAAGGAGTCACGGGAACGACATGCTTGTTATGTGA

Protein sequence:

MENDDRTVCEVLSFITSFDIIISEKYPKQICNECFIMIRKTEEFKTRCIQSETLLKNEFL
NCSVFVDDFVKQDLIANNNLSPSFNNDDSLSNENDTLKLDVLNVIKQERDDFLQGDIEDL
SNTNLIDKIEVPETSPLICLPKTIASEYDREYEDIKQFKSHLEKCNHEESTKKCENLFIV
CKKETACSELLKTGSNTHENAQNVCNFDQNFTNGCNISNKYDFSQVDSTCSDEISCTTKI
FNQETPVTKQISNEKKSNSLAKRRETLKPGAVDESSTISCPHCTRVLPDVKAFEKHLEKH
RKCAKKKEFQCSTCSRKFLSRKLLTEHINSHLDKNDQKFVCTTCKREFKHQAHLENHIEN
AHTKIKGFKCDTCTKSFSNQESLEFHKKQHTNTRKYQCNVCSKTFAVHSVLNEHLRTHTG
EKPFLCSICGRGFTQKTNLAQHMRRHQGLKPYKCSNCDRSFVSKGELDAHTRKHTGKHPF
VCDECGNGFTTSSSLTKHRRIHTGEKRYACDLCSMRFTALGTLKNHRRTHTGEKPYQCML
CEKAFIQRQDLVGHIRCHTGERPFTCTSCGQGFRKSSALKVHLRSHGNDMLVM