DPGLEAN01842 in OGS1.0

New model in OGS2.0DPOGS205032 
Genomic Positionscaffold2126:- 27583-37366
See gene structure
CDS Length1662
Paired RNAseq reads  1355
Single RNAseq reads  3178
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001656 (6e-126)
Best Drosophila hit  CG11247, isoform C (1e-20)
Best Human hitzinc finger and BTB domain-containing protein 41 (7e-21)
Best NR hit (blastp)  PREDICTED: zinc finger protein 26-like [Oryctolagus cuniculus] (7e-29)
Best NR hit (blastx)  PREDICTED: hypothetical protein [Mus musculus] (3e-38)
GeneOntology terms





  
GO:0008270 zinc ion binding
GO:0005622 intracellular
GO:0003677 DNA binding
GO:0046872 metal ion binding
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0005515 protein binding
InterPro families

  
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL39568

Nucleotide sequence:

ATGGTGTACTTATGTGAGGCTATGGACCACACATATATGTCCCAATCATTGTCGTCATTG
GAATGCGTGATCAAGACCGACTTCGACCAGATCTTAGTCGACTACAACTTGCAAGAAAAG
GACTGGCCGCAGTACATAGCGATTAATGGAGAGGAGAACGTTAAAAATGAGGGTTACGAT
GTAGCACAGGCTTTGAGAGACCACATAACATTGAGTATTGATGACCAGGTCAACGTTATG
GGCGTTCCGGAACTAGTTTTGGAGAATCCGGTGACCGGTGTGATGTCACACATCGTGGTC
AATGCCGGATCTCTGACCGATTACAGCAGTGTCAAGAGGGAAATGCCAGAACTGACCATA
GATCCCACAATAAAGCCTGATACGGTGATAATAAGCCAGAATCCAAGAAATAGTCCGAGG
AAGGAGAAAGCTGATCTCAAAAGCAAATATACTAAGGAGCTTATGACTGATGAGGAAATG
CTCGCATTGAGAGAGGAAGCCAAACAGAAAGTGCAATACGTGAGTTCTGTGTATAAATGT
GAACTGTGCATCATAGGATTCTACACGCAGCAGCAGGTGGAAGATCACTTCGTGGCTATG
CACAGAGAGAAGCCGGGTTACGTACCGTGTAAAGTATGTTTCGTATATACACCGGAGAAC
AAGGTGGACGAACACACGGACACGCACTACAGCAGGTACACGTGCAAGATGTGCAGCCGG
CGGGAGACCAGCCTCAAGATGATGATGGTGCACCTCAGGGCCCACGAGAACAGGACGCCC
AGGGCGCTCATACAGATAGACGGGGAGAAGAAGAGCAGAAAGAGAAAAAACACGAAGTGT
GATGAAGAGGAAAAATCACCTCCCAAGCCGGGAGACCTCAGGAAGCTACTGTCCAAGACT
ACTATAGTCGGCTACAAGTGTTTGGAATGTGACATGTTCTTTAAAAATTCAAGGGCACGC
AAGAACCACGTGGATAGATTCCACCGGGAGGGTCTGCAGTGTGATCATTGTAAGAAGAGA
TTCGTTAACAGGACCACTCTCGCCACACATTTGAGGCTCCACGAAGGCCCGCTGCCCCGC
GAGGAGTGCCCCATCTGCCACAAGATGGTCCGCACGATACAGATCAAGTACCACATACAG
AGGCATCAGAGCACCACCAAGTACGAGTGCAGGGACTGCAACAAAATCTTCTCCCACCTG
GCGACCTATCAGGCGCATCTGAAGTTCTCGAGGGCCCACGCATCCGATCAAGTTTTTAAA
TTCCCGTGTCCCATGTGCAACAAGGGCTATCCGACAAAACAGGCTATGCAGGACCACTTC
AACTATCAGCACCTCAACAAGACAACGCACAAATGTCCCATATGCAGTAAGCCGATAGCA
TCCAAAGCGAATGTTGAGAAACACATGATGAGGGTCCACGGGGAGAAGAAATCTAAGCCC
AGGAAGCATGTGTGTCAGATGTGTGGCAAGGGGTTCACGGACAAGAAAGCCTTAACTCAG
CACGAGGTCATCCACTCCGGGGAACGGCCTCTCTCTTGTGATATTTGCCAGCAGACGTTC
AAACAGAAGGCATCCCTGTACACACACAAGAAACGAGTCCACAAAGTATTCCCAGCTAAG
AGAGTCGTTGAATTTATGGACAACGGTGAAAATAATACCTAG

Protein sequence:

MVYLCEAMDHTYMSQSLSSLECVIKTDFDQILVDYNLQEKDWPQYIAINGEENVKNEGYD
VAQALRDHITLSIDDQVNVMGVPELVLENPVTGVMSHIVVNAGSLTDYSSVKREMPELTI
DPTIKPDTVIISQNPRNSPRKEKADLKSKYTKELMTDEEMLALREEAKQKVQYVSSVYKC
ELCIIGFYTQQQVEDHFVAMHREKPGYVPCKVCFVYTPENKVDEHTDTHYSRYTCKMCSR
RETSLKMMMVHLRAHENRTPRALIQIDGEKKSRKRKNTKCDEEEKSPPKPGDLRKLLSKT
TIVGYKCLECDMFFKNSRARKNHVDRFHREGLQCDHCKKRFVNRTTLATHLRLHEGPLPR
EECPICHKMVRTIQIKYHIQRHQSTTKYECRDCNKIFSHLATYQAHLKFSRAHASDQVFK
FPCPMCNKGYPTKQAMQDHFNYQHLNKTTHKCPICSKPIASKANVEKHMMRVHGEKKSKP
RKHVCQMCGKGFTDKKALTQHEVIHSGERPLSCDICQQTFKQKASLYTHKKRVHKVFPAK
RVVEFMDNGENNT