DPGLEAN00017 in OGS1.0

New model in OGS2.0  DPOGS216059
Genomic Position  scaffold6830:- 1131-2790
CDS Length  966
Paired RNAseq reads  20
Single RNAseq reads  75
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009025 (3e-29)
Best Drosophila hit  gooseberry-neuro (5e-32)
Best Human hit  paired box protein Pax-7 isoform 1 (4e-28)
Best NR hit (blastp)  conserved hypothetical protein [Culex quinquefasciatus] (2e-37)
Best NR hit (blastx)  PREDICTED: similar to gooseberry-neuro CG2692-PA [Tribolium castaneum] (2e-36)
Gene Ontology terms
GO:0003677 DNA binding
GO:0007367 segment polarity determination
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005667 transcription factor complex
GO:0005634 nucleus
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
IPR017970 Homeobox, conserved site
IPR001356 Homeobox
IPR009057 Homeodomain-like
IPR000047 Helix-turn-helix motif, lambda-like repressor
IPR012287 Homeodomain-related
Orthology group  ND

Nucleotide sequence:

ATGTCACCCGAACCCAAAGATGCCGACAAAAGGACCGCCACGTTGGGCCGTGGATCAGAC
TCTTCAGATATAGAATCGGAGCCAGGCCTGACCCTTAAAAGAAAACAGCGTAGATCTCGC
ACCACGTTTACAGGAGAACAGCTTGACGCGCTTGAGAGAGCTTTTCATAGAACACAATAT
CCTGATGTATACACTAGAGAAGAACTTGCTTTGCAGACTGGCCTTACCGAGGCCAGAATA
CAAGTTTGGTTCTCCAATAGAAGAGCTCGACTACGAAAGCATACTGGTTCAAATCCGACT
CCTTCACTCGCTAGTTATTCGACGATACCAATGCCGCAGATACCGTGCCCGTATCCTGCC
GGAGAAATACCTTCACTATCTCAACATCACCCGCAACATCCGGATGCCTGGCATCATCAA
AAGTATGCCAATTATAACCAGCTAATGGCTCAGTCTCAACATCTTAACCAAGCTTTTCAA
ACTGCAGCCTTCCCCAGCACCTCTGGGACTACTTTCAGCCATTTAGTGACCGGTGCTAGC
GCACCAACTCACAGTCAGCTTCTTGATAGCACTCCAAGAACTGATTATCCTCGATATCCC
ACTGATGTCTACAACAAACCCATCAGTTATATGCCTAAAGATACGGAAGCGGAAGATAAG
GGAGTGGGAGAAGAAATTATAGAGCAACGTGAAGAAGCTTACATAAAAACAGGTGGAAAT
GAATACAAAGAATTAGCGACCAGTGATTATCCTAAAGTTCCTACTGATTATTCTAAGCTT
TCTGTTGATCCTTCCTCCACCAACTGGACTGCATCTAATAACTCCTTGAATATGAGTCTA
TCTGGATTATCTAGTGACTATAAATATATGAGTGACCCTTATGCTTTTCCTGCTATCGCG
TCGGATACCCTAAATCAACATACCTACACCAATCCAGGAAATGCAGCCAATAAATACTGG
ATTTGA

Protein sequence:

MSPEPKDADKRTATLGRGSDSSDIESEPGLTLKRKQRRSRTTFTGEQLDALERAFHRTQY
PDVYTREELALQTGLTEARIQVWFSNRRARLRKHTGSNPTPSLASYSTIPMPQIPCPYPA
GEIPSLSQHHPQHPDAWHHQKYANYNQLMAQSQHLNQAFQTAAFPSTSGTTFSHLVTGAS
APTHSQLLDSTPRTDYPRYPTDVYNKPISYMPKDTEAEDKGVGEEIIEQREEAYIKTGGN
EYKELATSDYPKVPTDYSKLSVDPSSTNWTASNNSLNMSLSGLSSDYKYMSDPYAFPAIA
SDTLNQHTYTNPGNAANKYWI
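
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked by translating the CDS. The Python sketch below is one way to do that. It assumes the standard genetic code applies to this nuclear gene, and the helper name `translate` is a hypothetical choice made for this example, not part of the record. Only the first 60 nt line and the final 6 nt are translated here; the full CDS can be passed in the same way. The stated CDS length of 966 nt corresponds to 322 codons, which would give 321 residues plus a stop codon.

```python
# Build the standard genetic code table (NCBI translation table 1).
# Codons are enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the nucleotide sequence in this record (60 nt).
first_line = "ATGTCACCCGAACCCAAAGATGCCGACAAAAGGACCGCCACGTTGGGCCGTGGATCAGAC"
print(translate(first_line))  # MSPEPKDADKRTATLGRGSD

# Last 6 nt of the record: one codon followed by the TGA stop.
print(translate("ATTTGA"))  # I

# The stated CDS length (966 nt) in codons, including the stop codon.
print(966 // 3)  # 322
```

The two translated fragments match the start (MSPEPKDADKRTATLGRGSD) and the final residue (I) of the protein sequence listed in this record.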