DPGLEAN22302 in OGS1.0

New model in OGS2.0DPOGS213091 
Genomic Positionscaffold679:+ 32694-47298
See gene structure
CDS Length831
Paired RNAseq reads  154
Single RNAseq reads  603
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007623 (1e-15)
Best Drosophila hit  BarH1 (4e-42)
Best Human hitbarH-like 2 homeobox protein (2e-35)
Best NR hit (blastp)  B-H1 [Drosophila ananassae] (4e-52)
Best NR hit (blastx)  homeobox protein b [Aedes aegypti] (2e-46)
GeneOntology terms











  
GO:0008052 sensory organ boundary specification
GO:0008057 eye pigment granule organization
GO:0001751 compound eye photoreceptor cell differentiation
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0008407 bristle morphogenesis
GO:0007455 eye-antennal disc morphogenesis
GO:0007479 leg disc proximal/distal pattern formation
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0003677 DNA binding
GO:0043565 sequence-specific DNA binding
GO:0016481 negative regulation of transcription
InterPro families




  
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
IPR000047 Helix-turn-helix motif, lambda-like repressor
IPR020479 Homeobox, eukaryotic
IPR001356 Homeobox
IPR017970 Homeobox, conserved site
Orthology groupMCL16283

Nucleotide sequence:

ATGACCGTCCAACGCGACCAGCGCGAGCGCGCGCCGCGGACCAGGTTCATGATCACGGAC
ATCCTGGACGCGGCGCCCAGGGACCTCAGCGCGCACCGGGACTCGGACTCCGACAGGTCG
GCCACGGACTCCCCAGGTGTCAAAGATGACTCCGACGACGTGTCCAGCAAATCCTGCGGT
GACGCATCTGCATTGGCTAAGAAGCAGCGCAAGGCTAGAACAGCCTTCACGGATCATCAG
CTTCAGACCTTGGAGAAGTCGTTCGAGAGACAAAAATACCTCAGCGTCCAGGATCGAATG
GAGCTAGCTGCTAAACTAGGTCTTACAGATACCCAAGTGAAGACCTGGTATCAGAACAGA
AGAACGAAATGGAAGCGTCAAACGGCCGTTGGACTCGAGTTACTAGCAGAGGCTGGCAAC
TACGCAGCCTTTCAACGTTTGTATGGAGGTTACTGGGCAGGAGTGCCCGCGTATCCAACA
CAGCCTGCCCCTTCTGCTGATTTATACTATCGTCAAGCTGCCGCAACTGCTGCTGCAGCA
GCCTCGGCCTCTGCAAACACATTACAGAAACCATTACCATATCGATTATACCCTGGCGCT
CCAATGGCGGGTGTTCCCCCGTTAGGTTTGGGTCTGCCGGGTCCGTCTGCTCACTTGGGA
TCACTGGGTGCTCCTGGTTTGGGAGCCCTCGGTTATTATGCACAAGCTAGACGCACACCC
TCTCCAGACGTGGATCCTGGAAGCCCAGCACCTCCGCCGCGATCCCCGCGAGAGCAATCC
GTAGAACGACACTCTGACGACGAAGACGACGATGAAACCATACACGTGTAA

Protein sequence:

MTVQRDQRERAPRTRFMITDILDAAPRDLSAHRDSDSDRSATDSPGVKDDSDDVSSKSCG
DASALAKKQRKARTAFTDHQLQTLEKSFERQKYLSVQDRMELAAKLGLTDTQVKTWYQNR
RTKWKRQTAVGLELLAEAGNYAAFQRLYGGYWAGVPAYPTQPAPSADLYYRQAAATAAAA
ASASANTLQKPLPYRLYPGAPMAGVPPLGLGLPGPSAHLGSLGAPGLGALGYYAQARRTP
SPDVDPGSPAPPPRSPREQSVERHSDDEDDDETIHV