New model in OGS2.0 | DPOGS213091  |
---|---|
Genomic Position | scaffold679:+ 32694-47298 |
See gene structure | |
CDS Length | 831 |
Paired RNAseq reads   | 154 |
Single RNAseq reads   | 603 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007623 (1e-15) |
Best Drosophila hit   | BarH1 (4e-42) |
Best Human hit | barH-like 2 homeobox protein (2e-35) |
Best NR hit (blastp)   | B-H1 [Drosophila ananassae] (4e-52) |
Best NR hit (blastx)   | homeobox protein b [Aedes aegypti] (2e-46) |
GeneOntology terms    | GO:0008052 sensory organ boundary specification GO:0008057 eye pigment granule organization GO:0001751 compound eye photoreceptor cell differentiation GO:0005634 nucleus GO:0003704 specific RNA polymerase II transcription factor activity GO:0008407 bristle morphogenesis GO:0007455 eye-antennal disc morphogenesis GO:0007479 leg disc proximal/distal pattern formation GO:0006355 regulation of transcription, DNA-dependent GO:0003700 sequence-specific DNA binding transcription factor activity GO:0003677 DNA binding GO:0043565 sequence-specific DNA binding GO:0016481 negative regulation of transcription |
InterPro families    | IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR000047 Helix-turn-helix motif, lambda-like repressor IPR020479 Homeobox, eukaryotic IPR001356 Homeobox IPR017970 Homeobox, conserved site |
Orthology group | MCL16283 |
Nucleotide sequence:
ATGACCGTCCAACGCGACCAGCGCGAGCGCGCGCCGCGGACCAGGTTCATGATCACGGAC
ATCCTGGACGCGGCGCCCAGGGACCTCAGCGCGCACCGGGACTCGGACTCCGACAGGTCG
GCCACGGACTCCCCAGGTGTCAAAGATGACTCCGACGACGTGTCCAGCAAATCCTGCGGT
GACGCATCTGCATTGGCTAAGAAGCAGCGCAAGGCTAGAACAGCCTTCACGGATCATCAG
CTTCAGACCTTGGAGAAGTCGTTCGAGAGACAAAAATACCTCAGCGTCCAGGATCGAATG
GAGCTAGCTGCTAAACTAGGTCTTACAGATACCCAAGTGAAGACCTGGTATCAGAACAGA
AGAACGAAATGGAAGCGTCAAACGGCCGTTGGACTCGAGTTACTAGCAGAGGCTGGCAAC
TACGCAGCCTTTCAACGTTTGTATGGAGGTTACTGGGCAGGAGTGCCCGCGTATCCAACA
CAGCCTGCCCCTTCTGCTGATTTATACTATCGTCAAGCTGCCGCAACTGCTGCTGCAGCA
GCCTCGGCCTCTGCAAACACATTACAGAAACCATTACCATATCGATTATACCCTGGCGCT
CCAATGGCGGGTGTTCCCCCGTTAGGTTTGGGTCTGCCGGGTCCGTCTGCTCACTTGGGA
TCACTGGGTGCTCCTGGTTTGGGAGCCCTCGGTTATTATGCACAAGCTAGACGCACACCC
TCTCCAGACGTGGATCCTGGAAGCCCAGCACCTCCGCCGCGATCCCCGCGAGAGCAATCC
GTAGAACGACACTCTGACGACGAAGACGACGATGAAACCATACACGTGTAA
Protein sequence:
MTVQRDQRERAPRTRFMITDILDAAPRDLSAHRDSDSDRSATDSPGVKDDSDDVSSKSCG
DASALAKKQRKARTAFTDHQLQTLEKSFERQKYLSVQDRMELAAKLGLTDTQVKTWYQNR
RTKWKRQTAVGLELLAEAGNYAAFQRLYGGYWAGVPAYPTQPAPSADLYYRQAAATAAAA
ASASANTLQKPLPYRLYPGAPMAGVPPLGLGLPGPSAHLGSLGAPGLGALGYYAQARRTP
SPDVDPGSPAPPPRSPREQSVERHSDDEDDDETIHV