New model in OGS2.0 | DPOGS207916  |
---|---|
Genomic Position | scaffold828:- 56565-71387 |
See gene structure | |
CDS Length | 741 |
Paired RNAseq reads   | 336 |
Single RNAseq reads   | 1226 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000091 (4e-64) |
Best Drosophila hit   | sine oculis (3e-94) |
Best Human hit | homeobox protein SIX2 (7e-90) |
Best NR hit (blastp)   | AGAP011695-PA [Anopheles gambiae str. PEST] (3e-103) |
Best NR hit (blastx)   | AGAP011695-PA [Anopheles gambiae str. PEST] (2e-101) |
GeneOntology terms    | GO:0007623 circadian rhythm GO:0045449 regulation of transcription GO:0003702 RNA polymerase II transcription factor activity GO:0005634 nucleus GO:0001744 optic lobe placode formation GO:0048749 compound eye development GO:0001746 Bolwig's organ morphogenesis GO:0003700 sequence-specific DNA binding transcription factor activity GO:0007455 eye-antennal disc morphogenesis GO:0008347 glial cell migration GO:0009649 entrainment of circadian clock GO:0007283 spermatogenesis GO:0006355 regulation of transcription, DNA-dependent GO:0035271 ring gland development GO:0001745 compound eye morphogenesis GO:0005515 protein binding GO:0043565 sequence-specific DNA binding |
InterPro families    | IPR001356 Homeobox IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR017970 Homeobox, conserved site |
Orthology group | MCL12651 |
Nucleotide sequence:
ATGCTGGGGGGGCCGGAGTGGGCCCAGCGGGAGGCCAGCCCTCCCAGGGACCCCCTGCCG
AGCTTCGGCTTCACACAGGAGCAGGTCGCCTGTGTCTGTGAGGTCCTCCAGCAGGCCGGT
AATATTGAACGTCTGGGCAGGTTCCTATGGTCGTTGCCGGCCTGTGAGCGTCTCCACGCT
CACGAATCAGTTCTGAAGGCTAAAGCCATGGTCGCCTTTCACCGCGGTAACTTCAAGGAG
TTGTACAGGTTGCTGGAATCACACAACTTCAGCGCACACAACCACGCCAAGCTTCAAAAC
CTCTGGTTAAAAGCACATTACATGGAGGCTGAACGTCTGAGAGGTCGTCCTCTGGGCGCC
GTGGGGAAGTACAGGGTCAGGCGCAAGTTCCCACTACCGAGGACTATATGGGATGGAGAG
GAAACGTCGTATTGTTTTAAGGAGAAGTCTCGTTCAGTGTTAAGGGACTGGTACCTCCAC
AACCCTTATCCCTCGCCCCGGGAGAAGAGAGAGCTGGCTGAGACCACGGGACTCACCACC
GTTCAGGTGTCAAATTGGTTTAAAAATCGCAGACAACGCGACCGACAGGCCGAGCACAAA
GATAGCGGCGGTCCGGGAGACAAGCAGCTGGACTCTTCCACGGACGACGACAGCGACGCC
CCGCATCCCGCGCCGCACGCGCCGCCGCTCTACCCGCTGTACGAACACCCGCTCGCTCAC
CTACAGTACCATCACTCGTGA
Protein sequence:
MLGGPEWAQREASPPRDPLPSFGFTQEQVACVCEVLQQAGNIERLGRFLWSLPACERLHA
HESVLKAKAMVAFHRGNFKELYRLLESHNFSAHNHAKLQNLWLKAHYMEAERLRGRPLGA
VGKYRVRRKFPLPRTIWDGEETSYCFKEKSRSVLRDWYLHNPYPSPREKRELAETTGLTT
VQVSNWFKNRRQRDRQAEHKDSGGPGDKQLDSSTDDDSDAPHPAPHAPPLYPLYEHPLAH
LQYHHS