DPGLEAN10223 in OGS1.0

New model in OGS2.0DPOGS207916 
Genomic Positionscaffold828:- 56565-71387
See gene structure
CDS Length741
Paired RNAseq reads  336
Single RNAseq reads  1226
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000091 (4e-64)
Best Drosophila hit  sine oculis (3e-94)
Best Human hithomeobox protein SIX2 (7e-90)
Best NR hit (blastp)  AGAP011695-PA [Anopheles gambiae str. PEST] (3e-103)
Best NR hit (blastx)  AGAP011695-PA [Anopheles gambiae str. PEST] (2e-101)
GeneOntology terms















  
GO:0007623 circadian rhythm
GO:0045449 regulation of transcription
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0001744 optic lobe placode formation
GO:0048749 compound eye development
GO:0001746 Bolwig's organ morphogenesis
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007455 eye-antennal disc morphogenesis
GO:0008347 glial cell migration
GO:0009649 entrainment of circadian clock
GO:0007283 spermatogenesis
GO:0006355 regulation of transcription, DNA-dependent
GO:0035271 ring gland development
GO:0001745 compound eye morphogenesis
GO:0005515 protein binding
GO:0043565 sequence-specific DNA binding
InterPro families


  
IPR001356 Homeobox
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
Orthology groupMCL12651

Nucleotide sequence:

ATGCTGGGGGGGCCGGAGTGGGCCCAGCGGGAGGCCAGCCCTCCCAGGGACCCCCTGCCG
AGCTTCGGCTTCACACAGGAGCAGGTCGCCTGTGTCTGTGAGGTCCTCCAGCAGGCCGGT
AATATTGAACGTCTGGGCAGGTTCCTATGGTCGTTGCCGGCCTGTGAGCGTCTCCACGCT
CACGAATCAGTTCTGAAGGCTAAAGCCATGGTCGCCTTTCACCGCGGTAACTTCAAGGAG
TTGTACAGGTTGCTGGAATCACACAACTTCAGCGCACACAACCACGCCAAGCTTCAAAAC
CTCTGGTTAAAAGCACATTACATGGAGGCTGAACGTCTGAGAGGTCGTCCTCTGGGCGCC
GTGGGGAAGTACAGGGTCAGGCGCAAGTTCCCACTACCGAGGACTATATGGGATGGAGAG
GAAACGTCGTATTGTTTTAAGGAGAAGTCTCGTTCAGTGTTAAGGGACTGGTACCTCCAC
AACCCTTATCCCTCGCCCCGGGAGAAGAGAGAGCTGGCTGAGACCACGGGACTCACCACC
GTTCAGGTGTCAAATTGGTTTAAAAATCGCAGACAACGCGACCGACAGGCCGAGCACAAA
GATAGCGGCGGTCCGGGAGACAAGCAGCTGGACTCTTCCACGGACGACGACAGCGACGCC
CCGCATCCCGCGCCGCACGCGCCGCCGCTCTACCCGCTGTACGAACACCCGCTCGCTCAC
CTACAGTACCATCACTCGTGA

Protein sequence:

MLGGPEWAQREASPPRDPLPSFGFTQEQVACVCEVLQQAGNIERLGRFLWSLPACERLHA
HESVLKAKAMVAFHRGNFKELYRLLESHNFSAHNHAKLQNLWLKAHYMEAERLRGRPLGA
VGKYRVRRKFPLPRTIWDGEETSYCFKEKSRSVLRDWYLHNPYPSPREKRELAETTGLTT
VQVSNWFKNRRQRDRQAEHKDSGGPGDKQLDSSTDDDSDAPHPAPHAPPLYPLYEHPLAH
LQYHHS