DPGLEAN00908 in OGS1.0

New model in OGS2.0DPOGS216135 
Genomic Positionscaffold1030:- 28638-52602
See gene structure
CDS Length1230
Paired RNAseq reads  55
Single RNAseq reads  189
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009432 (2e-52)
Best Drosophila hit  twin of eyeless (3e-100)
Best Human hitpaired box protein Pax-6 isoform a (2e-87)
Best NR hit (blastp)  twin of eyeless [Tribolium castaneum] (1e-158)
Best NR hit (blastx)  twin of eyeless [Tribolium castaneum] (1e-136)
GeneOntology terms






  
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0045449 regulation of transcription
GO:0035214 eye-antennal disc development
GO:0048749 compound eye development
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
InterPro families




  
IPR009057 Homeodomain-like
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR012287 Homeodomain-related
IPR001523 Paired box protein, N-terminal
IPR001356 Homeobox
IPR017970 Homeobox, conserved site
Orthology groupMCL14019

Nucleotide sequence:

ATGGCGGACGAGCTGATGCATAGCGCGGCGATGGGTGGTGGAGCGCTGTTCGGATGCTCG
TCTGCAGGGCACAGCGGCATCAACCAGCTCGGAGGGGTCTATGTGAACGGCAGACCGCTC
CCGGACTCCACTCTCCAAGATACTGGGAAGAACGATTCGTCAAAAAATACCTCTATCCAC
AGATATTACGAGACTGGTTCCATCAAACCTAGAGCGATCGGTGGTTCGAAGCCAAGAGTG
GCGACCACTCCCGTGGTCCAGAAGATAGCTGACTACAAGAGAGAATGTCCATCCATCTTC
GCCTGGGAGATAAGGGACCGTCTGCTCAGCGAGAACGTCTGCAACAATGATAATATACCA
AGCGTGTCATCAATAAACCGTGTGCTGCGTAATCTCGCCTCTCAGAAGGAGCAGGCAGCG
TCAGCACAGAACGACAGCGTTTACGAGAAGCTGAGAATGTTCAACGGCCAGGCGGCCACG
GGTTGGTGGTACCCAGGGTTACCGACCGCACCAGCACCAACCATACCCGCGCCGATACCG
CAACAGCTGAACAGACCGGAGGAACATAAACGAGCAGATACGCTGCAATCGGAGGCTGGG
TCTGATGGGAACAGCGAGCACGCGTCGTCTGGAGATGAAGACTCGCAAATGAGGCTGAGG
CTGAAGAGGAAGCTGCAAAGGAACAGAACGTCCTTCACAAACGATCAGATAGATAGTCTC
GAAAAAGAGTTCGAGCGCACTCACTACCCGGATGTTTTCGCGCGGGAACGACTGGCGGAA
AAGATCGGATTACCTGAGGCACGTATCCAGGTGTGGTTTTCAAACCGTCGAGCTAAGTGG
CGTCGTGAGGAGAAGCTTAGGAGCCAAAGAAGAGACGCGCCCGCGTCGCCCGCGCCTCCG
GCTAGGCTGCCGTTGAATGGCGGGTTCAACTCCATGTACAGCCCCATACCACAACCTATC
GCCACCATGACTGATACATATAGTTCGATGTCGTCCGGTCTGTCGTCCTCGTGTCTCCAG
CAACGTGACGGTGGGTATCCGTACATGTTCGGGGACGTCCTCTCGGGCGGCGGGTACAGA
GCGCCCGCGGCACACCAGCAACACGCCGCGTACAGCCAGCCACAGAGCGCGGGCAGCACC
GGTGTGATATCGGCGGGTGTGAGCGTCCCCGTCCAAATACCTTCTCAGGGGCCGGACCTC
GCGTCGAATTACTGGGGTAGGCTTCAGTGA

Protein sequence:

MADELMHSAAMGGGALFGCSSAGHSGINQLGGVYVNGRPLPDSTLQDTGKNDSSKNTSIH
RYYETGSIKPRAIGGSKPRVATTPVVQKIADYKRECPSIFAWEIRDRLLSENVCNNDNIP
SVSSINRVLRNLASQKEQAASAQNDSVYEKLRMFNGQAATGWWYPGLPTAPAPTIPAPIP
QQLNRPEEHKRADTLQSEAGSDGNSEHASSGDEDSQMRLRLKRKLQRNRTSFTNDQIDSL
EKEFERTHYPDVFARERLAEKIGLPEARIQVWFSNRRAKWRREEKLRSQRRDAPASPAPP
ARLPLNGGFNSMYSPIPQPIATMTDTYSSMSSGLSSSCLQQRDGGYPYMFGDVLSGGGYR
APAAHQQHAAYSQPQSAGSTGVISAGVSVPVQIPSQGPDLASNYWGRLQ