DPGLEAN18651 in OGS1.0

New model in OGS2.0  DPOGS203984
Genomic Position  scaffold2:+ 988459-1023119
CDS Length  1194
Paired RNAseq reads  114
Single RNAseq reads  405
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002126 (4e-35)
Best Drosophila hit  apterous, isoform A (1e-76)
Best Human hit  LIM/homeobox protein Lhx2 (3e-69)
Best NR hit (blastp)  apterous a [Tribolium castaneum] (9e-111)
Best NR hit (blastx)  apterous a [Tribolium castaneum] (1e-95)
GeneOntology terms
GO:0007559 histolysis
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0007472 wing disc morphogenesis
GO:0008270 zinc ion binding
GO:0007481 haltere disc morphogenesis
GO:0007399 nervous system development
GO:0007411 axon guidance
GO:0006350 transcription
GO:0007517 muscle organ development
GO:0007479 leg disc proximal/distal pattern formation
GO:0045165 cell fate commitment
GO:0007451 dorsal/ventral lineage restriction, imaginal disc
GO:0048190 wing disc dorsal/ventral pattern formation
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0007450 dorsal/ventral pattern formation, imaginal disc
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0035286 leg segmentation
GO:0035218 leg disc development
InterPro families
IPR001781 Zinc finger, LIM-type
IPR001356 Homeobox
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
IPR017970 Homeobox, conserved site
Orthology group  MCL10320

Nucleotide sequence:

ATGGGAGTTTACGAAGAGAGAGGGGCGATGCACTGGCAACAGAATGAGCGATATCTCTCC
ACGTACGAGACGGGGTCAGAGTTGTCGCCTGTTGCCCCAGCAGCGTCGCCGGGATCACCC
AGAGACTGCACCTCGTGTCGCAAGCGAGAACCTCCAGATGAACCCGCTCCACCCGCTGAG
GATGCTTGCGCTGGCTGCGGAGCACGAATAACTGATAGATACTACCTTCTAGCGCTGGAG
CGGCGCTGGCACACCCCATGCCTCAGGTGCTGTGAATGCAAGATGCCTCTCGACTCTGAA
CAGAGATGTTATGCTCGTGACAGCAATATATTTTGCAAGAATGACTACTTCAGGTTGTAC
GGTTCAAAGCGGTGTTCTCGTTGCAACACGACCATTTCAGCATCAGAATTGGTGATGAGA
GCGCGCGACTTGGTCTTTCACGTCCACTGTTTCTCCTGTGCACTCTGCAGCGCCCGACTC
ACAAAAGGCGACACATTCGGCATCAGGGATTCAGCTGTTTATTGCAGGCTACACTACGAA
ACTATGCCGGATTATGCTCCCCATATGTCTGTACCGGGGCCTCCACAGATGTGTCCAGGT
CCTTACGCCGGCCCTCCACCGGGTTCGCACTACCCACCATACCCCTCTCCTGAGTTCTCC
CGAGTGGAGCCCGATGTCCCCAAAGGCTCTTTCTTCAACGGGGTATCAGCTCCTCCGCCG
AGACAAAAAGGCCGTCCGAGAAAAAAGAAGCCTAAAGACCAAGATTTAATGACAGCAAAT
CTTGATCTCAACCCCGACTACTTGGAGATGGGTTTCCGGGGCGGCGGCGGGCTGGGATCC
ACATCACGCACCAAGCGCATGCGCACCAGCTTCAAGCACCACCAGTTACGCACCATGAAG
TCATACTTCGCCATCAACCATAACCCAGACGCAAAAGACCTGAAGCAATTGAGCCAGAAG
ACTGGCCTTCCCAAAAGGGTGTTACAGGTATGGTTTCAAAATGCGCGGGCAAAATGGCGG
CGTATGGTGACAAAGCAAGAGAACAAGATGACGGACAAATGTTCTCCAGACGGCTCCTTG
GAGATGGACATGTACCACGGACCGATGGGTTCCATACAATCCTTACCCCCGCACAGCCCA
CCCTACAGCGTGATGGGAGGCCCCCCGAGCCCGAACTCGATGGACTGTCCGTAG

Protein sequence:

MGVYEERGAMHWQQNERYLSTYETGSELSPVAPAASPGSPRDCTSCRKREPPDEPAPPAE
DACAGCGARITDRYYLLALERRWHTPCLRCCECKMPLDSEQRCYARDSNIFCKNDYFRLY
GSKRCSRCNTTISASELVMRARDLVFHVHCFSCALCSARLTKGDTFGIRDSAVYCRLHYE
TMPDYAPHMSVPGPPQMCPGPYAGPPPGSHYPPYPSPEFSRVEPDVPKGSFFNGVSAPPP
RQKGRPRKKKPKDQDLMTANLDLNPDYLEMGFRGGGGLGSTSRTKRMRTSFKHHQLRTMK
SYFAINHNPDAKDLKQLSQKTGLPKRVLQVWFQNARAKWRRMVTKQENKMTDKCSPDGSL
EMDMYHGPMGSIQSLPPHSPPYSVMGGPPSPNSMDCP
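
Consistency check (illustrative):

The CDS length, nucleotide sequence and protein sequence in this record can be checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here and are not part of the database. Only the first six and last four codons of the nucleotide sequence above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order:
# first base varies slowest, third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the nucleotide sequence in this record.
print(translate("ATGGGAGTTTACGAAGAG"))  # MGVYEE
# Last four codons, ending in the TAG stop codon.
print(translate("GACTGTCCGTAG"))        # DCP
# A 1194 nt CDS holds 398 codons: 397 residues plus one stop codon.
print(1194 // 3 - 1)                    # 397
```

Passing the full nucleotide sequence to `translate` should, under the same assumption, reproduce the 397-residue protein sequence listed above.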