New model in OGS2.0 | DPOGS203984  |
---|---|
Genomic Position | scaffold2:+ 988459-1023119 |
See gene structure | |
CDS Length | 1194 |
Paired RNAseq reads   | 114 |
Single RNAseq reads   | 405 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002126 (4e-35) |
Best Drosophila hit   | apterous, isoform A (1e-76) |
Best Human hit | LIM/homeobox protein Lhx2 (3e-69) |
Best NR hit (blastp)   | apterous a [Tribolium castaneum] (9e-111) |
Best NR hit (blastx)   | apterous a [Tribolium castaneum] (1e-95) |
GeneOntology terms    | GO:0007559 histolysis GO:0003704 specific RNA polymerase II transcription factor activity GO:0005634 nucleus GO:0007472 wing disc morphogenesis GO:0008270 zinc ion binding GO:0007481 haltere disc morphogenesis GO:0007399 nervous system development GO:0007411 axon guidance GO:0006350 transcription GO:0007517 muscle organ development GO:0007479 leg disc proximal/distal pattern formation GO:0045165 cell fate commitment GO:0007451 dorsal/ventral lineage restriction, imaginal disc GO:0048190 wing disc dorsal/ventral pattern formation GO:0007476 imaginal disc-derived wing morphogenesis GO:0007450 dorsal/ventral pattern formation, imaginal disc GO:0006355 regulation of transcription, DNA-dependent GO:0043565 sequence-specific DNA binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0035286 leg segmentation GO:0035218 leg disc development |
InterPro families    | IPR001781 Zinc finger, LIM-type IPR001356 Homeobox IPR009057 Homeodomain-like IPR012287 Homeodomain-related IPR017970 Homeobox, conserved site |
Orthology group | MCL10320 |
Nucleotide sequence:
ATGGGAGTTTACGAAGAGAGAGGGGCGATGCACTGGCAACAGAATGAGCGATATCTCTCC
ACGTACGAGACGGGGTCAGAGTTGTCGCCTGTTGCCCCAGCAGCGTCGCCGGGATCACCC
AGAGACTGCACCTCGTGTCGCAAGCGAGAACCTCCAGATGAACCCGCTCCACCCGCTGAG
GATGCTTGCGCTGGCTGCGGAGCACGAATAACTGATAGATACTACCTTCTAGCGCTGGAG
CGGCGCTGGCACACCCCATGCCTCAGGTGCTGTGAATGCAAGATGCCTCTCGACTCTGAA
CAGAGATGTTATGCTCGTGACAGCAATATATTTTGCAAGAATGACTACTTCAGGTTGTAC
GGTTCAAAGCGGTGTTCTCGTTGCAACACGACCATTTCAGCATCAGAATTGGTGATGAGA
GCGCGCGACTTGGTCTTTCACGTCCACTGTTTCTCCTGTGCACTCTGCAGCGCCCGACTC
ACAAAAGGCGACACATTCGGCATCAGGGATTCAGCTGTTTATTGCAGGCTACACTACGAA
ACTATGCCGGATTATGCTCCCCATATGTCTGTACCGGGGCCTCCACAGATGTGTCCAGGT
CCTTACGCCGGCCCTCCACCGGGTTCGCACTACCCACCATACCCCTCTCCTGAGTTCTCC
CGAGTGGAGCCCGATGTCCCCAAAGGCTCTTTCTTCAACGGGGTATCAGCTCCTCCGCCG
AGACAAAAAGGCCGTCCGAGAAAAAAGAAGCCTAAAGACCAAGATTTAATGACAGCAAAT
CTTGATCTCAACCCCGACTACTTGGAGATGGGTTTCCGGGGCGGCGGCGGGCTGGGATCC
ACATCACGCACCAAGCGCATGCGCACCAGCTTCAAGCACCACCAGTTACGCACCATGAAG
TCATACTTCGCCATCAACCATAACCCAGACGCAAAAGACCTGAAGCAATTGAGCCAGAAG
ACTGGCCTTCCCAAAAGGGTGTTACAGGTATGGTTTCAAAATGCGCGGGCAAAATGGCGG
CGTATGGTGACAAAGCAAGAGAACAAGATGACGGACAAATGTTCTCCAGACGGCTCCTTG
GAGATGGACATGTACCACGGACCGATGGGTTCCATACAATCCTTACCCCCGCACAGCCCA
CCCTACAGCGTGATGGGAGGCCCCCCGAGCCCGAACTCGATGGACTGTCCGTAG
Protein sequence:
MGVYEERGAMHWQQNERYLSTYETGSELSPVAPAASPGSPRDCTSCRKREPPDEPAPPAE
DACAGCGARITDRYYLLALERRWHTPCLRCCECKMPLDSEQRCYARDSNIFCKNDYFRLY
GSKRCSRCNTTISASELVMRARDLVFHVHCFSCALCSARLTKGDTFGIRDSAVYCRLHYE
TMPDYAPHMSVPGPPQMCPGPYAGPPPGSHYPPYPSPEFSRVEPDVPKGSFFNGVSAPPP
RQKGRPRKKKPKDQDLMTANLDLNPDYLEMGFRGGGGLGSTSRTKRMRTSFKHHQLRTMK
SYFAINHNPDAKDLKQLSQKTGLPKRVLQVWFQNARAKWRRMVTKQENKMTDKCSPDGSL
EMDMYHGPMGSIQSLPPHSPPYSVMGGPPSPNSMDCP