New model in OGS2.0 | DPOGS216193  |
---|---|
Genomic Position | scaffold479:+ 56469-66178 |
See gene structure | |
CDS Length | 1347 |
Paired RNAseq reads   | 13 |
Single RNAseq reads   | 57 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004601 (2e-124) |
Best Drosophila hit   | aristaless (5e-24) |
Best Human hit | homeobox protein ARX (2e-23) |
Best NR hit (blastp)   | aristaless-related homeobox protein [Danio rerio] (3e-24) |
Best NR hit (blastx)   | PREDICTED: similar to transcription factor protein [Tribolium castaneum] (4e-24) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0005634 nucleus GO:0003704 specific RNA polymerase II transcription factor activity GO:0007449 proximal/distal pattern formation, imaginal disc GO:0048800 antennal morphogenesis GO:0035015 elongation of arista core GO:0045747 positive regulation of Notch signaling pathway GO:0035218 leg disc development GO:0022416 bristle development GO:0043234 protein complex GO:0016481 negative regulation of transcription GO:0043565 sequence-specific DNA binding GO:0007480 imaginal disc-derived leg morphogenesis |
InterPro families    | IPR001356 Homeobox IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR017970 Homeobox, conserved site |
Orthology group | MCL39872 |
Nucleotide sequence:
ATGGATGGTTTTAACAAGGAGAGACTAGACATAAGATCTGTATGTTCCCCGCGGCGTACA
CAAGGTGTAGCCGATCATTCGAAATACATTACTAGGTTGTACCTTCTAGCGATTAACGAT
ATGGATGAATCTTTAGACGGACGGGAAGCCGGTGCTGGTGCATCTGGATTCGATTCAGCT
GCCATTTTAAACGACGACGTGTCACCAACCACCAGTTTCTGTTACAACATACCAAATCTT
TTTAATGGACTTGGACCGGGATCAGTTCTGTGTGAAAGACATGAGACAGATGTCAACGAA
AGCTATCCAACAAATACGGAACCTTCAGATGAACCAGCTGATCAACAGTATGAAGATGAT
GAAGAGTATTCACCTAATTCTAGTAAGAATCGAACAACTTTCTCGAATGTTCAACTGGAA
CAATTGGAAGCTGCCTTTCATAAGACACACTACCCTGACGTCTTCTTCAGAGAGGAGTTG
GCGATGAGAATTGACCTGACCGAAGCAAGAGTTCAGGTCTGGTTTCAGAACCGTCGGGCT
AAATGGAGGAAACAACAAAAGGCTGGATGTGAGCCTTACCCTCGCTCCACGAGATCCCCT
GACCCTCGGTCTCCTTCAGCTCTCGCCCTTGAAATGAGGGATTTTATTACCATACCAGTA
TCTTCATCTCAAATAGTCAGCCTCGCTGACAACTCCAATCAAAACAGTTACGCTTCCAAG
TCAAAACAATCATCTCCCGCGATACATATATCAGCGTTGCCAACTAGGCAGAATTCTCAG
GTGGATCTACCCTCGTTTTCGATTCCACTGCAATATAACTTAGGATCGTTTAAGAAAATA
GGAGAAGATAACCAGTTAGACTTAGACACTTCGACGTCAGATGTACAAAACCACTGGGAT
ACTCGATTGATGCCGAATTTCAATATAATGAATATAAAACCGATGTTAGAAAATCGAGAA
AATCTCGAAATGTCCGAAGTGAAATATGAAGTGAAAAACGAAAATGTGAGATCATACAAT
GATATGTCAAATTCGATTCCGGCTACGATGGAACATGAAGACATATTGGGCAAAGAAGCC
GTGAGTGAAGGTCAAACTATATCACCGGATTTTGAGATTGGGAGATACCAGAGTTACCAC
AGCGAAAATGAAAAGAGATATAGAGAAACTGGCTTCGATGAATCTATAATAGAGGATGGT
AGAAATGACTTCGGGTGCGACGGTGACTTTCTCTGTAAAAGTAACTATGATAAAATTGAT
GAAAAAAGAAACGACTTTGAAGAATGCCATCTATTGGACGCGGACAGCAGCGCTGTCGGG
ATGAGTAATTTTGAAAGCAATCTTTAA
Protein sequence:
MDGFNKERLDIRSVCSPRRTQGVADHSKYITRLYLLAINDMDESLDGREAGAGASGFDSA
AILNDDVSPTTSFCYNIPNLFNGLGPGSVLCERHETDVNESYPTNTEPSDEPADQQYEDD
EEYSPNSSKNRTTFSNVQLEQLEAAFHKTHYPDVFFREELAMRIDLTEARVQVWFQNRRA
KWRKQQKAGCEPYPRSTRSPDPRSPSALALEMRDFITIPVSSSQIVSLADNSNQNSYASK
SKQSSPAIHISALPTRQNSQVDLPSFSIPLQYNLGSFKKIGEDNQLDLDTSTSDVQNHWD
TRLMPNFNIMNIKPMLENRENLEMSEVKYEVKNENVRSYNDMSNSIPATMEHEDILGKEA
VSEGQTISPDFEIGRYQSYHSENEKRYRETGFDESIIEDGRNDFGCDGDFLCKSNYDKID
EKRNDFEECHLLDADSSAVGMSNFESNL