New model in OGS2.0 | DPOGS215543  |
---|---|
Genomic Position | scaffold227:+ 50183-62153 |
See gene structure | |
CDS Length | 888 |
Paired RNAseq reads   | 13 |
Single RNAseq reads   | 42 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002297 (3e-57) |
Best Drosophila hit   | C15 (7e-41) |
Best Human hit | T-cell leukemia homeobox protein 3 (1e-34) |
Best NR hit (blastp)   | T-cell leukemia homeobox protein, putative [Pediculus humanus corporis] (2e-53) |
Best NR hit (blastx)   | T-cell leukemia homeobox protein, putative [Pediculus humanus corporis] (9e-51) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0007501 mesodermal cell fate specification GO:0035015 elongation of arista core GO:0045747 positive regulation of Notch signaling pathway GO:0035218 leg disc development GO:0007480 imaginal disc-derived leg morphogenesis GO:0048800 antennal morphogenesis GO:0045941 positive regulation of transcription GO:0016481 negative regulation of transcription GO:0043565 sequence-specific DNA binding GO:0022416 bristle development GO:0043234 protein complex |
InterPro families    | IPR001356 Homeobox IPR020479 Homeobox, eukaryotic IPR009057 Homeodomain-like IPR012287 Homeodomain-related |
Orthology group | MCL11873 |
Nucleotide sequence:
ATGTCGTCTGTGCATTCTGATGATGATCAGGAGGAGATAAATGTTGACTCTGACAGTAGA
CTATCAGGTCAATACGAGAGAAGCATCGCTTCAGATAGAGATTCGATACATAGTTACAAC
GAGAGATACAGAGAATCACCCCCCGACCATCTGACTGATACCCAGCAGAGGGTAAATTTA
CCTTTCAGCATATCACGTTTGTTGGGTAAACAGTTCGAGGATATTGATAAGAGGAATTCA
GGAGAGAGTGATGATAGAAGCGACCAAGATACGGAGTCTGAAGGAGCGAAGGACGATTCT
AAGGAAGCCGGTCTAGCGTTGAACTTCAGCCATAACCCGGGATTATATCCTAACTCGAGT
TTGTTGTTGCGGCCCGGTTTAAATCTGGGGGCTGGTTATGGATTCCCGGGTAATCCGGGA
GTTGTCAGGGTGCCAGCACATAGAGCTCTGGGGGCTTTGGGAGCGTGGGGCGTAGCCCTT
GATCCGATGAGACAAGCGGCTGCTGCAGCCTTCGCCCATCAAGTTGTGAAAGACAGATTA
AACGCATCATTTCCTATAACGAGACGAATTGGTCACCCGTATCAGAACAGAACACCTCCT
AAGAGAAAGAAACCTCGAACATCATTCACTCGGATGCAGATAGCTGAACTGGAGAAGAGA
TTCCACAAACAGAAATACCTCGCATCAGCTGAACGCGCCTCGCTAGCCAAAACCTTGAAA
ATGACGGACGCCCAAGTGAAAACATGGTTCCAGAACCGACGAACGAAATGGAGTCTCGAT
GGTGGTGGCATCACCGATTACCGTCTAATGGACAAGCGGTGTTTGTGTTCGGGGCCGATA
ACACGCGATACGCACGGCGCCATGCCACCGTTGCCAACAAAAAACTAA
Protein sequence:
MSSVHSDDDQEEINVDSDSRLSGQYERSIASDRDSIHSYNERYRESPPDHLTDTQQRVNL
PFSISRLLGKQFEDIDKRNSGESDDRSDQDTESEGAKDDSKEAGLALNFSHNPGLYPNSS
LLLRPGLNLGAGYGFPGNPGVVRVPAHRALGALGAWGVALDPMRQAAAAAFAHQVVKDRL
NASFPITRRIGHPYQNRTPPKRKKPRTSFTRMQIAELEKRFHKQKYLASAERASLAKTLK
MTDAQVKTWFQNRRTKWSLDGGGITDYRLMDKRCLCSGPITRDTHGAMPPLPTKN