DPGLEAN18712 in OGS1.0

New model in OGS2.0DPOGS215543 
Genomic Positionscaffold227:+ 50183-62153
See gene structure
CDS Length888
Paired RNAseq reads  13
Single RNAseq reads  42
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002297 (3e-57)
Best Drosophila hit  C15 (7e-41)
Best Human hitT-cell leukemia homeobox protein 3 (1e-34)
Best NR hit (blastp)  T-cell leukemia homeobox protein, putative [Pediculus humanus corporis] (2e-53)
Best NR hit (blastx)  T-cell leukemia homeobox protein, putative [Pediculus humanus corporis] (9e-51)
GeneOntology terms












  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0007501 mesodermal cell fate specification
GO:0035015 elongation of arista core
GO:0045747 positive regulation of Notch signaling pathway
GO:0035218 leg disc development
GO:0007480 imaginal disc-derived leg morphogenesis
GO:0048800 antennal morphogenesis
GO:0045941 positive regulation of transcription
GO:0016481 negative regulation of transcription
GO:0043565 sequence-specific DNA binding
GO:0022416 bristle development
GO:0043234 protein complex
InterPro families


  
IPR001356 Homeobox
IPR020479 Homeobox, eukaryotic
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
Orthology groupMCL11873

Nucleotide sequence:

ATGTCGTCTGTGCATTCTGATGATGATCAGGAGGAGATAAATGTTGACTCTGACAGTAGA
CTATCAGGTCAATACGAGAGAAGCATCGCTTCAGATAGAGATTCGATACATAGTTACAAC
GAGAGATACAGAGAATCACCCCCCGACCATCTGACTGATACCCAGCAGAGGGTAAATTTA
CCTTTCAGCATATCACGTTTGTTGGGTAAACAGTTCGAGGATATTGATAAGAGGAATTCA
GGAGAGAGTGATGATAGAAGCGACCAAGATACGGAGTCTGAAGGAGCGAAGGACGATTCT
AAGGAAGCCGGTCTAGCGTTGAACTTCAGCCATAACCCGGGATTATATCCTAACTCGAGT
TTGTTGTTGCGGCCCGGTTTAAATCTGGGGGCTGGTTATGGATTCCCGGGTAATCCGGGA
GTTGTCAGGGTGCCAGCACATAGAGCTCTGGGGGCTTTGGGAGCGTGGGGCGTAGCCCTT
GATCCGATGAGACAAGCGGCTGCTGCAGCCTTCGCCCATCAAGTTGTGAAAGACAGATTA
AACGCATCATTTCCTATAACGAGACGAATTGGTCACCCGTATCAGAACAGAACACCTCCT
AAGAGAAAGAAACCTCGAACATCATTCACTCGGATGCAGATAGCTGAACTGGAGAAGAGA
TTCCACAAACAGAAATACCTCGCATCAGCTGAACGCGCCTCGCTAGCCAAAACCTTGAAA
ATGACGGACGCCCAAGTGAAAACATGGTTCCAGAACCGACGAACGAAATGGAGTCTCGAT
GGTGGTGGCATCACCGATTACCGTCTAATGGACAAGCGGTGTTTGTGTTCGGGGCCGATA
ACACGCGATACGCACGGCGCCATGCCACCGTTGCCAACAAAAAACTAA

Protein sequence:

MSSVHSDDDQEEINVDSDSRLSGQYERSIASDRDSIHSYNERYRESPPDHLTDTQQRVNL
PFSISRLLGKQFEDIDKRNSGESDDRSDQDTESEGAKDDSKEAGLALNFSHNPGLYPNSS
LLLRPGLNLGAGYGFPGNPGVVRVPAHRALGALGAWGVALDPMRQAAAAAFAHQVVKDRL
NASFPITRRIGHPYQNRTPPKRKKPRTSFTRMQIAELEKRFHKQKYLASAERASLAKTLK
MTDAQVKTWFQNRRTKWSLDGGGITDYRLMDKRCLCSGPITRDTHGAMPPLPTKN