DPGLEAN03504 in OGS1.0

New model in OGS2.0DPOGS207623 
Genomic Positionscaffold419:+ 14137-17168
See gene structure
CDS Length777
Paired RNAseq reads  75
Single RNAseq reads  280
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014549 (5e-37)
Best Drosophila hit  aristaless (8e-43)
Best Human hithomeobox protein ARX (6e-28)
Best NR hit (blastp)  paired-like family homeodomain transcription factor [Heliconius erato] (5e-116)
Best NR hit (blastx)  paired-like family homeodomain transcription factor [Heliconius erato] (3e-92)
GeneOntology terms












  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007449 proximal/distal pattern formation, imaginal disc
GO:0048800 antennal morphogenesis
GO:0035015 elongation of arista core
GO:0045747 positive regulation of Notch signaling pathway
GO:0035218 leg disc development
GO:0022416 bristle development
GO:0043234 protein complex
GO:0016481 negative regulation of transcription
GO:0043565 sequence-specific DNA binding
GO:0007480 imaginal disc-derived leg morphogenesis
InterPro families




  
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR003654 Paired-like homeodomain protein, OAR
IPR017970 Homeobox, conserved site
IPR012287 Homeodomain-related
IPR000047 Helix-turn-helix motif, lambda-like repressor
Orthology groupMCL12331

Nucleotide sequence:

ATGGGAGTGTCTGATACCGGTTCATCCGCTACTCCTGAACTACCAGTCCACGATATCGAT
CGGCCAGGGTCGGGTAGTGGAGTCGATGACGAAGACATCCCGAGGAGGAAACAGAGGAGG
TACAGAACGACCTTCACCAGCTACCAACTAGATGAACTGGAGAAAGCCTTCGGAAGAACT
CACTATCCAGATGTTTTTACAAGGGAGGAATTGGCTCTCAAAATTGGACTCACTGAAGCA
AGAATACAGGTGTGGTTTCAAAACCGGAGAGCAAAATGGCGCAAGCAAGAAAAGGTGGGT
CCCCACGCTCATCCCTACGGCGGATACTTGGGAGGACAGCCTTTGCCAACAGCCGCAATG
CCAGTATCGCCACACTCACTGACACAACTTGGCTTCGGATTGAGGAAGCCTTTCGACAGC
TCCTTGGCTACATTCAGGTATGCCAGTAGTCCACTGTTTGGGACGCAATACCTACCACCG
CTGACCCGGCCTCATCTATTCGGTGCTCCGTTGTACGCCACCTCGCCAGCTCATTTTCAT
TCTCTTTTCGCTAACCTAACCGCACCAGAACCACCGCGCGCATCACCCGAACATTCCCGA
TTATCCCCCGAGGTCACCCGATCTCCATCTCTTTCTCCCCCCATCTCCCCCGGTAGTGAA
ACTCTCCCCCACGTTGAAGATGTTAGAAGTTCCAGTATAGCAGCCTTAAGGCTAGCTGCG
AGAGAACACGAATTAAGATTAGAACTGTTGCGGCAAAGAGCCGATTTAATTTGTTAG

Protein sequence:

MGVSDTGSSATPELPVHDIDRPGSGSGVDDEDIPRRKQRRYRTTFTSYQLDELEKAFGRT
HYPDVFTREELALKIGLTEARIQVWFQNRRAKWRKQEKVGPHAHPYGGYLGGQPLPTAAM
PVSPHSLTQLGFGLRKPFDSSLATFRYASSPLFGTQYLPPLTRPHLFGAPLYATSPAHFH
SLFANLTAPEPPRASPEHSRLSPEVTRSPSLSPPISPGSETLPHVEDVRSSSIAALRLAA
REHELRLELLRQRADLIC