DPGLEAN03505 in OGS1.0

New model in OGS2.0DPOGS207625 
Genomic Positionscaffold419:+ 31394-70789
See gene structure
CDS Length1026
Paired RNAseq reads  303
Single RNAseq reads  872
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006008 (3e-53)
Best Drosophila hit  aristaless (7e-48)
Best Human hithomeobox protein ARX (3e-30)
Best NR hit (blastp)  paired-like family homeodomain transcription factor [Heliconius erato] (7e-119)
Best NR hit (blastx)  paired-like family homeodomain transcription factor [Junonia coenia] (3e-95)
GeneOntology terms












  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007449 proximal/distal pattern formation, imaginal disc
GO:0048800 antennal morphogenesis
GO:0035015 elongation of arista core
GO:0045747 positive regulation of Notch signaling pathway
GO:0035218 leg disc development
GO:0022416 bristle development
GO:0043234 protein complex
GO:0016481 negative regulation of transcription
GO:0043565 sequence-specific DNA binding
GO:0007480 imaginal disc-derived leg morphogenesis
InterPro families




  
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR003654 Paired-like homeodomain protein, OAR
IPR017970 Homeobox, conserved site
IPR012287 Homeodomain-related
IPR000047 Helix-turn-helix motif, lambda-like repressor
Orthology groupMCL12331

Nucleotide sequence:

ATGGACCTGACAACGAGGCGCTGTGAGGACGGGGAGGTTGTGAACGGGAAGGAACAGTGC
CGGGGTTTTAATGGGACGAGTGACGTCACGGGCGTGATCGAAGTGTCGCGGAAGAGCAGC
TCCTTCAGCATCAGAAGTCTCGTCGGCGGAGAGGACTCCGACCGCCCAGCCGACGGACAT
GTTAACACACCCGAAGAGTACTACCACCAAGATTCCTATCAAAAGTTCACTGGTATGGGG
GTATCGGAGGTTCAAAAGGACGATTCTCCAAGAACAACTCCTGAGCTTTCACGAAACGAT
CAGTCCCCTTCGGAGCGACCACCCCCCGGCTCTGCAGACAGCGATGACCCAGACGACTTC
GCCCCCAAGAGGAAGCAGAGACGATACAGAACAACCTTCACCAGTTTCCAGCTCGAAGAG
TTGGAGAAAGCATTCTCTAGAACTCACTACCCTGATGTTTTTACGAGAGAGGAGTTGGCG
ATGAAAATCGGACTAACGGAAGCGAGAATACAGGTGTGGTTTCAAAACCGTCGTGCTAAA
TGGAGGAAACAGGAGAAGGTGGGGCCTCAAGGGCACCCTTACAACCCTTATCTGGCTGGG
GGCGCAGCGCCTCCCCCATCAGTAGTCGCTTCAATGCCGAACCCTTTCTCACAACTCGGC
TTTGGCTTCAGAAAACCATTTGACGCAAACGCTTTGGCATCATTTAGATATAATAGTACC
CCAGTGCTGGGAACGCAATACCTCGGTACGCCGTTATCTCGACCTCCGCTTTTCAGCGCT
CCGATGTATTCTTCGGCTCCTCCCTTCCACTCGCTCCTCGCTGGCTTGGCAGCTCCCAGA
CAATCTCCTGACCCTCCGCCGGTCTCGCCCCCCATATCTCCCGGCAGCGAGTCCCCCCCA
ATACAACCAGGTCCAGAAGTCGAACGAAGGAGTTCAAGTATAGCCGCCTTAAGAATGGCG
GCTAGAGAACACGAGTTGAGGTTGGAAATGTTAAGACAACGACACCATACTGACTTGATA
AGTTGA

Protein sequence:

MDLTTRRCEDGEVVNGKEQCRGFNGTSDVTGVIEVSRKSSSFSIRSLVGGEDSDRPADGH
VNTPEEYYHQDSYQKFTGMGVSEVQKDDSPRTTPELSRNDQSPSERPPPGSADSDDPDDF
APKRKQRRYRTTFTSFQLEELEKAFSRTHYPDVFTREELAMKIGLTEARIQVWFQNRRAK
WRKQEKVGPQGHPYNPYLAGGAAPPPSVVASMPNPFSQLGFGFRKPFDANALASFRYNST
PVLGTQYLGTPLSRPPLFSAPMYSSAPPFHSLLAGLAAPRQSPDPPPVSPPISPGSESPP
IQPGPEVERRSSSIAALRMAAREHELRLEMLRQRHHTDLIS