DPGLEAN12590 in OGS1.0

New model in OGS2.0  DPOGS216193
Genomic Position  scaffold479:+ 56469-66178
CDS Length  1347
Paired RNAseq reads  13
Single RNAseq reads  57
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004601 (2e-124)
Best Drosophila hit  aristaless (5e-24)
Best Human hit  homeobox protein ARX (2e-23)
Best NR hit (blastp)  aristaless-related homeobox protein [Danio rerio] (3e-24)
Best NR hit (blastx)  PREDICTED: similar to transcription factor protein [Tribolium castaneum] (4e-24)
Gene Ontology terms
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007449 proximal/distal pattern formation, imaginal disc
GO:0048800 antennal morphogenesis
GO:0035015 elongation of arista core
GO:0045747 positive regulation of Notch signaling pathway
GO:0035218 leg disc development
GO:0022416 bristle development
GO:0043234 protein complex
GO:0016481 negative regulation of transcription
GO:0043565 sequence-specific DNA binding
GO:0007480 imaginal disc-derived leg morphogenesis
InterPro families
IPR001356 Homeobox
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
Orthology group  MCL39872
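
The Genomic Position field can be split into scaffold, strand, and coordinates with a short script. The following is a minimal sketch, not part of the original record: the function name parse_locus is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

# Pattern for strings such as "scaffold479:+ 56469-66178":
# scaffold name, colon, strand sign, optional space, start-end.
LOCUS_RE = re.compile(r"^(\S+):([+-])\s*(\d+)-(\d+)$")

def parse_locus(text):
    """Return (scaffold, strand, start, end) from a position string."""
    match = LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

scaffold, strand, start, end = parse_locus("scaffold479:+ 56469-66178")
print(scaffold, strand, span_length(start, end))
```

Under that assumption the gene model spans 9710 bp on the plus strand, considerably more than the 1347 bp CDS, which would be consistent with the model containing introns.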

Nucleotide sequence:

ATGGATGGTTTTAACAAGGAGAGACTAGACATAAGATCTGTATGTTCCCCGCGGCGTACA
CAAGGTGTAGCCGATCATTCGAAATACATTACTAGGTTGTACCTTCTAGCGATTAACGAT
ATGGATGAATCTTTAGACGGACGGGAAGCCGGTGCTGGTGCATCTGGATTCGATTCAGCT
GCCATTTTAAACGACGACGTGTCACCAACCACCAGTTTCTGTTACAACATACCAAATCTT
TTTAATGGACTTGGACCGGGATCAGTTCTGTGTGAAAGACATGAGACAGATGTCAACGAA
AGCTATCCAACAAATACGGAACCTTCAGATGAACCAGCTGATCAACAGTATGAAGATGAT
GAAGAGTATTCACCTAATTCTAGTAAGAATCGAACAACTTTCTCGAATGTTCAACTGGAA
CAATTGGAAGCTGCCTTTCATAAGACACACTACCCTGACGTCTTCTTCAGAGAGGAGTTG
GCGATGAGAATTGACCTGACCGAAGCAAGAGTTCAGGTCTGGTTTCAGAACCGTCGGGCT
AAATGGAGGAAACAACAAAAGGCTGGATGTGAGCCTTACCCTCGCTCCACGAGATCCCCT
GACCCTCGGTCTCCTTCAGCTCTCGCCCTTGAAATGAGGGATTTTATTACCATACCAGTA
TCTTCATCTCAAATAGTCAGCCTCGCTGACAACTCCAATCAAAACAGTTACGCTTCCAAG
TCAAAACAATCATCTCCCGCGATACATATATCAGCGTTGCCAACTAGGCAGAATTCTCAG
GTGGATCTACCCTCGTTTTCGATTCCACTGCAATATAACTTAGGATCGTTTAAGAAAATA
GGAGAAGATAACCAGTTAGACTTAGACACTTCGACGTCAGATGTACAAAACCACTGGGAT
ACTCGATTGATGCCGAATTTCAATATAATGAATATAAAACCGATGTTAGAAAATCGAGAA
AATCTCGAAATGTCCGAAGTGAAATATGAAGTGAAAAACGAAAATGTGAGATCATACAAT
GATATGTCAAATTCGATTCCGGCTACGATGGAACATGAAGACATATTGGGCAAAGAAGCC
GTGAGTGAAGGTCAAACTATATCACCGGATTTTGAGATTGGGAGATACCAGAGTTACCAC
AGCGAAAATGAAAAGAGATATAGAGAAACTGGCTTCGATGAATCTATAATAGAGGATGGT
AGAAATGACTTCGGGTGCGACGGTGACTTTCTCTGTAAAAGTAACTATGATAAAATTGAT
GAAAAAAGAAACGACTTTGAAGAATGCCATCTATTGGACGCGGACAGCAGCGCTGTCGGG
ATGAGTAATTTTGAAAGCAATCTTTAA

Protein sequence:

MDGFNKERLDIRSVCSPRRTQGVADHSKYITRLYLLAINDMDESLDGREAGAGASGFDSA
AILNDDVSPTTSFCYNIPNLFNGLGPGSVLCERHETDVNESYPTNTEPSDEPADQQYEDD
EEYSPNSSKNRTTFSNVQLEQLEAAFHKTHYPDVFFREELAMRIDLTEARVQVWFQNRRA
KWRKQQKAGCEPYPRSTRSPDPRSPSALALEMRDFITIPVSSSQIVSLADNSNQNSYASK
SKQSSPAIHISALPTRQNSQVDLPSFSIPLQYNLGSFKKIGEDNQLDLDTSTSDVQNHWD
TRLMPNFNIMNIKPMLENRENLEMSEVKYEVKNENVRSYNDMSNSIPATMEHEDILGKEA
VSEGQTISPDFEIGRYQSYHSENEKRYRETGFDESIIEDGRNDFGCDGDFLCKSNYDKID
EKRNDFEECHLLDADSSAVGMSNFESNL
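
The two sequences above can be cross-checked against each other and against the CDS Length field: 1347 nt is 449 codons, that is, 448 residues plus one stop codon. The sketch below shows one way to do this, assuming the standard genetic code applies to this nuclear gene; the names translate and matches_record are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). Use of the standard table is an assumption.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS string; whitespace and line breaks are ignored.

    A single trailing stop codon is dropped from the result.
    Codons with characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein

def matches_record(cds, protein):
    """True if the translated CDS equals the listed protein sequence."""
    return translate(cds) == "".join(protein.split()).upper()

# Final line of the nucleotide block against the end of the protein block.
print(translate("ATGAGTAATTTTGAAAGCAATCTTTAA"))
```

Pasting the full nucleotide and protein blocks into matches_record performs the same comparison for the whole record.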