DPGLEAN03494 in OGS1.0

New model in OGS2.0  DPOGS203877
Genomic Position  scaffold5243:+ 13950-17096
CDS Length  771
Paired RNAseq reads  31
Single RNAseq reads  108
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003821 (2e-93)
Best Drosophila hit  single-minded, isoform B (7e-96)
Best Human hit  single-minded homolog 1 (8e-94)
Best NR hit (blastp)  PREDICTED: similar to Single minded [Tribolium castaneum] (2e-108)
Best NR hit (blastx)  PREDICTED: similar to Single minded [Tribolium castaneum] (7e-104)
GeneOntology terms
GO:0007418 ventral midline development
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007417 central nervous system development
GO:0007398 ectoderm development
GO:0007628 adult walking behavior
GO:0040011 locomotion
GO:0007420 brain development
GO:0007165 signal transduction
GO:0006355 regulation of transcription, DNA-dependent
GO:0004871 signal transducer activity
GO:0007419 ventral cord development
GO:0035225 determination of genital disc primordium
GO:0016566 specific transcriptional repressor activity
GO:0007409 axonogenesis
GO:0043565 sequence-specific DNA binding
GO:0016563 transcription activator activity
GO:0007411 axon guidance
InterPro families
IPR000014 PAS
IPR001092 Helix-loop-helix DNA-binding domain
IPR013767 PAS fold
IPR011598 Helix-loop-helix DNA-binding
IPR001067 Nuclear translocator
Orthology group  MCL10901

Nucleotide sequence:

ATGAAGGAGAAGAGCAAGAACGCGGCACGTTCGAGGAGGGAGAAGGAAAACGCTGAGTTC
CTCGAACTAGCTAAACTGTTACCACTACCATCAGCCATCACCTCACAGCTGGACAAGGCG
TCGGTGATACGGCTCACCACAAGTTACCTGAAGATGAGGCAGGTCTTCCCTGATGGTCTG
GGAGACGCCTGGGGCGCCGCCCCTCCTCCACCACAGCCCAGGGAACTCTCAATACGAGAG
CTGGGATCCCATCTCCTGCAGACCCTCGATGGGTTTATATTCGTGGTGTCACCAGATGGA
AAGATTATGTACATAAGTGAGACGGCGTCCGTTCATCTCGGACTTAGTCAGGTGGAATTG
ACCGGGAACTCTATATACGAGTACATCCACCAAGCTGATCACGAGGAGATGTCCGCGGTG
CTCAGCCTTCAGCATCCGCACACGTATGCTGGACCGCCGGCCGTTGGGTATCCTGTAGGT
GGTACCTGGAGTCCCAACGTGGACGTGGAGTGTGAGAGAGCCTTCTTCATCAGGATGAAG
TGCGTCCTCGCTAAGAGGAACGCTGGCCTCACCACGTCAGGGTATAAGGTCATCCACTGT
TCTGGATACCTCCGCGCCCGCCGCTTCGGCGACGGCACGGCTCCTCTCGGGCTGGTCGCC
GTCGGCCACTCCCTCCCGCCGTCAGCCGTCACCGAGCTGAAGCTCCACTCCAACATGTTC
ATGTTCCGCGCCTCGCTGGACATGAGGCTCATCTTCCTGGACGCCAGGTGA

Protein sequence:

MKEKSKNAARSRREKENAEFLELAKLLPLPSAITSQLDKASVIRLTTSYLKMRQVFPDGL
GDAWGAAPPPPQPRELSIRELGSHLLQTLDGFIFVVSPDGKIMYISETASVHLGLSQVEL
TGNSIYEYIHQADHEEMSAVLSLQHPHTYAGPPAVGYPVGGTWSPNVDVECERAFFIRMK
CVLAKRNAGLTTSGYKVIHCSGYLRARRFGDGTAPLGLVAVGHSLPPSAVTELKLHSNMF
MFRASLDMRLIFLDAR
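
Consistency check (illustrative sketch, not part of the original record):

The CDS length of 771 nt corresponds to 257 codons, that is, 256 residues plus a stop codon, which agrees with the protein sequence listed above. The sketch below shows one way to confirm this by translating the first and last lines of the nucleotide sequence. It assumes the standard genetic code (NCBI translation table 1); the names CODON_TABLE and translate are hypothetical helpers introduced here, not anything supplied by the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame, uppercase DNA string; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAAGGAGAAGAGCAAGAACGCGGCACGTTCGAGGAGGGAGAAGGAAAACGCTGAGTTC"
last_line = "ATGTTCCGCGCCTCGCTGGACATGAGGCTCATCTTCCTGGACGCCAGGTGA"

print(translate(first_line))  # MKEKSKNAARSRREKENAEF
print(translate(last_line))   # MFRASLDMRLIFLDAR*
print(771 // 3)               # 257 codons: 256 residues plus the stop
```

The first translation matches the start of the protein sequence and the second matches its final line followed by the stop codon.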