DPGLEAN00318 in OGS1.0

New model in OGS2.0  DPOGS209211
Genomic Position  scaffold1056:+ 28871-29746
CDS Length  876
Paired RNAseq reads  10
Single RNAseq reads  33
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010980 (8e-92)
Best Drosophila hit  intermediate neuroblasts defective (1e-22)
Best Human hit  GS homeobox 1 (5e-22)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC006888 [Tribolium castaneum] (3e-28)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL004130 [Aedes aegypti] (6e-28)
Gene Ontology terms
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045449 regulation of transcription
GO:0007419 ventral cord development
GO:0007400 neuroblast fate determination
GO:0009953 dorsal/ventral pattern formation
GO:0007398 ectoderm development
GO:0007420 brain development
GO:0007417 central nervous system development
GO:0007389 pattern specification process
GO:0043565 sequence-specific DNA binding
GO:0010551 regulation of gene-specific transcription from RNA polymerase II promoter
GO:0010553 negative regulation of gene-specific transcription from RNA polymerase II promoter
GO:0008134 transcription factor binding
InterPro families
IPR001356 Homeobox
IPR020479 Homeobox, eukaryotic
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
IPR012287 Homeodomain-related
Orthology group  MCL17995

Nucleotide sequence:

ATGTCGAGATCATTCCTAGTGGACGCCTTGATCAGTGACACCAAAGACAACAACACAGAA
ATGAAGAGCGACCATCTCACCTACAACCTGGGCAACTTGGACACGAGACCGAAGTTCCTC
CCGTACCCTTACCCAGGCAGTATCAACCTGCTGTCTCTCGGCCTCCAGCAGCAGCGAGCG
CCAGACCTGTTCCGACCGTTCCTGGAACAATTGAATTTCCGCTACCCGATGTTACATCAG
CTGCCCCGACAGACGGACTTCTTTGGTCCCGCTCACGAGACTCGCCCCTTCGAAGGTTTC
AAAACCGAAGATCAGGAGACGGTTGGTTTAGTGAATAGAGCTAAGAAATCTGTGTCACCG
TACTTGCACCATCCTTACAAATCGACCGCGACTTCACCATCCAAGAGCCAGGGTCAGAGG
TCACCGTCTTTATCTAGCGATAGTCGGAACGGCTCCCCGAGCCCGCCCCTCGGACATCCC
GAAGAACTCCTACCCGGATACTCAAAAGAACTAAAACGGCTACCCTTAAAAGAAGATTCG
AGCAAACGCATTAGAACAGCTTTCACGGGGACACAACTCCTTGAGCTGGAGAGAGAGTTC
TCCATGAACATGTATCTATCGAGACTGAGGAGGATAGAGATCGCCTCCAGGCTGAAGCTG
TCAGAGAAACAAGTGAAGATATGGTTCCAGAACCGACGCGTCAAGCTCAAGAAAGAAGAG
ACCCCGCTCGCTAACGAGGGGAGAGGAAAGAGATGCTGCTGCAGCAAGGGAACCTGCTCC
AAGAGCTCCACCTCCTGCGACGACGAGCAGGGACAGATAGACGTGGTCACCGACTACGAC
ACGTGTGAAGCACAGAACCTGTCCAGGTACTCCTGA

Protein sequence:

MSRSFLVDALISDTKDNNTEMKSDHLTYNLGNLDTRPKFLPYPYPGSINLLSLGLQQQRA
PDLFRPFLEQLNFRYPMLHQLPRQTDFFGPAHETRPFEGFKTEDQETVGLVNRAKKSVSP
YLHHPYKSTATSPSKSQGQRSPSLSSDSRNGSPSPPLGHPEELLPGYSKELKRLPLKEDS
SKRIRTAFTGTQLLELEREFSMNMYLSRLRRIEIASRLKLSEKQVKIWFQNRRVKLKKEE
TPLANEGRGKRCCCSKGTCSKSSTSCDDEQGQIDVVTDYDTCEAQNLSRYS
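
Consistency check (illustrative):

The fields in this record can be cross-checked against each other: the genomic span should match the CDS length if the model is a single exon, and the nucleotide sequence should translate to the listed protein. The following is a minimal sketch, assuming the standard genetic code and the position format shown in this record; the function names span_length and translate are hypothetical helpers, not part of any database tool.

```python
import re

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def span_length(position: str) -> int:
    """Return the inclusive length of a 'scaffold:strand start-end' position."""
    match = re.fullmatch(r"\s*(\S+):([+-])\s*(\d+)-(\d+)\s*", position)
    if match is None:
        raise ValueError(f"unrecognised position: {position!r}")
    start, end = int(match.group(3)), int(match.group(4))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS with the standard code, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        a, b, c = (BASES.index(base) for base in cds[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # Span implied by the Genomic Position field of this record.
    print(span_length("scaffold1056:+ 28871-29746"))
    # First ten codons of the nucleotide sequence above.
    print(translate("ATGTCGAGATCATTCCTAGTGGACGCCTTG"))
```

Pasting the full nucleotide sequence into translate and comparing the result with the protein sequence gives a quick check that the two blocks belong to the same model.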