DPGLEAN13130 in OGS1.0

New model in OGS2.0DPOGS210997 
Genomic Positionscaffold307:+ 77324-78537
See gene structure
CDS Length1110
Paired RNAseq reads  164
Single RNAseq reads  399
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006483 (8e-29)
Best Drosophila hit  intermediate neuroblasts defective (2e-16)
Best Human hithomeobox protein Hox-D3 (2e-19)
Best NR hit (blastp)  special homeobox protein 8 [Bombyx mori] (7e-28)
Best NR hit (blastx)  special homeobox protein 1 [Bombyx mori] (9e-28)
GeneOntology terms












  
GO:0030528 transcription regulator activity
GO:0007160 cell-matrix adhesion
GO:0051216 cartilage development
GO:0010628 positive regulation of gene expression
GO:0043565 sequence-specific DNA binding
GO:0030878 thyroid gland development
GO:0005634 nucleus
GO:0048704 embryonic skeletal system morphogenesis
GO:0007275 multicellular organismal development
GO:0007219 Notch signaling pathway
GO:0009952 anterior/posterior pattern formation
GO:0045666 positive regulation of neuron differentiation
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
InterPro families



  
IPR017970 Homeobox, conserved site
IPR020479 Homeobox, eukaryotic
IPR001356 Homeobox
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
Orthology groupND

Nucleotide sequence:

ATGTCAGTCTCAACATCAATTTCGAATGCAGCTCAGCCAGTTTTCAGTCCAACTAATGCC
AGTGATGACCAGTGGCTATCAACCAGCAATCTGCAACCCCGGAACGAGACGAATTTTATA
ACAGGAGAGCCACAGCAATATAAAGTAAATCAAATTAATTATAACAGCACTAGTGCTTTT
TATAATGCACCTGATATTCCGAGTACTCATGTAAATTCTTGCAACGGCCCTATGACTTAT
AATCCAAATTATATAAACACTTGTCCACCAAATCCAGTTTATGGGCAAGAATTTAGATAC
GAACCATCACAAACTGCTCATCCTAATCCAGGAGTTATGTTTTTGGGTCCTCGTGGAACT
ATTGGGACTATGAATTCTTGGAGTAACTCAACCAACAGCAGCAATCGTTTAGTATCTAGG
CCACTGAACGGGAATAAACCGTTAAGTGGTGGTGTAAAGAAACCGAAACGTATTCGCACA
GCATTCACTAGTTCTCAAATGATGGAACTAGAAAACGAGTATACAAGGAACAGATATCTA
GACCGCAGCCGTCGCATCGAACTGTCTGAGATATTGAATTTAAACGAACGCACTATTAAG
ATTTGGTTCCAGAATAGAAGGATGAAGGAAAAGAAGGATAGAGCTGAGAGTCTTGAGGAC
ACTGAAGCCTCAAGCACTACAGAGCTTAACGATCACCAAGATTATCCTGGACAGATGATC
ATGTATGGCCAATATCCCCAAAACTTATACGGCAGAAGTAATATTTACATTGAACAGTAT
CCAGTAACATCCACTCCTCTGACGATGCCGACTAATGAAGTCCAATTAGTTAATAGTATT
CCTGAATCGGTTCTTAATACATATCCCACGTATATGGTTGAGAATAATTCTGATATAGTC
GAGAATTTCGATATTAAAGAACCAGAAATGAACGTGCAAATGCAAGGGTATAATAACAAT
AAAATTGAATTGATCGATTCCAAAGAAACGACCCAAGATTCGGTGCCTCAGTCAGAGAGT
AGTACAAACGACGCTGGTAAAGATGGCTTTAATGGTCCCAATTGGGATTTATCTTGGATC
CGCAGCATTCATATGGACGAAGAACTTTGA

Protein sequence:

MSVSTSISNAAQPVFSPTNASDDQWLSTSNLQPRNETNFITGEPQQYKVNQINYNSTSAF
YNAPDIPSTHVNSCNGPMTYNPNYINTCPPNPVYGQEFRYEPSQTAHPNPGVMFLGPRGT
IGTMNSWSNSTNSSNRLVSRPLNGNKPLSGGVKKPKRIRTAFTSSQMMELENEYTRNRYL
DRSRRIELSEILNLNERTIKIWFQNRRMKEKKDRAESLEDTEASSTTELNDHQDYPGQMI
MYGQYPQNLYGRSNIYIEQYPVTSTPLTMPTNEVQLVNSIPESVLNTYPTYMVENNSDIV
ENFDIKEPEMNVQMQGYNNNKIELIDSKETTQDSVPQSESSTNDAGKDGFNGPNWDLSWI
RSIHMDEEL