DPGLEAN14720 in OGS1.0

New model in OGS2.0DPOGS213606 
Genomic Positionscaffold772:+ 6283-16607
See gene structure
CDS Length639
Paired RNAseq reads  53
Single RNAseq reads  129
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011668 (3e-62)
Best Drosophila hit  Sox100B (9e-27)
Best Human hittranscription factor SOX-9 (6e-32)
Best NR hit (blastp)  PREDICTED: similar to SRY-box containing gene 10 [Apis mellifera] (4e-33)
Best NR hit (blastx)  PREDICTED: similar to SRY-box containing gene 8 [Nasonia vitripennis] (1e-25)
GeneOntology terms





















  
GO:0003702 RNA polymerase II transcription factor activity
GO:0005737 cytoplasm
GO:0016481 negative regulation of transcription
GO:0033690 positive regulation of osteoblast proliferation
GO:0010628 positive regulation of gene expression
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0010817 regulation of hormone levels
GO:0043565 sequence-specific DNA binding
GO:0048709 oligodendrocyte differentiation
GO:0001701 in utero embryonic development
GO:0048469 cell maturation
GO:0060009 Sertoli cell development
GO:0005634 nucleus
GO:0001649 osteoblast differentiation
GO:0007422 peripheral nervous system development
GO:0045662 negative regulation of myoblast differentiation
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0007283 spermatogenesis
GO:0005515 protein binding
GO:0008584 male gonad development
GO:0045165 cell fate commitment
GO:0045444 fat cell differentiation
GO:0060612 adipose tissue development
InterPro families
  
IPR000910 High mobility group, HMG1/HMG2
IPR009071 High mobility group, superfamily
Orthology groupMCL17858

Nucleotide sequence:

ATGAGCTGGGATCAGGACAGACCCGAGAAGCTGGAGATCAATGAAGCTGTCGGCAAGCTG
CTGCAGTCATTCAACTATGATACTATCGTGCCGCAGCCCAGCAAGGGCGGCGGTTGCATG
CGGCGGGCTCATGTGAAGCGCCCCATGAATGCTTTCATGGTGTTCGCACAGGCAATGCGT
CGTAGGTTGTCTGAACAACGGCCGTCACTGCACAACGCTGAACTGAGCAAATCCCTGGGA
TCCATGTGGAAGAGCTTGAGCGAAATGGAAAAGCTACCGTTTGTTAAGGAAGCCGAAAAA
CTACGGACTCAGCACAAACGTGAGTACCCCGACTATAAATACCAACCTAGACGTCGCAAG
CCACCGCCTACAGCATCCACGAGACTAAAACGGGAACCTACACCAGAAAGATCTCAAATC
GATTTCTCCCGCATCGAAGTCGATGGCGCCTTACTAGCTGACGGACCGCCAGATGGCGCT
GAGCTCGACCAGTATCTGAAACCAGTACCAGTGCCGGACTACCACGAGATGCAGCCACGC
TACACCCACACCTTATCCCATCACCCACCAGGGCTGTATACCCCTGTGCCGTCACATTTA
CACCCTCCATGTGACTGGCAACACTACCCGCATCCATAA

Protein sequence:

MSWDQDRPEKLEINEAVGKLLQSFNYDTIVPQPSKGGGCMRRAHVKRPMNAFMVFAQAMR
RRLSEQRPSLHNAELSKSLGSMWKSLSEMEKLPFVKEAEKLRTQHKREYPDYKYQPRRRK
PPPTASTRLKREPTPERSQIDFSRIEVDGALLADGPPDGAELDQYLKPVPVPDYHEMQPR
YTHTLSHHPPGLYTPVPSHLHPPCDWQHYPHP