DPGLEAN02459 in OGS1.0

New model in OGS2.0  DPOGS214105
Genomic Position  scaffold60:+ 64477-65583
CDS Length  1107
Paired RNAseq reads  593
Single RNAseq reads  1819
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006156 (6e-11)
Best Drosophila hit  SoxNeuro (1e-43)
Best Human hit  transcription factor SOX-2 (2e-38)
Best NR hit (blastp)  sex-determining region y protein, sry [Aedes aegypti] (4e-83)
Best NR hit (blastx)  transcription factor SOX-19 [Culex quinquefasciatus] (4e-56)
GeneOntology terms
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0007399 nervous system development
GO:0005515 protein binding
GO:0007417 central nervous system development
GO:0007411 axon guidance
GO:0031490 chromatin DNA binding
GO:0030111 regulation of Wnt receptor signaling pathway
InterPro families
IPR000910 High mobility group, HMG1/HMG2
IPR009071 High mobility group, superfamily
IPR022097 Transcription factor SOX
Orthology group  MCL13937

Nucleotide sequence:

ATGCTGACGATGGAGACGGATCTAAAGGGCGGCAGCCTGCACGCCACGGTGCCGCCGCAC
CACGCGCTGCAGCACGGGTACGGGTCGCTAGGCGCGCTCGGTGGCATGATGGCGCTGCCG
CAGCAGCAGCCGCTGGCCCAGCACCAGCCACTGCAGCACCACCAGCACCAACCCCTGCCC
CAGCACCACGCGCAGCCCCAGCAGCAACACCATCAACCCCCAAACAACAACAACAACAAC
TCCAGCAAGAACTCTAACGCGGAGAGGGTGAAGCGTCCCATGAACGCTTTCATGGTGTGG
TCGCGCGGGCAACGCAGGAAGATGGCTTCCGACAATCCTAAAATGCACAACTCGGAAATA
TCTAAACGTTTGGGTGCACAGTGGAAAGACCTCTCGGAGTCAGAGAAGCGACCATTCATC
GACGAGGCGAAGAGGCTTCGGGCCGTTCACATGAAGGAACACCCGGACTACAAGTACAGA
CCCCGGAGGAAAACGAAGACGCTCGCGAAGAAACAGGAAAAGTATCCGTTAGGAGGCGGC
GCTCTACTGGGAGCCGGTGACGGTCAGCGCACGAACGCGCCGACGGCTCAGCAGCCGCGG
GACGTGTACCAGATGACACCGAACGGGTACATGCCCAACGGTTACATGATGCACGATCCC
AGCGCCTATCAACAGCAGGCGTACGGCTACCCGCGCTACGACGTGTCACAGATGCAGCAG
CAGTACAGCGGCGGATACTACGGCGGCGGGCAGGGCTCGCCGTATCTCCCTCAGCCGCCG
TCTCCTTCCGCGTACGGCCTGGGTCCCGGCTCGCCAGGAGGGTACGCGATGCCCGCCTCG
TGCGCCTCGCACTCACCCAGCGGATCATCCGCCAAGTCGGAGCCGGTTTCCCCGGGCCCG
CCGGGTATGAAGCGCGAATACGTGCACGAGCCGCAGCTAAAGCGCGAGTTCGCGCACGCG
CACGGCGGCCAGAGTCACCCGCACGAGCAGCTCGGCATGAAGCGCGAGTACGGCCAGCAG
GACCTCAGCCACATCATCAACATGTACCACGTGCCTGATGAACAGCGGTACGCGGCGGCC
GGAGAGCGAGCCATGCCGCTCATCTGA

Protein sequence:

MLTMETDLKGGSLHATVPPHHALQHGYGSLGALGGMMALPQQQPLAQHQPLQHHQHQPLP
QHHAQPQQQHHQPPNNNNNNSSKNSNAERVKRPMNAFMVWSRGQRRKMASDNPKMHNSEI
SKRLGAQWKDLSESEKRPFIDEAKRLRAVHMKEHPDYKYRPRRKTKTLAKKQEKYPLGGG
ALLGAGDGQRTNAPTAQQPRDVYQMTPNGYMPNGYMMHDPSAYQQQAYGYPRYDVSQMQQ
QYSGGYYGGGQGSPYLPQPPSPSAYGLGPGSPGGYAMPASCASHSPSGSSAKSEPVSPGP
PGMKREYVHEPQLKREFAHAHGGQSHPHEQLGMKREYGQQDLSHIINMYHVPDEQRYAAA
GERAMPLI
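
The fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record: it assumes the standard genetic code and inclusive 1-based coordinates, and the function names (`span_length`, `expected_protein_length`, `translate`) are hypothetical helpers introduced only for this example. It shows that the genomic span and the CDS length agree, which is consistent with an uninterrupted coding region, and that a 1107-nt CDS ending in a stop codon should encode 368 residues.

```python
# Illustrative consistency checks for a gene record.
# Assumptions: standard genetic code, inclusive 1-based coordinates.
# All function names here are hypothetical, not part of any database API.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry standard codon table from the compact TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that includes its terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(span_length(64477, 65583))        # 1107
    print(expected_protein_length(1107))    # 368
    print(translate("ATGCTGACGATGGAGACG"))  # MLTMET
```

Passing the full nucleotide sequence (with line breaks removed) to `translate` should reproduce the protein sequence listed above, under the same assumptions.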