New model in OGS2.0 | DPOGS214105  |
---|---|
Genomic Position | scaffold60:+ 64477-65583 |
CDS Length | 1107 |
Paired RNAseq reads   | 593 |
Single RNAseq reads   | 1819 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006156 (6e-11) |
Best Drosophila hit   | SoxNeuro (1e-43) |
Best Human hit | transcription factor SOX-2 (2e-38) |
Best NR hit (blastp)   | sex-determining region y protein, sry [Aedes aegypti] (4e-83) |
Best NR hit (blastx)   | transcription factor SOX-19 [Culex quinquefasciatus] (4e-56) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0005634 nucleus; GO:0045449 regulation of transcription; GO:0007399 nervous system development; GO:0005515 protein binding; GO:0007417 central nervous system development; GO:0007411 axon guidance; GO:0031490 chromatin DNA binding; GO:0030111 regulation of Wnt receptor signaling pathway |
InterPro families    | IPR000910 High mobility group, HMG1/HMG2; IPR009071 High mobility group, superfamily; IPR022097 Transcription factor SOX |
Orthology group | MCL13937 |
Nucleotide sequence:
ATGCTGACGATGGAGACGGATCTAAAGGGCGGCAGCCTGCACGCCACGGTGCCGCCGCAC
CACGCGCTGCAGCACGGGTACGGGTCGCTAGGCGCGCTCGGTGGCATGATGGCGCTGCCG
CAGCAGCAGCCGCTGGCCCAGCACCAGCCACTGCAGCACCACCAGCACCAACCCCTGCCC
CAGCACCACGCGCAGCCCCAGCAGCAACACCATCAACCCCCAAACAACAACAACAACAAC
TCCAGCAAGAACTCTAACGCGGAGAGGGTGAAGCGTCCCATGAACGCTTTCATGGTGTGG
TCGCGCGGGCAACGCAGGAAGATGGCTTCCGACAATCCTAAAATGCACAACTCGGAAATA
TCTAAACGTTTGGGTGCACAGTGGAAAGACCTCTCGGAGTCAGAGAAGCGACCATTCATC
GACGAGGCGAAGAGGCTTCGGGCCGTTCACATGAAGGAACACCCGGACTACAAGTACAGA
CCCCGGAGGAAAACGAAGACGCTCGCGAAGAAACAGGAAAAGTATCCGTTAGGAGGCGGC
GCTCTACTGGGAGCCGGTGACGGTCAGCGCACGAACGCGCCGACGGCTCAGCAGCCGCGG
GACGTGTACCAGATGACACCGAACGGGTACATGCCCAACGGTTACATGATGCACGATCCC
AGCGCCTATCAACAGCAGGCGTACGGCTACCCGCGCTACGACGTGTCACAGATGCAGCAG
CAGTACAGCGGCGGATACTACGGCGGCGGGCAGGGCTCGCCGTATCTCCCTCAGCCGCCG
TCTCCTTCCGCGTACGGCCTGGGTCCCGGCTCGCCAGGAGGGTACGCGATGCCCGCCTCG
TGCGCCTCGCACTCACCCAGCGGATCATCCGCCAAGTCGGAGCCGGTTTCCCCGGGCCCG
CCGGGTATGAAGCGCGAATACGTGCACGAGCCGCAGCTAAAGCGCGAGTTCGCGCACGCG
CACGGCGGCCAGAGTCACCCGCACGAGCAGCTCGGCATGAAGCGCGAGTACGGCCAGCAG
GACCTCAGCCACATCATCAACATGTACCACGTGCCTGATGAACAGCGGTACGCGGCGGCC
GGAGAGCGAGCCATGCCGCTCATCTGA
Protein sequence:
MLTMETDLKGGSLHATVPPHHALQHGYGSLGALGGMMALPQQQPLAQHQPLQHHQHQPLP
QHHAQPQQQHHQPPNNNNNNSSKNSNAERVKRPMNAFMVWSRGQRRKMASDNPKMHNSEI
SKRLGAQWKDLSESEKRPFIDEAKRLRAVHMKEHPDYKYRPRRKTKTLAKKQEKYPLGGG
ALLGAGDGQRTNAPTAQQPRDVYQMTPNGYMPNGYMMHDPSAYQQQAYGYPRYDVSQMQQ
QYSGGYYGGGQGSPYLPQPPSPSAYGLGPGSPGGYAMPASCASHSPSGSSAKSEPVSPGP
PGMKREYVHEPQLKREFAHAHGGQSHPHEQLGMKREYGQQDLSHIINMYHVPDEQRYAAA
GERAMPLI
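
The figures in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original annotation. It assumes the genomic coordinates are 1-based and inclusive, and it uses a protein length of 368 residues, counted from the listing above (six lines of 60 plus a final line of 8). The function name `cds_consistency` is hypothetical.

```python
def cds_consistency(start, end, cds_length, protein_length):
    """Compare a genomic span, a CDS length, and a protein length.

    Assumes 1-based inclusive coordinates and a CDS that ends
    with a single stop codon.
    """
    span = end - start + 1                      # bases covered on the scaffold
    codons, remainder = divmod(cds_length, 3)   # whole codons and leftover bases
    return {
        "span": span,
        # A span equal to the CDS length suggests an uninterrupted (intron-free) CDS.
        "span_equals_cds": span == cds_length,
        "in_frame": remainder == 0,
        "codons": codons,
        # One codon is the stop (TGA here), so residues = codons - 1.
        "matches_protein": codons == protein_length + 1,
    }


# Values taken from the record: scaffold60:+ 64477-65583, CDS Length 1107.
report = cds_consistency(64477, 65583, 1107, 368)
for key, value in report.items():
    print(f"{key}: {value}")
```

Under these assumptions the span equals the CDS length, which is consistent with a single-exon model, and 369 codons correspond to 368 residues plus the terminal TGA stop.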