New model in OGS2.0 | DPOGS210964  |
---|---|
Genomic Position | scaffold1784:+ 10065-28230 |
CDS Length (bp) | 1011 |
Paired RNAseq reads   | 244 |
Single RNAseq reads   | 615 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006394 (3e-75) |
Best Drosophila hit   | Sex combs reduced, isoform A (2e-62) |
Best Human hit | homeobox protein Hox-B5 (1e-35) |
Best NR hit (blastp)   | sex combs reduced homolog [Bombyx mori] (5e-151) |
Best NR hit (blastx)   | sex combs reduced homolog [Bombyx mori] (3e-148) |
GeneOntology terms | GO:0003704 specific RNA polymerase II transcription factor activity; GO:0006357 regulation of transcription from RNA polymerase II promoter; GO:0005634 nucleus; GO:0007494 midgut development; GO:0007379 segment specification; GO:0007381 specification of segmental identity, labial segment; GO:0045498 sex comb development; GO:0007548 sex differentiation; GO:0007432 salivary gland boundary specification; GO:0043565 sequence-specific DNA binding; GO:0003700 sequence-specific DNA binding transcription factor activity |
InterPro families | IPR001356 Homeobox; IPR012287 Homeodomain-related; IPR017970 Homeobox, conserved site; IPR001827 Homeobox protein, antennapedia type, conserved site; IPR020479 Homeobox, eukaryotic; IPR017995 Homeobox protein, antennapedia type; IPR009057 Homeodomain-like |
Orthology group | MCL16519 |
Nucleotide sequence:
ATGAGTTCCTATCAATTCGTCAATTCCCTGGCCTCGTGCTATGGGAATCAGGTCCCGGGA
CGGACTGGGACACCCGTGGATCAGAGCGGTCACCCGGGCCTGCCGACACCCGGGACGGAT
TACTACAATCCGAACGCGGCAGCGTCCTATCCGAACACGTGTTATTCACCTCCACAAGTT
AGCCATCATTATCCCCAACATCCATACGCCACACCGGCCGCTGGTGCACACATGCAGCCA
CAAGCCATGATAGATTACACACAACTCCATCCCCAAAGACTCGCGAGCAGCGCGACTCAC
ATCCACCAACACACGAACCCAAGCCCTGGTGCGTTATCACCGAACCTAATGACACCAACG
AGTCAGACGGCTAGTGCTTGTAAATTCGCCGATTCAACTTCGACGACGGGAGTGGCATCC
CCACAGGATCTATCCACATCGTCAGGCCCTGGGAGGACTTCACCTGGATTTAATGTTAAT
CCAGCCGGAACGAGTACCAAATTAGGCTTGACGACACCAATAGCGTCTCCAGTGGAACAC
AAGGCTAATATCAACCAGAATATATCGAGCCCAGCTTCAAGCACCTCGAGCAATGAGAGT
GCCGAAGCTAACAGCACCAGTACAAAGAACACCAAATCATCAGCCAGCGCTCAAGCGAAT
CCACCGCAGATATATCCATGGATGAAACGAGTGCATCTAGGACAGAGTACAGTGAACGCT
AACGGTGAGACGAAACGTCAGAGAACTTCCTACACCCGGTACCAGACGCTGGAACTGGAG
AAGGAGTTCCACTTCAACCGCTACCTGACGAGGAGGAGACGCATAGAGATCGCACACGCG
CTCTGTCTCACCGAGAGACAGATCAAGATATGGTTCCAGAATCGACGTATGAAGTGGAAG
AAGGAGCACAAAATGGCGTCCATGAACATAGTCCCCTACCACATGAACCCGTACGGACAC
CCCTACCAGTTCGACCTCCACCCGAGCCAATTCGCGCATCTATCAGCATAG
Protein sequence:
MSSYQFVNSLASCYGNQVPGRTGTPVDQSGHPGLPTPGTDYYNPNAAASYPNTCYSPPQV
SHHYPQHPYATPAAGAHMQPQAMIDYTQLHPQRLASSATHIHQHTNPSPGALSPNLMTPT
SQTASACKFADSTSTTGVASPQDLSTSSGPGRTSPGFNVNPAGTSTKLGLTTPIASPVEH
KANINQNISSPASSTSSNESAEANSTSTKNTKSSASAQANPPQIYPWMKRVHLGQSTVNA
NGETKRQRTSYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWK
KEHKMASMNIVPYHMNPYGHPYQFDLHPSQFAHLSA
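
Consistency check (illustrative):
The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the original record. The helper name `translate` and the variables `first_line` and `last_line` are hypothetical, and it assumes the standard nuclear code (NCBI translation table 1) applies to this gene model. Only the first and last lines of the nucleotide sequence are used here. Both start on a codon boundary because every preceding line is 60 bases long.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAGTTCCTATCAATTCGTCAATTCCCTGGCCTCGTGCTATGGGAATCAGGTCCCGGGA"
last_line = "CCCTACCAGTTCGACCTCCACCCGAGCCAATTCGCGCATCTATCAGCATAG"

print(translate(first_line))  # N-terminal residues
print(translate(last_line))   # C-terminal residues followed by the stop
```

Under that assumption, the two fragments translate to the first 20 and last 16 residues of the protein sequence above, with the final codon TAG as the stop. A CDS of 1011 bases corresponds to 337 codons, that is 336 residues plus the stop, which matches the length of the listed protein.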