New model in OGS2.0 | DPOGS210962  |
---|---|
Genomic Position | scaffold14:- 54849-64464 |
See gene structure | |
CDS Length | 912 |
Paired RNAseq reads   | 259 |
Single RNAseq reads   | 687 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006391 (3e-83) |
Best Drosophila hit   | antennapedia, isoform K (3e-48) |
Best Human hit | homeobox protein Hox-B7 (6e-34) |
Best NR hit (blastp)   | antennapedia homologue protein [Bombyx mori] (1e-132) |
Best NR hit (blastx)   | antennapedia homologue protein [Bombyx mori] (5e-113) |
GeneOntology terms    | GO:0005634 nucleus GO:0003704 specific RNA polymerase II transcription factor activity GO:0007494 midgut development GO:0006357 regulation of transcription from RNA polymerase II promoter GO:0007379 segment specification GO:0007507 heart development GO:0043565 sequence-specific DNA binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0007383 specification of segmental identity, antennal segment GO:0048542 lymph gland development |
InterPro families    | IPR009057 Homeodomain-like IPR001356 Homeobox IPR017970 Homeobox, conserved site IPR001827 Homeobox protein, antennapedia type, conserved site IPR020479 Homeobox, eukaryotic IPR017995 Homeobox protein, antennapedia type IPR012287 Homeodomain-related |
Orthology group | MCL11855 |
Nucleotide sequence:
ATGAGCGCCAACAATTGCGATAGCATGACATATTTCTCTAACGCATACGTCCCAGACATG
AGGAACGGGGGCCATGATCACCAGCAGGCGCACGCTCATTACGGTGCTGTGCCGCAGCAG
GGACACGAAATGGACGGTTGTGATCAACAACTCAGGCCTGCACAACATCACTACTCGGCG
CAGACGGCGCCTGGGATGCCCTATCCAAGATTCCCACCATACGACAGATTGGGTTACTAT
CAACAAATGGAACAGAATGGATATCGGCCCGATAGTCCATCACAGATGGGTCATATGGGT
CCGAAATCGGATGGATATGGCCCTAATGGTCACCAACCACCGGCTCCAGCGGTCTACCCT
TCGTCGTGCAAAGTACAAGCAGCAGCCGCGATGGCGGGTGGTGTCCCTGGGAGTCCTCCA
CTGGAACAGGCCCAGCAAATGCCTCACCACATGCATCCCCAGCAGCACATGGCCCAGCAC
GGGATGCCGTCGCATCAGCAGCACCTCATGTATCCCGTGGACGACATGCAGCACCAGACG
CAGATGCCTCCCATGCATCAGCAGTCGATGCACGCACAGCAGGCACCTCCTCAACAACCA
CCGCCAAATACCAATGCGTCGTTACCAAGTCCGCTCTACCCTTGGATGAGAAGTCAATTT
GAGCGGAAGCGTGGTCGGCAAACGTACACCCGGTACCAGACCCTGGAACTGGAGAAGGAG
TTCCACTTCAACCGCTACCTGACGAGGAGGAGACGCATAGAGATCGCACACGCGCTCTGT
CTCACCGAGAGACAGATCAAGATATGGTTCCAGAATCGACGTATGAAGTGGAAGAAGGAG
AATAAAACCAAAGGCGAGCCAGGGTCGGGAGACGAACCTGACAACATGAGTCCACCGACA
TCGCCACAATAA
Protein sequence:
MSANNCDSMTYFSNAYVPDMRNGGHDHQQAHAHYGAVPQQGHEMDGCDQQLRPAQHHYSA
QTAPGMPYPRFPPYDRLGYYQQMEQNGYRPDSPSQMGHMGPKSDGYGPNGHQPPAPAVYP
SSCKVQAAAAMAGGVPGSPPLEQAQQMPHHMHPQQHMAQHGMPSHQQHLMYPVDDMQHQT
QMPPMHQQSMHAQQAPPQQPPPNTNASLPSPLYPWMRSQFERKRGRQTYTRYQTLELEKE
FHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKENKTKGEPGSGDEPDNMSPPT
SPQ