New model in OGS2.0 | DPOGS209211  |
---|---|
Genomic Position | scaffold1056:+ 28871-29746 |
See gene structure | |
CDS Length | 876 |
Paired RNAseq reads   | 10 |
Single RNAseq reads   | 33 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010980 (8e-92) |
Best Drosophila hit   | intermediate neuroblasts defective (1e-22) |
Best Human hit | GS homeobox 1 (5e-22) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC006888 [Tribolium castaneum] (3e-28) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL004130 [Aedes aegypti] (6e-28) |
GeneOntology terms    | GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0045449 regulation of transcription GO:0007419 ventral cord development GO:0007400 neuroblast fate determination GO:0009953 dorsal/ventral pattern formation GO:0007398 ectoderm development GO:0007420 brain development GO:0007417 central nervous system development GO:0007389 pattern specification process GO:0043565 sequence-specific DNA binding GO:0010551 regulation of gene-specific transcription from RNA polymerase II promoter GO:0010553 negative regulation of gene-specific transcription from RNA polymerase II promoter GO:0008134 transcription factor binding |
InterPro families    | IPR001356 Homeobox IPR020479 Homeobox, eukaryotic IPR009057 Homeodomain-like IPR017970 Homeobox, conserved site IPR012287 Homeodomain-related |
Orthology group | MCL17995 |
Nucleotide sequence:
ATGTCGAGATCATTCCTAGTGGACGCCTTGATCAGTGACACCAAAGACAACAACACAGAA
ATGAAGAGCGACCATCTCACCTACAACCTGGGCAACTTGGACACGAGACCGAAGTTCCTC
CCGTACCCTTACCCAGGCAGTATCAACCTGCTGTCTCTCGGCCTCCAGCAGCAGCGAGCG
CCAGACCTGTTCCGACCGTTCCTGGAACAATTGAATTTCCGCTACCCGATGTTACATCAG
CTGCCCCGACAGACGGACTTCTTTGGTCCCGCTCACGAGACTCGCCCCTTCGAAGGTTTC
AAAACCGAAGATCAGGAGACGGTTGGTTTAGTGAATAGAGCTAAGAAATCTGTGTCACCG
TACTTGCACCATCCTTACAAATCGACCGCGACTTCACCATCCAAGAGCCAGGGTCAGAGG
TCACCGTCTTTATCTAGCGATAGTCGGAACGGCTCCCCGAGCCCGCCCCTCGGACATCCC
GAAGAACTCCTACCCGGATACTCAAAAGAACTAAAACGGCTACCCTTAAAAGAAGATTCG
AGCAAACGCATTAGAACAGCTTTCACGGGGACACAACTCCTTGAGCTGGAGAGAGAGTTC
TCCATGAACATGTATCTATCGAGACTGAGGAGGATAGAGATCGCCTCCAGGCTGAAGCTG
TCAGAGAAACAAGTGAAGATATGGTTCCAGAACCGACGCGTCAAGCTCAAGAAAGAAGAG
ACCCCGCTCGCTAACGAGGGGAGAGGAAAGAGATGCTGCTGCAGCAAGGGAACCTGCTCC
AAGAGCTCCACCTCCTGCGACGACGAGCAGGGACAGATAGACGTGGTCACCGACTACGAC
ACGTGTGAAGCACAGAACCTGTCCAGGTACTCCTGA
Protein sequence:
MSRSFLVDALISDTKDNNTEMKSDHLTYNLGNLDTRPKFLPYPYPGSINLLSLGLQQQRA
PDLFRPFLEQLNFRYPMLHQLPRQTDFFGPAHETRPFEGFKTEDQETVGLVNRAKKSVSP
YLHHPYKSTATSPSKSQGQRSPSLSSDSRNGSPSPPLGHPEELLPGYSKELKRLPLKEDS
SKRIRTAFTGTQLLELEREFSMNMYLSRLRRIEIASRLKLSEKQVKIWFQNRRVKLKKEE
TPLANEGRGKRCCCSKGTCSKSSTSCDDEQGQIDVVTDYDTCEAQNLSRYS