New model in OGS2.0 | DPOGS206628  |
---|---|
Genomic Position | scaffold346:- 110098-119915 |
See gene structure | |
CDS Length | 966 |
Paired RNAseq reads   | 43 |
Single RNAseq reads   | 110 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010706 (2e-20) |
Best Drosophila hit   | ventral nervous system defective, isoform A (1e-37) |
Best Human hit | homeobox protein Nkx-2.2 (1e-38) |
Best NR hit (blastp)   | PREDICTED: ventral nervous system defective [Tribolium castaneum] (2e-47) |
Best NR hit (blastx)   | PREDICTED: ventral nervous system defective [Tribolium castaneum] (5e-48) |
GeneOntology terms    | GO:0021508 floor plate formation GO:0031018 endocrine pancreas development GO:0043565 sequence-specific DNA binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0030528 transcription regulator activity GO:0045449 regulation of transcription GO:0005634 nucleus GO:0003677 DNA binding GO:0007275 multicellular organismal development GO:0033504 floor plate development GO:0014044 Schwann cell development GO:0008347 glial cell migration GO:0008045 motor axon guidance |
InterPro families    | IPR017970 Homeobox, conserved site IPR001356 Homeobox IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR020479 Homeobox, eukaryotic |
Orthology group | MCL18010 |
Nucleotide sequence:
ATGATTGGCGACGGTGAAATGGGTTATGGGGACTATTATAATTATGAAAATAACTGGTCG
GTGGTGCCAACGGATTATGGGGCTGAATCGTGTCAGTACAGACAGCCACAAGTCGAAGAC
GACTACCGATATGGAACTTACGCATTGCTTGACAGCATGAAAATGCCCGGTCAGCGCCCC
GGTTTCCAAATATCCGATATTCTCGGCTTAAATGAGGCTAAAGGATTGGAACCACCTCCT
CAAGGAGGACTAAGCGGCCTGGAGTTACCTCCGTATGCTCCTCCGCATCACAACTACCCC
CATGAACTACTCCGACATCATCAGCCTTGGTTATCATTAGATCAGCACGATGGCACAGGT
ATGCTTGGACAGCAGGCGAGTCCTGACAGTACTTCCAGAGCATCTGAATTGTCATACGTT
GGTCCGTCAGCAGCTTCTCCCACAGTGACTGACCCGCGCCACGACCACGACCTGGAACAG
GAACATGATCATGACATCCACGATCACAGCCTGGAGTTAGACGACGATAACGACAACGAT
CAACCTAACACAGCCTCCGAGTCAAATCTATCGCACAAGAAACGCAAACGCAGAGTACTC
TTCTCCAAAGCCCAAACATACGAGTTGGAACGACGTTTCAGACAACAGAGATACCTCTCA
GCGCCTGAGCGGGAACACTTGGCTAGTTTAATACGCCTGACGCCGACGCAAGTAAAAATC
TGGTTTCAGAACCACAGATACAAGACAAAACGTGCAGTTCAAGAGAAAGGCGCCCATGAT
TTGAACGTGGGCGGTCTTAATTCACCGCGCCGTGTGGCGGTGCCGGTGTTGGTGAAGGAC
GGTAGGCCGTGTATCGGAAAGCCCGACGGCCTACCACCGTTGGGGATGACGCTGCCACCG
TACCAGCCGATGCATCACCAGCCCCCCGTCACGGGTCACGGACCTCAACCAGGTTGTTGG
TGGTGA
Protein sequence:
MIGDGEMGYGDYYNYENNWSVVPTDYGAESCQYRQPQVEDDYRYGTYALLDSMKMPGQRP
GFQISDILGLNEAKGLEPPPQGGLSGLELPPYAPPHHNYPHELLRHHQPWLSLDQHDGTG
MLGQQASPDSTSRASELSYVGPSAASPTVTDPRHDHDLEQEHDHDIHDHSLELDDDNDND
QPNTASESNLSHKKRKRRVLFSKAQTYELERRFRQQRYLSAPEREHLASLIRLTPTQVKI
WFQNHRYKTKRAVQEKGAHDLNVGGLNSPRRVAVPVLVKDGRPCIGKPDGLPPLGMTLPP
YQPMHHQPPVTGHGPQPGCWW