New model in OGS2.0 | DPOGS214398  |
---|---|
Genomic Position | scaffold3005:+ 3971-8002 |
See gene structure | |
CDS Length | 1035 |
Paired RNAseq reads   | 115 |
Single RNAseq reads   | 685 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011254 (1e-91) |
Best Drosophila hit   | homothorax, isoform B (4e-23) |
Best Human hit | homeobox protein PKNOX2 (3e-59) |
Best NR hit (blastp)   | PREDICTED: similar to Homeobox protein PKNOX2 (PBX/knotted homeobox 2) (Homeobox protein PREP-2) [Tribolium castaneum] (8e-84) |
Best NR hit (blastx)   | PREDICTED: similar to Homeobox protein PKNOX2 (PBX/knotted homeobox 2) (Homeobox protein PREP-2) [Tribolium castaneum] (3e-71) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0043565 sequence-specific DNA binding |
InterPro families    | IPR009057 Homeodomain-like IPR001356 Homeobox IPR012287 Homeodomain-related |
Orthology group | MCL16495 |
Nucleotide sequence:
ATGCAGGGCGGCGCCCCCGTCACTGACGCCGATCAGGCACAGTTCGAGGCGGACAAGCGA
GCTGTCTACAAACATCCTCTATTCCCGCTTCTGGCGCTGCTGTTAGAGCGATGCGAGCAG
GCGACTGCTGGGGCGGAGCCCCCGGCCGCGGATGCCTTCGGGGCCGACCTGCAGGCCTTC
GTACAGCACCAGCGAAGGGACCGCCGCCCCTTCCTGGTGGACGACCCCGAGATCGACGGC
TTGATGATTAAGAGCATCCAGGTGCTCCGCATACACCTACTGGAGCTGGAGAAGGTGCAG
GAGCTGTGCAGGGACTTCTGCGGAAGATACATAGCCTGTCTCAAGACGAAGATGCAAAGT
GAAAATCTTCTGAGGACCGATTACTCTGCTGTCGTGGCGGCCGGGAGGTTAGGACCGCCT
TTTATGCGTGAGGCGCGGGTTCGAAACCTGGCGAGTACCTATGTGATCTGTTCCGAGTCA
TATTCCCCGGCTCCCCCGCCGCCGCCATCTTTCCCCCCGCCTTACCCCGCGCTGACTGAT
CTAGCCTACCCCAGAGATAGCGCATCTCTCGTAGTCCAAGGTTCAACACCTATCGGTCAA
ATCGGAGCTGGTTTAATATCCACTGATTCATTTACAGGCACCAACTCTAATTCGTGTTCT
TCGGTATCTGGTTCCCCGCCACCCGCTGATGATGATGATGAGGGAGTCAAGCGGGGAGTC
CTGCCGCGACACGCGACCCAGGTCATGAGGGCCTGGCTGTTCCAACACCTGGTGCATCCT
TATCCCACGGAGGAGGAAAAGCGCTCCCTGGCGGCGCAGACGAGACTCACCCTCCTCCAG
GTCAACAACTGGTTCATTAACGCCAGGAGACGCATACTCCAGCCCATGCTGGACTGTCAA
GAGAAACCTGGTGGTAAGAAGAGTAAAAACGGGTCATCCATCAGCAAGCGGTACTGGCCG
GATGCTCTCACCAACCAGCAGTTCACAGCTGGTATGTGTTCATATGAAGCATATATTTTT
ATCATAACAGTATAA
Protein sequence:
MQGGAPVTDADQAQFEADKRAVYKHPLFPLLALLLERCEQATAGAEPPAADAFGADLQAF
VQHQRRDRRPFLVDDPEIDGLMIKSIQVLRIHLLELEKVQELCRDFCGRYIACLKTKMQS
ENLLRTDYSAVVAAGRLGPPFMREARVRNLASTYVICSESYSPAPPPPPSFPPPYPALTD
LAYPRDSASLVVQGSTPIGQIGAGLISTDSFTGTNSNSCSSVSGSPPPADDDDEGVKRGV
LPRHATQVMRAWLFQHLVHPYPTEEEKRSLAAQTRLTLLQVNNWFINARRRILQPMLDCQ
EKPGGKKSKNGSSISKRYWPDALTNQQFTAGMCSYEAYIFIITV