New model in OGS2.0 | DPOGS201237  |
---|---|
Genomic Position | scaffold687:- 12935-19945 |
CDS Length | 933 |
Paired RNAseq reads   | 13 |
Single RNAseq reads   | 29 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012499 (3e-59) |
Best Drosophila hit   | CG34031 (2e-28) |
Best Human hit | barH-like 2 homeobox protein (7e-14) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (2e-42) |
Best NR hit (blastx)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (8e-42) |
Gene Ontology terms | GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0006355 regulation of transcription, DNA-dependent; GO:0043565 sequence-specific DNA binding; GO:0003677 DNA binding; GO:0045449 regulation of transcription; GO:0005634 nucleus; GO:0030528 transcription regulator activity |
InterPro families | IPR020479 Homeobox, eukaryotic; IPR000047 Helix-turn-helix motif, lambda-like repressor; IPR001356 Homeobox; IPR012287 Homeodomain-related; IPR017970 Homeobox, conserved site; IPR009057 Homeodomain-like |
Orthology group | MCL17914 |
Nucleotide sequence:
ATGAACTCTATAGATTATATATCTAGCACGTGTGAGAATGATACAGGGAAGTTTGATAAT
AATGTCAGTGTCACGAATGATTTGACGGAGGAACGTACAGAGATACGGACATTCGTGGAA
TCGGATTCCGATTTGGACAGTGAAAATGAAACCGTCGATGTCATATGTGAGAACAGTGAA
CAAATGAATAGTATTAATTACAATACAATAGCTTACGGCTCCGTCGATAGTATTATATAC
AGCAAAAACTTATACGCTAACAAAATGACGGTGAAAAAGAAACGGGTTTGTGATAGTGAA
CAAATTAAAATTATAAACGATTCCGCAAACAGTTTATTACAGGCGAAGGGGAAAAACTTT
CTGATAGACAGTATACTAGGCAACGACGAGAGTAAAACTCAAAGAAAACTCGTCAAAGAC
GCCGATAACCCAGAGGAGACGGGAGACGAACATAATGTGTCATCAACCTCAACATGTCCG
GACATCTCCATCAACAGTGCTGATGTCTTGGCGGGACATGCGTACGCGCATTGGCTGGCG
ACACAACAACCTACTTTTTATGATGACAAAAATAATCGGAGGCAGAAACGTTCCGGACCA
GAGAGGAAACCTCGACAAGCATACAGCGCTAAACAACTAGAGAGACTCGAATCTGAATTT
AAGTTGGACAAATATCTGAGCGTATCAAAAAGATTGGAGCTCTCCAAGGCGCTCGGACTC
ACTGAGGTTCAAATAAAAACGTGGTTTCAAAATCGAAGGACAAAATGGAAGAAACAACTC
ACATCTCGTCTCAAGATCGCCCAGCGTCAGGGATTATTTCCCGGACATATCTTCGGACAC
GCCCCTCAGACTTATTCACTTATAAATCCTTATACCTACAGTCCATTAAGCTGCATGTTC
ACCCCCGTGACGTTGCCGACGTCGCAACCATGA
Protein sequence:
MNSIDYISSTCENDTGKFDNNVSVTNDLTEERTEIRTFVESDSDLDSENETVDVICENSE
QMNSINYNTIAYGSVDSIIYSKNLYANKMTVKKKRVCDSEQIKIINDSANSLLQAKGKNF
LIDSILGNDESKTQRKLVKDADNPEETGDEHNVSSTSTCPDISINSADVLAGHAYAHWLA
TQQPTFYDDKNNRRQKRSGPERKPRQAYSAKQLERLESEFKLDKYLSVSKRLELSKALGL
TEVQIKTWFQNRRTKWKKQLTSRLKIAQRQGLFPGHIFGHAPQTYSLINPYTYSPLSCMF
TPVTLPTSQP
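
The nucleotide and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, not part of the original record: it assumes the standard nuclear code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. To keep it short it translates only the first 18 and last 33 nucleotides of the CDS rather than all 933.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# T, C, A, G at each position. Using this table is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 33 nt of the CDS, copied from the record above.
cds_start = "ATGAACTCTATAGATTAT"
cds_end = "ACCCCCGTGACGTTGCCGACGTCGCAACCATGA"

first = translate(cds_start)
last = translate(cds_end)
print(first)  # MNSIDY
print(last)   # TPVTLPTSQP

# A 933 nt CDS holds 311 codons: 310 amino acids plus the stop codon.
expected_protein_length = 933 // 3 - 1
print(expected_protein_length)  # 310
```

Both fragments match the start and end of the listed protein sequence, and the 310 residues implied by the CDS length agree with the length of the protein as listed (five lines of 60 residues plus a final line of 10).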