New model in OGS2.0 | DPOGS202998  |
---|---|
Genomic Position | scaffold8:- 99619-191605 |
See gene structure | |
CDS Length | 1221 |
Paired RNAseq reads   | 7 |
Single RNAseq reads   | 50 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003888 (2e-88) |
Best Drosophila hit   | arrowhead, isoform B (2e-92) |
Best Human hit | LIM/homeobox protein Lhx8 (2e-51) |
Best NR hit (blastp)   | PREDICTED: similar to GA10520-PA [Tribolium castaneum] (4e-116) |
Best NR hit (blastx)   | PREDICTED: similar to GA10520-PA [Tribolium castaneum] (1e-104) |
GeneOntology terms    | GO:0007444 imaginal disc development GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0043565 sequence-specific DNA binding GO:0008270 zinc ion binding GO:0005515 protein binding |
InterPro families    | IPR001781 Zinc finger, LIM-type IPR001356 Homeobox IPR009057 Homeodomain-like IPR012287 Homeodomain-related |
Orthology group | MCL10541 |
Nucleotide sequence:
ATGGAACGGGAGTGCGTGCAGCGACATGTGACTCGAGAGATACAACTAAGCACAAACCCT
TTACAGAAACAAAACAATGCACAAAATCGCAATGCAACAACAGCTATGCTGATAGACGGA
TCTCACAACGTGGTCCCTGTTCATATGTCTGTGTTTGTGTCTGTGCTGTGTCCGTCTGTG
ACTGTGCTGTGTCGACACTGTGACGTATGTGCAAACAGCGGTGACGTCAGAAGCCACGCC
CAGATCCGCCCGCTGTCGGCGCCCGTCTACCCATCTTACACCGGGCCACTAGAAAATACG
CCTTACCGTGATGATAGAGAGCTCAAAATTGATGCGACGTATGTCGTTGAACACTTGAAC
CACTGCGATGTTTCTGAAGAATTCTCAACATCGGGGACGGAGCATCGCACGTGTTGTGCC
TGTGGGGAGCCCATAGCTGATCGCTTCCTACTGGAAGTGGGCGGCGGCGCATGGCACACG
GGCTGCCTGAGATGCTGTGTCTGTGCTGTGCAATTAGACAGACATCCCTCCTGCTTTCTC
AGAGACAGACAGGTGTACTGCAAGCAAGACTACGCCAAGAGTTTTGGGGCAAAGTGCTCC
AAGTGCTGCCGAGGCATCTCTTCATCAGACTGGGTTCGCAAGGCTCGTGAGCAGGTATAC
CATCTGGCCTGCTTCGCCTGCGACGCCTGCGGCCGTCAGCTATCCACGGGGGAACAGTTC
GCTTTGCACGAGGACAGAGTACTCTGCAAGCCGCACTACTTGGAAACATTGGATGGAGGA
TCTATTTCCTCAGATGGCAAGTGCAATGGCTGTGACTCAGAAGGTTACCACAAAAGTAAA
GCGAAACGCGTTCGTACGACTTTCACGGAAGAACAATTGCAAGTTCTCCAAGCTAACTTC
CAGCTGGACTCGAACCCTGACGGCCAAGACCTGGAAAGGATCGCTCAAGTCACCGGTCTC
AGCAAACGGGTCACACAAGTCTGGTTTCAAAATAGCCGTGCCAGGCAGAAGAAACATCAA
CATACGGGAAAAGGGAAACAGAACCAAGCGATGTCCCGCGAGGATGCGGTGGGGTTTGGT
CGGCCCATCAACCTCCACCTCACGTATTCCTTCCAGAACAAACCGCCGTTCGTTCCGATA
GATGGAACTTCTTTCACTGACTCATCAATGGACGAACTGTCCGAGGACTCCTCGATACAC
TGCATGCAGAGCGAGGTCTAG
Protein sequence:
MERECVQRHVTREIQLSTNPLQKQNNAQNRNATTAMLIDGSHNVVPVHMSVFVSVLCPSV
TVLCRHCDVCANSGDVRSHAQIRPLSAPVYPSYTGPLENTPYRDDRELKIDATYVVEHLN
HCDVSEEFSTSGTEHRTCCACGEPIADRFLLEVGGGAWHTGCLRCCVCAVQLDRHPSCFL
RDRQVYCKQDYAKSFGAKCSKCCRGISSSDWVRKAREQVYHLACFACDACGRQLSTGEQF
ALHEDRVLCKPHYLETLDGGSISSDGKCNGCDSEGYHKSKAKRVRTTFTEEQLQVLQANF
QLDSNPDGQDLERIAQVTGLSKRVTQVWFQNSRARQKKHQHTGKGKQNQAMSREDAVGFG
RPINLHLTYSFQNKPPFVPIDGTSFTDSSMDELSEDSSIHCMQSEV