Genomic Position | scaffold238:+ 31992-64202 |
---|---|
CDS Length | 1026 |
Paired RNAseq reads   | 684 |
Single RNAseq reads   | 2253 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004886 (3e-38) |
Best Drosophila hit   | homothorax, isoform F (4e-68) |
Best Human hit | homeobox protein Meis1 (5e-59) |
Best NR hit (blastp)   | homothorax, isoform F [Drosophila melanogaster] (3e-74) |
Best NR hit (blastx)   | homothorax homeobox protein [Aedes aegypti] (1e-72) |
GeneOntology terms    | GO:0007422 peripheral nervous system development; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0005634 nucleus; GO:0006357 regulation of transcription from RNA polymerase II promoter; GO:0003677 DNA binding; GO:0007479 leg disc proximal/distal pattern formation; GO:0007420 brain development; GO:0048749 compound eye development; GO:0007432 salivary gland boundary specification; GO:0010092 specification of organ identity; GO:0009954 proximal/distal pattern formation; GO:0001752 compound eye photoreceptor fate commitment; GO:0043565 sequence-specific DNA binding; GO:0007380 specification of segmental identity, head; GO:0007480 imaginal disc-derived leg morphogenesis; GO:0035282 segmentation; GO:0048735 haltere morphogenesis; GO:0007476 imaginal disc-derived wing morphogenesis; GO:0007383 specification of segmental identity, antennal segment; GO:0010552 positive regulation of gene-specific transcription from RNA polymerase II promoter |
InterPro families   | ND |
Orthology group | MCL10435 |
Nucleotide sequence:
ATGCCGTTTAATGTACATCAGCTGGCGGCGGCGACGCGTCGAGGAGTGGCGCGGTCGAGC
GGAGGGCGCCGGCGGGGTGGTGCGGCGGAGTGCGGCGAGCGGGGGGCGGGCACACGCCCC
GCGGTGGCCCGGCTCGCGTCAGCCGCGCCAGTGTCCGCTAGACTGCCGAGCGCGCCGGGC
CTCCCGCTCCCCGCGCCTCGCTCGGCCACGACGCGTCCCGTACGACACCTTCCAAACCTC
GACCGTCGCGAGTGCATTATGACACTAACAGCCCGGGACAACAAAACGTTCGTGGTGGCC
AGCGGAGCCTTGCACCCGGGCTCGGGCCGAGCTTTCATGGCTCAGCCTAGGTACGACGAG
AGCCTTCACGGCGGGGGCTACATGGAAGGCGGCGCCATGTACCACGAGCACCGTCTCTCC
CACCCTCACATCCCGCCGGTACACTACCCTCCGCCCGCTGCGCCGGCGCACGCGCTGCCA
GGCGAACCCCTCGTGCACAAACGAGACAAGGACGCCATTTATGGCCATCCCCTCTTCCCC
CTACTGGCGCTGATCTTTGAGAAGTGCGAGTTGGCGACTTGCACGCCTCGCGACCCTGGC
GTGGCGGGAGGAGATGTCTGCTCCTCAGAGTCCTTCAACGAAGATATCGCGGTCTTCAGT
AAACAGATACGCCAAGAAAAACCATATTACATTGCGGACCCAGAGGTTGACTCTTTAATG
GTGCAAGCGATACAAGTCTTACGGTTTCACTTGTTAGAATTAGAAAAAGTGCACGAGTTA
TGCGACAACTTCTGCCACCGCTACATCAGCTGTCTGAAGGGCAAGATGCCCATCGACCTG
GTGATCGACGAGCGAGAGACGCGGCCGCCGGACAACGGCGAGCGCTCCGCCCCTGACAGC
AGCCACGACGGCGCCTCCACTCCGGACGTCGTGAGTACCCCCTATAGCAAACACCTGATT
CAGAGGCGCGATCCCCAGCTAGCACCCAATTCGACCACGAGCTATCTCGTGCACCCCTGC
TCTTAG
Protein sequence:
MPFNVHQLAAATRRGVARSSGGRRRGGAAECGERGAGTRPAVARLASAAPVSARLPSAPG
LPLPAPRSATTRPVRHLPNLDRRECIMTLTARDNKTFVVASGALHPGSGRAFMAQPRYDE
SLHGGGYMEGGAMYHEHRLSHPHIPPVHYPPPAAPAHALPGEPLVHKRDKDAIYGHPLFP
LLALIFEKCELATCTPRDPGVAGGDVCSSESFNEDIAVFSKQIRQEKPYYIADPEVDSLM
VQAIQVLRFHLLELEKVHELCDNFCHRYISCLKGKMPIDLVIDERETRPPDNGERSAPDS
SHDGASTPDVVSTPYSKHLIQRRDPQLAPNSTTSYLVHPCS
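
The record can be sanity-checked by confirming that the nucleotide sequence translates to the listed protein: a CDS of 1026 nt corresponds to 342 codons, that is, 341 residues plus the terminal stop codon. The following is a minimal Python sketch of that check, assuming the standard genetic code. The helper names (`clean`, `translate`, `expected_protein_length`) are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here;
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove line breaks and spaces from a pasted sequence block, uppercase it."""
    return "".join(seq.split()).upper()


def translate(cds):
    """Translate an in-frame CDS codon by codon; stop codons are shown as '*'."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length):
    """Number of residues for a CDS that ends with a single stop codon."""
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First 21 nt and last 6 nt of the nucleotide sequence in this record.
    print(translate("ATGCCGTTTAATGTACATCAG"))  # MPFNVHQ
    print(translate("TCTTAG"))                 # S*
    print(expected_protein_length(1026))       # 341
```

Pasting the full nucleotide block into `translate` and dropping the trailing `*` should reproduce the protein sequence above, provided the standard-code assumption holds.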