Genomic Position | scaffold2167:- 1982-13961 |
---|---|
See gene structure | |
CDS Length | 1239 |
Paired RNAseq reads   | 39 |
Single RNAseq reads   | 117 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003265 (2e-53) |
Best Drosophila hit   | extra-extra (8e-43) |
Best Human hit | motor neuron and pancreas homeobox protein 1 isoform 1 (5e-31) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC009461 [Tribolium castaneum] (5e-51) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC009461 [Tribolium castaneum] (3e-46) |
GeneOntology terms    | GO:0045449 regulation of transcription GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus GO:0007417 central nervous system development GO:0007412 axon target recognition GO:0007399 nervous system development GO:0043565 sequence-specific DNA binding GO:0006355 regulation of transcription, DNA-dependent |
InterPro families    | IPR009057 Homeodomain-like IPR017970 Homeobox, conserved site IPR020479 Homeobox, eukaryotic IPR012287 Homeodomain-related IPR001356 Homeobox |
Orthology group | MCL15124 |
Nucleotide sequence:
ATGCTGTCATCGACGATGTTCCGACGGGATTCACAGGCGTCGGAGAAACCGCTGGACATA
GACCAGGTCTCCAACCCCGACCGCAGCAGGTCAAACAGTCCAAAGAACATGAGGTACTAT
AAAGAAGATGACAAGACGAACGTCAGTGCCAAGAAATCCTTCTGCATAGACGCGCTGCTA
TCAAAACATGTTGAACCAGAGCAGACTATACAGGAGATGGCGCAGCAGAAGTATCTAGCC
CTGATACAGAACAAGAATTACATCACCAGATACCAGAACGAGAATTACGACACGCAAATC
GAGCAGAATACTATAAGCAAGTGTCAGAATAGACCTCCCAGCCAGAACTTCGATAGGGCC
AGCAGTCCCAGGTCTCTGGCGAGCAGCCCCAGGAGCGGCAGTCCGGGGTCTGAAGACGGG
AACCGTTCAGATAACTACAGTCCACCGATTTCGCCTGGAGTGGAAGGCGGCACTGACGAA
TATGATAACTATAGCGATAAAATCATCAAAAAATATAGTTATTCTCGTTCTGAAGACTTA
TGTTTTCGTTTTTCTGTAGGCATAATCCCGAAACCCGGTCTACTGCAGCCTCCGATGTCC
CAACCCCTGCTGTATCCGCCTCAGATGATGCAGTCGGCTTTCCATCACCCCGGAGACAGA
GTCCTGCATCAGATGCAGCTGGAATGGCTGGCTCGGACTGGAATGTTCTACCACCGGATA
CCAGAGCTGGGAGGTGCCCCTCACGCTCTTCTCGGTAAGACAAGAAGACCACGGACCGCG
TTCACGTCACAGCAGTTGTTGGAACTTGAGAAGCAGTTCCGCATGAACAAATACCTGTCC
AGACCAAAGAGGTTCGAAGTAGCCACCAGTTTGATGCTCACAGAGACACAGGTGAAAATA
TGGTTCCAAAACCGTAGAATGAAGTGGAAGCGGTCGAAAAAAGCACAACAGGATACGAAG
ATCAAAGAACCGCAGAGCCACGACGACAAAAACAAAACTAAGGACGTACCAGTGGCTGAA
CACGACAAACAACCTTCACAGCACATAGCGGCAGATTTGACATCTGTCAAGCCGGCCCCG
CTGTTAGACAGAGACAGAATCATCGCGCTAGAGAGGGAGAGAGCGATGGCCGCCGCCAAT
TTCAATTCGAATTTGGAAAACAATAGACGCGGATTGGTCGTCATGAACCAAGATGGCGGC
CGGCCCGGCATTGACATGTTCAGGCCTTACGTAGTATGA
Protein sequence:
MLSSTMFRRDSQASEKPLDIDQVSNPDRSRSNSPKNMRYYKEDDKTNVSAKKSFCIDALL
SKHVEPEQTIQEMAQQKYLALIQNKNYITRYQNENYDTQIEQNTISKCQNRPPSQNFDRA
SSPRSLASSPRSGSPGSEDGNRSDNYSPPISPGVEGGTDEYDNYSDKIIKKYSYSRSEDL
CFRFSVGIIPKPGLLQPPMSQPLLYPPQMMQSAFHHPGDRVLHQMQLEWLARTGMFYHRI
PELGGAPHALLGKTRRPRTAFTSQQLLELEKQFRMNKYLSRPKRFEVATSLMLTETQVKI
WFQNRRMKWKRSKKAQQDTKIKEPQSHDDKNKTKDVPVAEHDKQPSQHIAADLTSVKPAP
LLDRDRIIALERERAMAAANFNSNLENNRRGLVVMNQDGGRPGIDMFRPYVV