New model in OGS2.0 | DPOGS204692  |
---|---|
Genomic Position | scaffold533:+ 59905-97699 |
See gene structure | |
CDS Length | 1299 |
Paired RNAseq reads   | 326 |
Single RNAseq reads   | 736 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007471 (3e-30) |
Best Drosophila hit   | CG34340 (3e-63) |
Best Human hit | dorsal root ganglia homeobox protein (4e-30) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC005600 [Tribolium castaneum] (5e-68) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC005600 [Tribolium castaneum] (6e-66) |
GeneOntology terms    | GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0043565 sequence-specific DNA binding GO:0007517 muscle organ development GO:0048813 dendrite morphogenesis |
InterPro families    | IPR001356 Homeobox IPR003654 Paired-like homeodomain protein, OAR IPR012287 Homeodomain-related IPR017970 Homeobox, conserved site IPR009057 Homeodomain-like |
Orthology group | MCL17330 |
Nucleotide sequence:
ATGCAAATCGTGTCGGAGAGACCCGTCCGCGACGAGACTCCGTCTCGTTCATGCATCTAT
CTCGAGATCGTCGCCTCTCTAGCGCCGGATGCTGGCATCTCGGCGTGTGGGAATGAGATG
GTCTATGTTGTAGCGCGTACTAAGCTTCTGTCCGCAGTCGGTCTCGCGCCGTGGTCGCGG
GTGCGTGTGTCGGGCTATGTCAGGGTGAGCGCGCTTCTTGTCACGAGTAAGGCGGGAAAA
CTTCCCGCCCACAACGCAGCAACAATATTTTCAGACGCTAATAAGCTGGCGAGTGGGCCG
GGCGGTGGGCGCGGGTTGTTCTGCTATCATTGCCCGCCGAGCCTGCCCCCGCACCAGCAC
CGTCTTCCAACCCTGGAGTACCCCTTCACAGCATCACATCCCTACACCAGCTATTCCTAC
CACCCCGCCATCCACGATGACACTTTCGTTAGACGCAAACAGAGACGAAACAGAACCACC
TTTACATTACAGCAGCTGGAAGAGCTGGAGACGGCGTTTGCACAGACGCATTACCCGGAT
GTGTTCACTAGAGAGGATCTAGCACTCAAGATAAACCTCACCGAAGCTAGAGTTCAGGTT
TGGTTTCAAAACAGACGGGCTAAGTGGCGGAAAGCGGAAAGACTAAAAGAGGAACAGCGC
AAACGAGAGGGAGCTGAAGTTTTGGCTAAGAGGGATCCAGCGGATGATAAGGGTTCTTCG
GAGTGCGGAATGTCTCGAGGGTCTGGGGATGCATCTCCAATGTCAACTGGCGTGTCCCCT
CGCGCGTCTCCCCCGGTAACGCCAGGGTCACCTCGTCGTTCCCCCCACCGTTCACCCAAT
AGGTCACCAAGATTGGAAAGATCTGAGACCTGTGCCTCTCCCGCTCCTTCGGTTGGCAGC
GCAGGTTCCCGCGAGCCAGACCCTCGCCCGCCGCACAACATCTTCTCTCCTTTCGATCAT
GGAGCGTTCCGTTCATCAGCCCCAGGCGGTGCTGACCCGCCTCCGCTGTTCCTGCCTCCC
CATCTCTCTCATCTCTCGCAGCATCTCAACCATCTATCGCAGCCTTTCTTCCCGTTAAAA
GGTTGGGGAGCACCTTGCCCGTGTTGTCCCAAAGAAGAAGCTCGCTCAACCAGCGTGGCT
GAGTTGAGACGTAAAGCTCACGAACATTCCGCTGCGTTACTGCAATCGCTAGCAAATTTC
CAGTCGCGAGCGTTCCCGCTTCCGCTCCCGCTGCCGCCTCTGCCGCTCCCGCTGTTACAC
GAGCCACCGCCGTCGGAACCTCCCAAACATCTCGAATAA
Protein sequence:
MQIVSERPVRDETPSRSCIYLEIVASLAPDAGISACGNEMVYVVARTKLLSAVGLAPWSR
VRVSGYVRVSALLVTSKAGKLPAHNAATIFSDANKLASGPGGGRGLFCYHCPPSLPPHQH
RLPTLEYPFTASHPYTSYSYHPAIHDDTFVRRKQRRNRTTFTLQQLEELETAFAQTHYPD
VFTREDLALKINLTEARVQVWFQNRRAKWRKAERLKEEQRKREGAEVLAKRDPADDKGSS
ECGMSRGSGDASPMSTGVSPRASPPVTPGSPRRSPHRSPNRSPRLERSETCASPAPSVGS
AGSREPDPRPPHNIFSPFDHGAFRSSAPGGADPPPLFLPPHLSHLSQHLNHLSQPFFPLK
GWGAPCPCCPKEEARSTSVAELRRKAHEHSAALLQSLANFQSRAFPLPLPLPPLPLPLLH
EPPPSEPPKHLE