New model in OGS2.0 | DPOGS216048  |
---|---|
Genomic Position | scaffold665:+ 14523-28815 |
See gene structure | |
CDS Length | 1416 |
Paired RNAseq reads   | 32 |
Single RNAseq reads   | 124 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009025 (0.0) |
Best Drosophila hit   | gooseberry (2e-101) |
Best Human hit | paired box protein Pax-3 isoform PAX3i (2e-89) |
Best NR hit (blastp)   | gooseberry [Tribolium castaneum] (2e-147) |
Best NR hit (blastx)   | gooseberry [Tribolium castaneum] (3e-125) |
GeneOntology terms    | GO:0045449 regulation of transcription GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005667 transcription factor complex GO:0007367 segment polarity determination GO:0005634 nucleus GO:0007419 ventral cord development GO:0043565 sequence-specific DNA binding GO:0006355 regulation of transcription, DNA-dependent GO:0007435 salivary gland morphogenesis GO:0005737 cytoplasm |
InterPro families    | IPR017970 Homeobox, conserved site IPR001523 Paired box protein, N-terminal IPR009057 Homeodomain-like IPR001356 Homeobox IPR011991 Winged helix-turn-helix transcription repressor DNA-binding IPR012287 Homeodomain-related |
Orthology group | MCL11701 |
Nucleotide sequence:
ATGGCTGTATCCACATTAAACATGACTCCCTATTTCACTGGATACTCTTTCCAAGGACAA
GGACGTATGAATCAGTTAGGCGGAGTCTTCATTAACGGACGTCCTCTGCCGAACCACATC
CGTCTAAAAATCGTGGAGATGGCAGCGGCGGGTGTTCGGCCTTGTGTCATCTCAAGACAG
TTGAGGGTCTCCCACGGCTGTGTCTCTAAAATACTCAATAGATACCAGGAAACTGGATCT
ATTCGCCCTGGTGTGATAGGGGGATCGAAGCCGAGAGTAGCCACGCCCGAAGTTGAAAAC
AGGATCGAAGAATTGAAGAGACAAAACCCAGGTATATTTTCCTGGGAAATTAGAGATAAA
TTAATAAAAGAAGGCATATGTGATAAAAACACCGCGCCATCAGTAAGTTCGATTTCGCGA
CTCATAAGAGGAGGCAAAAGGGACGAATCAGATCCTAGAAGGAACCACAGTATAGATGGT
ATTCTTGGACCATCCTCATCGTGTGAGGATAGTGATACGGAGTCTGAGCCTGGTATAACG
TTAAAAAGAAAACAACGAAGATCTAGAACAACCTTTTCTGGAGATCAACTTGAAGCTCTC
GAGCGCGCTTTCACGCGAACCCAATACCCAGACGTTTACACTCGTGAAGAATTAGCTCAA
AAGACGAAGTTGACCGAAGCACGTGTTCAGGTATGGTTCTCAAACCGAAGAGCACGTCTT
CGCAAACAACTGAACTCACAACAATTGAGTGCTTTTAATACAATGTCTTTACAATCTGCA
TTTCCCTCCGTTCATCAACAATACGAACCACCAACAACATTTAATGCACAGTGCGCGTCG
TGGCAACAATCATATTCTGCTTTGGGTAGCAGTTCTGTTCTGAATTCTGCTTTGGCACCT
TCTTTACATCAGTCATCATTATCAGCTCCTTCTGTTTGTCAATCTGCTCTTACAGCACCA
TCCCTCCATCCTCCCACATCGAGTTCATATTCATCAGGAAACTTGACGCCATTGTCACAT
TCATCTGAACTGCCTACACCTATACAAGCCTCTACTGATGCTACACCACCAAGTTCAAGC
CCAATCACTTCCAGCCCTGCTGGAAATCAAAGCGGTGGCATCACTTACCAACATCCAACT
TACACTAATACTAGTGATGTAGTTTCACACCCATACGGCTATGGTGACTACGCAAAACAA
GAACATATGTCAGCACATAACCACTGGACTTCAAGACAACTGAGTGGACATTCACAAAAT
AAATTAGCAGAAGTCAGCGCTTGGCCAGAAAATTATAGTTCATTCTTTGGCGCGAATACT
CACTATGCGTCGCACGCGCATTCACCTAGTGAAGCAAAGTCTGGTTATCCCTATATAGGA
CAGCTTGGCGGAATGGATATGGGGAGAGTTCATTGA
Protein sequence:
MAVSTLNMTPYFTGYSFQGQGRMNQLGGVFINGRPLPNHIRLKIVEMAAAGVRPCVISRQ
LRVSHGCVSKILNRYQETGSIRPGVIGGSKPRVATPEVENRIEELKRQNPGIFSWEIRDK
LIKEGICDKNTAPSVSSISRLIRGGKRDESDPRRNHSIDGILGPSSSCEDSDTESEPGIT
LKRKQRRSRTTFSGDQLEALERAFTRTQYPDVYTREELAQKTKLTEARVQVWFSNRRARL
RKQLNSQQLSAFNTMSLQSAFPSVHQQYEPPTTFNAQCASWQQSYSALGSSSVLNSALAP
SLHQSSLSAPSVCQSALTAPSLHPPTSSSYSSGNLTPLSHSSELPTPIQASTDATPPSSS
PITSSPAGNQSGGITYQHPTYTNTSDVVSHPYGYGDYAKQEHMSAHNHWTSRQLSGHSQN
KLAEVSAWPENYSSFFGANTHYASHAHSPSEAKSGYPYIGQLGGMDMGRVH