New model in OGS2.0 | DPOGS205542  |
---|---|
Genomic Position | scaffold1981:- 26315-30646 |
See gene structure | |
CDS Length | 1146 |
Paired RNAseq reads   | 14 |
Single RNAseq reads   | 72 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000091 (1e-141) |
Best Drosophila hit   | Six4, isoform B (3e-82) |
Best Human hit | homeobox protein SIX4 (9e-72) |
Best NR hit (blastp)   | AGAP011065-PA [Anopheles gambiae str. PEST] (2e-90) |
Best NR hit (blastx)   | six/sine homebox transcription factors [Culex quinquefasciatus] (2e-89) |
GeneOntology terms    | GO:0003702 RNA polymerase II transcription factor activity GO:0005634 nucleus GO:0045449 regulation of transcription GO:0008406 gonad development GO:0007520 myoblast fusion GO:0003700 sequence-specific DNA binding transcription factor activity GO:0043565 sequence-specific DNA binding GO:0006355 regulation of transcription, DNA-dependent GO:0007498 mesoderm development GO:0007503 fat body development |
InterPro families    | IPR009057 Homeodomain-like IPR001356 Homeobox IPR012287 Homeodomain-related IPR017970 Homeobox, conserved site |
Orthology group | MCL13219 |
Nucleotide sequence:
ATGGAGTCGTGTTCTGACCGGTTGAGTCCATCCAGCGATATGACAAACGAATCCGAGACA
TCGCTGCCATCATTTCCATACGAACAGCAGAGTGGGAACTTCTATCAGCCGGAGATAAAC
GAAAAACAATACTTTTCCTGCAAACAGAAGTCACCGCGCGACGACAGGAAAGAGGTTTAC
TTACAGGAGAGCAATAAAAACTCCTTAGAGAACTACTTGCCGGCGAGGAGCAAAAAGTAT
TTAAATTTCGAACTCAAATTGCCGCCGATCAATGACAACTTTTTCTCTCAAATTGACAGC
GATGACAGGACGATGCGTTCAGCGTACTTCAGTGACATGAACAAATGCGTTCAGAATGAG
AACAATCCGAACATGCAAGAATTGGATATAGATTTAAACAATCAGTTTGAAAACGTTAAC
AGTGACAGAGAGAGGGTCCAACAATCTTATTTTGCGGATAATGAAAGTGGAAATATGCGA
AGATGTTTAAACTTCAACTCCGAACAGGTTCAATGCGTGTGTGAGGCCCTCCAGCAAAAA
GGCGATATAGAAAAATTGGCGGCATTCCTATGGAGTCTACCACCGAGTGAATTATTAAGA
GGAAATGAAACCGTTCTCAGAGCCCGCGCTTTGGTGGCGTATCATCGCGGCGTATTTCAG
GAGTTGTACGCCATATTGGAGACGCACACATTCTCACCTCGTCACCACACCGATCTCCAG
AACCTTTGGTTTAAAGCGCACTATAAGGAAGCCCAGAAAGTCAGAGGAAGACCGCTTGGA
GCTGTTGATAAATACCGTCTTCGCAAGAAGTATCCCTTGCCAAAGACGATCTGGGATGGT
GAAGAGACGGTGTACTGCTTTAAGGAAAAGTCGAGAAATGCGCTGAAAGACTGTTACTAT
AGAAACCGTTACCCAACTCCAGACGAAAAACGTGCGCTCGCACAAAAAACAGGCTTGACA
TTAACACAAGTGTCAAATTGGTTCAAGAACCGACGCCAGAGGGATAGGACACCGCAGCAA
CCGAATAGACCTGAAATGATGGTTCCGGCTCAATATGTTGGTTCGCAGCCGGGTTTGGCG
CAATCTTTTCTTCCCAATGCCTACTATAAGCTTCAAGAATCTCACTATTTACACGGAAAT
CCTTGA
Protein sequence:
MESCSDRLSPSSDMTNESETSLPSFPYEQQSGNFYQPEINEKQYFSCKQKSPRDDRKEVY
LQESNKNSLENYLPARSKKYLNFELKLPPINDNFFSQIDSDDRTMRSAYFSDMNKCVQNE
NNPNMQELDIDLNNQFENVNSDRERVQQSYFADNESGNMRRCLNFNSEQVQCVCEALQQK
GDIEKLAAFLWSLPPSELLRGNETVLRARALVAYHRGVFQELYAILETHTFSPRHHTDLQ
NLWFKAHYKEAQKVRGRPLGAVDKYRLRKKYPLPKTIWDGEETVYCFKEKSRNALKDCYY
RNRYPTPDEKRALAQKTGLTLTQVSNWFKNRRQRDRTPQQPNRPEMMVPAQYVGSQPGLA
QSFLPNAYYKLQESHYLHGNP