New model in OGS2.0 | DPOGS210898  |
---|---|
Genomic Position | scaffold70:- 46814-47422 |
See gene structure | |
CDS Length | 609 |
Paired RNAseq reads   | 15 |
Single RNAseq reads   | 55 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003079 (3e-67) |
Best Drosophila hit   | dichaete (2e-29) |
Best Human hit | transcription factor SOX-2 (8e-27) |
Best NR hit (blastp)   | AGAP010919-PA [Anopheles gambiae str. PEST] (2e-38) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP010919-PA [Tribolium castaneum] (3e-28) |
GeneOntology terms    | GO:0007350 blastoderm segmentation GO:0007417 central nervous system development GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0016563 transcription activator activity GO:0003677 DNA binding GO:0008301 DNA bending activity GO:0045893 positive regulation of transcription, DNA-dependent GO:0005737 cytoplasm GO:0005634 nucleus GO:0045449 regulation of transcription GO:0002168 instar larval development GO:0035120 post-embryonic appendage morphogenesis GO:0007442 hindgut morphogenesis GO:0007420 brain development GO:0003730 mRNA 3'-UTR binding GO:0009950 dorsal/ventral axis specification GO:0060810 intracellular mRNA localization involved in pattern specification process GO:0043565 sequence-specific DNA binding GO:0008134 transcription factor binding GO:0010553 negative regulation of gene-specific transcription from RNA polymerase II promoter GO:0010551 regulation of gene-specific transcription from RNA polymerase II promoter |
InterPro families    | IPR000910 High mobility group, HMG1/HMG2 IPR009071 High mobility group, superfamily |
Orthology group | MCL16975 |
Nucleotide sequence:
ATGAACGCCTTCATGGTCTGGTCCAGGCTACAACGTCGCCAGATCGCCAAGGATAATCCG
AAGATGCACAATTCGGAGATATCGAAACGGCTCGGAGCCGAATGGAAGCTGTTGTCTGAA
ATGCAGAAGAGACCATTCATTGACGAAGCAAAACGACTCCGAGCTCTCCATATGAAAGAG
CACCCCGACTACAAGTACCGGCCACGAAGGAAGCCCAAGCCACCGACAGCGGGAGGAGCT
CCGGGCGCTGGAGCTTTCCCGAGCTTTCCACTGCCTTACTTCGCGGGTCCAGCACCGACC
GTTGGACCGTTGGACGCCCTTTCGTATTCGGCAGTGCCTCCATACTTCCCACATCAGCTG
GATCACTTGCAATTCTCAAAACTAATGGCTCCGACCGAGAAGTTGCCGACGGCATCTTCA
GCTGCCGCTGTGGTGTCGTCGTTCTATTCATCACTCTACACACAGCCGGCCGCACCTCCG
AAGCCGTTCCCATCTCCTCTGTTCCACCAGTACGGAGCAGCGCCGGCTTCTCCTGTGTCT
CCGGTGACTTCCACGCAGCACAGCCCTCATGACGACCAGCTCAGGCGGCCGGTTTCAGTT
ATATATTGA
Protein sequence:
MNAFMVWSRLQRRQIAKDNPKMHNSEISKRLGAEWKLLSEMQKRPFIDEAKRLRALHMKE
HPDYKYRPRRKPKPPTAGGAPGAGAFPSFPLPYFAGPAPTVGPLDALSYSAVPPYFPHQL
DHLQFSKLMAPTEKLPTASSAAAVVSSFYSSLYTQPAAPPKPFPSPLFHQYGAAPASPVS
PVTSTQHSPHDDQLRRPVSVIY