New model in OGS2.0 | DPOGS208685  |
---|---|
Genomic Position | scaffold721:- 33980-36129 |
CDS Length | 1803 bp |
Paired RNAseq reads   | 81 |
Single RNAseq reads   | 224 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003334 (0.0) |
Best Drosophila hit   | hunchback, isoform A (2e-66) |
Best Human hit | zinc finger protein 142 (5e-13) |
Best NR hit (blastp)   | RecName: Full=Protein hunchback (2e-172) |
Best NR hit (blastx)   | RecName: Full=Protein hunchback (5e-158) |
GeneOntology terms    | GO:0045941 positive regulation of transcription; GO:0003677 DNA binding; GO:0016563 transcription activator activity; GO:0008293 torso signaling pathway; GO:0007362 terminal region determination; GO:0003704 specific RNA polymerase II transcription factor activity; GO:0007355 anterior region determination; GO:0005634 nucleus; GO:0007424 open tracheal system development; GO:0001763 morphogenesis of a branching structure; GO:0007400 neuroblast fate determination; GO:0007431 salivary gland development; GO:0008595 anterior/posterior axis specification, embryo; GO:0007354 zygotic determination of anterior/posterior axis, embryo; GO:0040034 regulation of development, heterochronic; GO:0007419 ventral cord development; GO:0007402 ganglion mother cell fate determination; GO:0007417 central nervous system development; GO:0042659 regulation of cell fate specification; GO:0045449 regulation of transcription; GO:0007427 epithelial cell migration, open tracheal system; GO:0005622 intracellular; GO:0008270 zinc ion binding; GO:0035290 trunk segmentation; GO:0035289 posterior head segmentation |
InterPro families    | IPR015880 Zinc finger, C2H2-like; IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding; IPR007087 Zinc finger, C2H2-type |
Orthology group | MCL16078 |
Nucleotide sequence:
ATGGCCGCGCCGCACGCGCAGCCCTGGGGCTCACTCTTACAACAACCCATTAAATCGGAA
CCGATGGAAGACGGAAGTTTCTCAAAGGAACAAGCGAGCGGTTTTTACTCAGAAGGCTTT
CATAGCGCATCGCCTTCGTCATCCAGCAAGGACTCCAATGGGCACTCGCCGCGCAGCGTC
GGTAGTTCCGGAGAACCGTCCCCTTTCTACGATAATATGCCTTTAAAAGCAAAGGCTAAC
CTCGGGATGCATTTGGAATCATACCGTAACGGCTTGCCCTACAGTCTGCTCACGCCGCCT
GGGTTTGAAAACCGACATAACGAAGAACACGAACAGTCGCCATACAGCTCGTATTCCCCC
CGGTCTATAGCACCACCGGCTCACGTTTCCACACCTTTAGCCCGCGCTGATGCCACACCG
CCGAAGTCTCCGCCGCAAACTCCATCATCCCCTCTTAGAGAACATGAAAGAAGAGCATTC
GAAAGATTCCACGACTCTGGTTTCGACGGCATCGGACAAAATAAATCTGATGCTGACGAC
GGTCGGGATGGATCTGGACTCGAAGAAGATTTTGATGAAGAGCCCGGCCTACGCGTGCCG
GCGGTCAACTCGCACGGAAAAGTCAAAACATTCAAATGCAAACAATGCGAATTTGTTGCT
GTTACAAAATTGAGTTTCTGGGAGCACAGCAAAGAGCACATCAAGCCCGAAAAAATGCTT
ACGTGTAGAAAGTGCCCATTCGTCACTGAATACAAACACCATCTCGAATATCACATGAGA
AACCATTTAGGCTCCAAGCCCTTCCAATGTTCTCAGTGCTCTTACTCTTGCGTCAACAAA
TCTATGTTAAATTCACACCTGAAATCTCATTCAAACATCTACCAATACAGATGCGCTGAT
TGTAACTATGCTACCAAATACTGTCATTCTCTCAAACTTCACCTACGGAAGTATAAACAC
AACCCAGCGATGGTGCTGAACATGGATGGTACACCGAACCCTTTGCCAATAATCGATGTG
TACGGAACACGACGTGGTCCTAAACAAAAGCCATTAATGAAGATGTACGATCAGCAGCAG
ATGAATAACAAGCCACAACCTCTCCCACCCCAGCATCCAATTTTCGGAAATCACTTCCCG
GTGAATCTGCCATACTTACCGCCACTTCTGCCACACTCGTTCTTGTTTCCGCCAAATAAT
AATTACGAACAGAGGACGTCGCCTAAAGTGACTGAAACATCGGTTGAAAATCAGCCATCT
ACTTCACCTCAATCGATATTACAACAGCGCTTGTCTTATGGCGAATATCCTTCGGAAGCA
GGTGCCACGCCACCACCAACTAAATCACCCACAATCTTACCACAAACCCCCACAAAACGT
ACGCTGACACCACCTCAAACCACTGACGCTCTTGACTTGACAAATACCAAAACGAGCGAG
GCAGGATCGCCTCCGCCCATAGAACCACCAGCGCCTGTCACGCCCACAACGGCCTTGAAG
AACAGAAGAAAAGGAAGAGCATTCAAACTCCAACCAGCAGCTTTGAGATTACAGCATGAA
GATACTAAAATGGAGGCGGACAACTCGGATTCGGAATCCGACGCTTCAGCTGAACCAACA
CCGAGTGCGCCAACATCGTACACCTGCCAATACTGCGACATAACGTTCGGGGATCTCACC
ATGCACACCATACACATGGGTTTCCATGGATACAACGATCCCTTCATGTGTAACAAATGC
GGCGAAAGAAGCTCCGACCGCATAGCTTTCTTCATACACTTAGGACGCGCCCAGCATGCC
TAA
Protein sequence:
MAAPHAQPWGSLLQQPIKSEPMEDGSFSKEQASGFYSEGFHSASPSSSSKDSNGHSPRSV
GSSGEPSPFYDNMPLKAKANLGMHLESYRNGLPYSLLTPPGFENRHNEEHEQSPYSSYSP
RSIAPPAHVSTPLARADATPPKSPPQTPSSPLREHERRAFERFHDSGFDGIGQNKSDADD
GRDGSGLEEDFDEEPGLRVPAVNSHGKVKTFKCKQCEFVAVTKLSFWEHSKEHIKPEKML
TCRKCPFVTEYKHHLEYHMRNHLGSKPFQCSQCSYSCVNKSMLNSHLKSHSNIYQYRCAD
CNYATKYCHSLKLHLRKYKHNPAMVLNMDGTPNPLPIIDVYGTRRGPKQKPLMKMYDQQQ
MNNKPQPLPPQHPIFGNHFPVNLPYLPPLLPHSFLFPPNNNYEQRTSPKVTETSVENQPS
TSPQSILQQRLSYGEYPSEAGATPPPTKSPTILPQTPTKRTLTPPQTTDALDLTNTKTSE
AGSPPPIEPPAPVTPTTALKNRRKGRAFKLQPAALRLQHEDTKMEADNSDSESDASAEPT
PSAPTSYTCQYCDITFGDLTMHTIHMGFHGYNDPFMCNKCGERSSDRIAFFIHLGRAQHA
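
The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the sequence lines have been copied into a Python string. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. Only the first and last stretches of the CDS are used here as examples.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one terminal stop codon if present."""
    # Remove line breaks and spaces so a pasted multi-line block works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 60 nt of the CDS listed in this record.
first_line = "ATGGCCGCGCCGCACGCGCAGCCCTGGGGCTCACTCTTACAACAACCCATTAAATCGGAA"
print(translate(first_line))

# Last 12 nt of the CDS, including the TAA stop codon.
print(translate("CAGCATGCCTAA"))

# A CDS of 1803 nt holds 601 codons, i.e. 600 residues plus the stop.
print(1803 // 3 - 1)
```

Applied to the full 1803 nt sequence, the same function should return a 600-residue string that can be compared directly with the protein sequence listed in this record.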