DPGLEAN11398 in OGS1.0

New model in OGS2.0DPOGS208685 
Genomic Positionscaffold721:- 33980-36129
See gene structure
CDS Length1803
Paired RNAseq reads  81
Single RNAseq reads  224
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003334 (0.0)
Best Drosophila hit  hunchback, isoform A (2e-66)
Best Human hitzinc finger protein 142 (5e-13)
Best NR hit (blastp)  RecName: Full=Protein hunchback (2e-172)
Best NR hit (blastx)  RecName: Full=Protein hunchback (5e-158)
GeneOntology terms























  
GO:0045941 positive regulation of transcription
GO:0003677 DNA binding
GO:0016563 transcription activator activity
GO:0008293 torso signaling pathway
GO:0007362 terminal region determination
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007355 anterior region determination
GO:0005634 nucleus
GO:0007424 open tracheal system development
GO:0001763 morphogenesis of a branching structure
GO:0007400 neuroblast fate determination
GO:0007431 salivary gland development
GO:0008595 anterior/posterior axis specification, embryo
GO:0007354 zygotic determination of anterior/posterior axis, embryo
GO:0040034 regulation of development, heterochronic
GO:0007419 ventral cord development
GO:0007402 ganglion mother cell fate determination
GO:0007417 central nervous system development
GO:0042659 regulation of cell fate specification
GO:0045449 regulation of transcription
GO:0007427 epithelial cell migration, open tracheal system
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0035290 trunk segmentation
GO:0035289 posterior head segmentation
InterPro families

  
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR007087 Zinc finger, C2H2-type
Orthology groupMCL16078

Nucleotide sequence:

ATGGCCGCGCCGCACGCGCAGCCCTGGGGCTCACTCTTACAACAACCCATTAAATCGGAA
CCGATGGAAGACGGAAGTTTCTCAAAGGAACAAGCGAGCGGTTTTTACTCAGAAGGCTTT
CATAGCGCATCGCCTTCGTCATCCAGCAAGGACTCCAATGGGCACTCGCCGCGCAGCGTC
GGTAGTTCCGGAGAACCGTCCCCTTTCTACGATAATATGCCTTTAAAAGCAAAGGCTAAC
CTCGGGATGCATTTGGAATCATACCGTAACGGCTTGCCCTACAGTCTGCTCACGCCGCCT
GGGTTTGAAAACCGACATAACGAAGAACACGAACAGTCGCCATACAGCTCGTATTCCCCC
CGGTCTATAGCACCACCGGCTCACGTTTCCACACCTTTAGCCCGCGCTGATGCCACACCG
CCGAAGTCTCCGCCGCAAACTCCATCATCCCCTCTTAGAGAACATGAAAGAAGAGCATTC
GAAAGATTCCACGACTCTGGTTTCGACGGCATCGGACAAAATAAATCTGATGCTGACGAC
GGTCGGGATGGATCTGGACTCGAAGAAGATTTTGATGAAGAGCCCGGCCTACGCGTGCCG
GCGGTCAACTCGCACGGAAAAGTCAAAACATTCAAATGCAAACAATGCGAATTTGTTGCT
GTTACAAAATTGAGTTTCTGGGAGCACAGCAAAGAGCACATCAAGCCCGAAAAAATGCTT
ACGTGTAGAAAGTGCCCATTCGTCACTGAATACAAACACCATCTCGAATATCACATGAGA
AACCATTTAGGCTCCAAGCCCTTCCAATGTTCTCAGTGCTCTTACTCTTGCGTCAACAAA
TCTATGTTAAATTCACACCTGAAATCTCATTCAAACATCTACCAATACAGATGCGCTGAT
TGTAACTATGCTACCAAATACTGTCATTCTCTCAAACTTCACCTACGGAAGTATAAACAC
AACCCAGCGATGGTGCTGAACATGGATGGTACACCGAACCCTTTGCCAATAATCGATGTG
TACGGAACACGACGTGGTCCTAAACAAAAGCCATTAATGAAGATGTACGATCAGCAGCAG
ATGAATAACAAGCCACAACCTCTCCCACCCCAGCATCCAATTTTCGGAAATCACTTCCCG
GTGAATCTGCCATACTTACCGCCACTTCTGCCACACTCGTTCTTGTTTCCGCCAAATAAT
AATTACGAACAGAGGACGTCGCCTAAAGTGACTGAAACATCGGTTGAAAATCAGCCATCT
ACTTCACCTCAATCGATATTACAACAGCGCTTGTCTTATGGCGAATATCCTTCGGAAGCA
GGTGCCACGCCACCACCAACTAAATCACCCACAATCTTACCACAAACCCCCACAAAACGT
ACGCTGACACCACCTCAAACCACTGACGCTCTTGACTTGACAAATACCAAAACGAGCGAG
GCAGGATCGCCTCCGCCCATAGAACCACCAGCGCCTGTCACGCCCACAACGGCCTTGAAG
AACAGAAGAAAAGGAAGAGCATTCAAACTCCAACCAGCAGCTTTGAGATTACAGCATGAA
GATACTAAAATGGAGGCGGACAACTCGGATTCGGAATCCGACGCTTCAGCTGAACCAACA
CCGAGTGCGCCAACATCGTACACCTGCCAATACTGCGACATAACGTTCGGGGATCTCACC
ATGCACACCATACACATGGGTTTCCATGGATACAACGATCCCTTCATGTGTAACAAATGC
GGCGAAAGAAGCTCCGACCGCATAGCTTTCTTCATACACTTAGGACGCGCCCAGCATGCC
TAA

Protein sequence:

MAAPHAQPWGSLLQQPIKSEPMEDGSFSKEQASGFYSEGFHSASPSSSSKDSNGHSPRSV
GSSGEPSPFYDNMPLKAKANLGMHLESYRNGLPYSLLTPPGFENRHNEEHEQSPYSSYSP
RSIAPPAHVSTPLARADATPPKSPPQTPSSPLREHERRAFERFHDSGFDGIGQNKSDADD
GRDGSGLEEDFDEEPGLRVPAVNSHGKVKTFKCKQCEFVAVTKLSFWEHSKEHIKPEKML
TCRKCPFVTEYKHHLEYHMRNHLGSKPFQCSQCSYSCVNKSMLNSHLKSHSNIYQYRCAD
CNYATKYCHSLKLHLRKYKHNPAMVLNMDGTPNPLPIIDVYGTRRGPKQKPLMKMYDQQQ
MNNKPQPLPPQHPIFGNHFPVNLPYLPPLLPHSFLFPPNNNYEQRTSPKVTETSVENQPS
TSPQSILQQRLSYGEYPSEAGATPPPTKSPTILPQTPTKRTLTPPQTTDALDLTNTKTSE
AGSPPPIEPPAPVTPTTALKNRRKGRAFKLQPAALRLQHEDTKMEADNSDSESDASAEPT
PSAPTSYTCQYCDITFGDLTMHTIHMGFHGYNDPFMCNKCGERSSDRIAFFIHLGRAQHA