DPGLEAN19696 in OGS1.0

New model in OGS2.0DPOGS210898 
Genomic Positionscaffold70:- 46814-47422
See gene structure
CDS Length609
Paired RNAseq reads  15
Single RNAseq reads  55
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003079 (3e-67)
Best Drosophila hit  dichaete (2e-29)
Best Human hittranscription factor SOX-2 (8e-27)
Best NR hit (blastp)  AGAP010919-PA [Anopheles gambiae str. PEST] (2e-38)
Best NR hit (blastx)  PREDICTED: similar to AGAP010919-PA [Tribolium castaneum] (3e-28)
GeneOntology terms




















  
GO:0007350 blastoderm segmentation
GO:0007417 central nervous system development
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0016563 transcription activator activity
GO:0003677 DNA binding
GO:0008301 DNA bending activity
GO:0045893 positive regulation of transcription, DNA-dependent
GO:0005737 cytoplasm
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0002168 instar larval development
GO:0035120 post-embryonic appendage morphogenesis
GO:0007442 hindgut morphogenesis
GO:0007420 brain development
GO:0003730 mRNA 3'-UTR binding
GO:0009950 dorsal/ventral axis specification
GO:0060810 intracellular mRNA localization involved in pattern specification process
GO:0043565 sequence-specific DNA binding
GO:0008134 transcription factor binding
GO:0010553 negative regulation of gene-specific transcription from RNA polymerase II promoter
GO:0010551 regulation of gene-specific transcription from RNA polymerase II promoter
InterPro families
  
IPR000910 High mobility group, HMG1/HMG2
IPR009071 High mobility group, superfamily
Orthology groupMCL16975

Nucleotide sequence:

ATGAACGCCTTCATGGTCTGGTCCAGGCTACAACGTCGCCAGATCGCCAAGGATAATCCG
AAGATGCACAATTCGGAGATATCGAAACGGCTCGGAGCCGAATGGAAGCTGTTGTCTGAA
ATGCAGAAGAGACCATTCATTGACGAAGCAAAACGACTCCGAGCTCTCCATATGAAAGAG
CACCCCGACTACAAGTACCGGCCACGAAGGAAGCCCAAGCCACCGACAGCGGGAGGAGCT
CCGGGCGCTGGAGCTTTCCCGAGCTTTCCACTGCCTTACTTCGCGGGTCCAGCACCGACC
GTTGGACCGTTGGACGCCCTTTCGTATTCGGCAGTGCCTCCATACTTCCCACATCAGCTG
GATCACTTGCAATTCTCAAAACTAATGGCTCCGACCGAGAAGTTGCCGACGGCATCTTCA
GCTGCCGCTGTGGTGTCGTCGTTCTATTCATCACTCTACACACAGCCGGCCGCACCTCCG
AAGCCGTTCCCATCTCCTCTGTTCCACCAGTACGGAGCAGCGCCGGCTTCTCCTGTGTCT
CCGGTGACTTCCACGCAGCACAGCCCTCATGACGACCAGCTCAGGCGGCCGGTTTCAGTT
ATATATTGA

Protein sequence:

MNAFMVWSRLQRRQIAKDNPKMHNSEISKRLGAEWKLLSEMQKRPFIDEAKRLRALHMKE
HPDYKYRPRRKPKPPTAGGAPGAGAFPSFPLPYFAGPAPTVGPLDALSYSAVPPYFPHQL
DHLQFSKLMAPTEKLPTASSAAAVVSSFYSSLYTQPAAPPKPFPSPLFHQYGAAPASPVS
PVTSTQHSPHDDQLRRPVSVIY