DPGLEAN06976 in OGS1.0

New model in OGS2.0DPOGS204843 
Genomic Positionscaffold1512:- 36135-51620
See gene structure
CDS Length1137
Paired RNAseq reads  17409
Single RNAseq reads  57177
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitND
Best Drosophila hit  cryptocephal, isoform A (7e-12)
Best Human hitcyclic AMP-dependent transcription factor ATF-4 (8e-11)
Best NR hit (blastp)  activating transcription factor of chaperone [Bombyx mori] (7e-59)
Best NR hit (blastx)  activating transcription factor of chaperone [Bombyx mori] (3e-47)
GeneOntology terms










  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0007552 metamorphosis
GO:0007591 molting cycle, chitin-based cuticle
GO:0035074 pupation
GO:0035073 pupariation
GO:0045449 regulation of transcription
GO:0046982 protein heterodimerization activity
GO:0046983 protein dimerization activity
GO:0043565 sequence-specific DNA binding
GO:0030528 transcription regulator activity
GO:0032583 regulation of gene-specific transcription
InterPro families
  
IPR004827 Basic-leucine zipper (bZIP) transcription factor
IPR011616 bZIP transcription factor, bZIP-1
Orthology groupMCL18443

Nucleotide sequence:

ATGGCGTCTTCACCGGACAGCTTTGTCCAACGACAAAAATCACCATCACCCACGGTCAGC
GATCGATCCCAAATTGAAATTGCATCCTTGGTAAGTAAAAAAGTGTTGCTATATTTACCA
CCGACAACATCCCAAACAAGTATGAGTTCGAGTCATTGGGACGAGTCGCTGGCTTCGGCC
AACAACCTTCTCACTGATAACGAGTGCTGTATACTGCTCGATGAAAGCCTATTCTATGCC
GATCAAGATATTCTTAAGAGCTTTCCGGGCGCTGCAACAAAGATCGAAACGGCTAGCTTC
CAAGATGACCTTACCAACACCATGTACCCACCATCTCCTGTGGACATCAAGCCCAGTCAA
GCGGAGCGAGCTGAAGATCTGCTGCAGCAGCTGGAAAGTCAATGCAAACAAGAAAACATA
TACTCTAACTGGTTCGAAGAGAAAGTTGAGAACAGCATCTTCGATAATATCAGTCAGGGA
CCAGAGCCGGAGTTCCGGCCGGTGGCTGTCGATTACACCGCGCAGACCGTGCCGCGCTCC
ACCGAGGTTCTTTTGAGGGAGTTCGAGTCGGTGTACAGTGGCGTCCAACTGACTCACCTC
ACCCCGCCTCAGAGCCCGCCCGGTCCGGCTACCCAACTCCTGCTAAGCTACGCCCAAGCT
CAGGCTGCTCCGCCTTTACAACCACTAACTGTCGAGCAATGGCCATTGATCCCGCCCCAA
AGCTCAATACCGGAGTACGACTGCGATCCTCAGGCCCTCGAGGAGTTGGTCCGCCATCGT
GCCGCTCAATTGGAATCGCCGCAGCCCGCGCACAGCCCTTCACCATCACCGCAATCATCA
CCGTCCTCATCGCCGCGGTCATCTTCCACTGATGAGGATTGGACATCATCCCGCCCCAAG
CCGTACTCCCGGAACGGTGATGATCGCAGGTCTCGTAAGAAGGAGCAGAACAAGAATGCG
GCTACCCGTTACCGCCAGAAGAAGAAAGCCGAGATCGAGGTGCTCCTCAACGAGGAACAG
GAGCTGCGCAAGCGACACGGTGAGCTCGGGGACAAGTGTTCCGACCTCCAACGCGAGATC
CGCTACATCAAGGGCATCCTGCGCGACCTCTTCAAGGCAAAAGGCCTCATCAAATAG

Protein sequence:

MASSPDSFVQRQKSPSPTVSDRSQIEIASLVSKKVLLYLPPTTSQTSMSSSHWDESLASA
NNLLTDNECCILLDESLFYADQDILKSFPGAATKIETASFQDDLTNTMYPPSPVDIKPSQ
AERAEDLLQQLESQCKQENIYSNWFEEKVENSIFDNISQGPEPEFRPVAVDYTAQTVPRS
TEVLLREFESVYSGVQLTHLTPPQSPPGPATQLLLSYAQAQAAPPLQPLTVEQWPLIPPQ
SSIPEYDCDPQALEELVRHRAAQLESPQPAHSPSPSPQSSPSSSPRSSSTDEDWTSSRPK
PYSRNGDDRRSRKKEQNKNAATRYRQKKKAEIEVLLNEEQELRKRHGELGDKCSDLQREI
RYIKGILRDLFKAKGLIK