DPGLEAN20020 in OGS1.0

New model in OGS2.0DPOGS200164 
Genomic Positionscaffold2269:- 4866-6644
See gene structure
CDS Length1020
Paired RNAseq reads  2914
Single RNAseq reads  10508
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002915 (5e-46)
Best Drosophila hit  X box binding protein-1 (5e-20)
Best Human hitX-box-binding protein 1 isoform XBP1(S) (4e-12)
Best NR hit (blastp)  PREDICTED: similar to X box binding protein-1 CG9415-PA [Tribolium castaneum] (4e-27)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC012878 [Tribolium castaneum] (2e-25)
GeneOntology terms









  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0042803 protein homodimerization activity
GO:0043565 sequence-specific DNA binding
GO:0046983 protein dimerization activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0030968 endoplasmic reticulum unfolded protein response
GO:0034976 response to endoplasmic reticulum stress
GO:0010506 regulation of autophagy
GO:0007040 lysosome organization
InterPro families
  
IPR004827 Basic-leucine zipper (bZIP) transcription factor
IPR011700 Basic leucine zipper
Orthology groupMCL16146

Nucleotide sequence:

ATGAGCGCTCCGATAATCATAACTGTGCCTAATAATTATCTGGCGGTGGACGATGTGGAG
TCGAAGGTGGTTCTCGATGTGTCTCCTAGTCCACCGTCCAGGAAAAGGAGGCTGGACCAT
CTAACATGGGAAGAAAAGATGCAAAGGAAGAAACTCAAGAACAGAGTTGCAGCTCAGACA
TCGCGCGACCGGAAGAAGGCGAAGATGGATGAAATGGAGGGTCGTATCAAGCACTTCATG
GACTTAAACGAGCGGCTTCTTGGTGAGGTGGAGAACCTAAAGGCGATGAACGAGCGGCTT
CTGAGTGAGAACTCAGCCCTGCGCGAGGCGGCGAGGAGCGTCGCGGTGGCCCCGAGACCA
GCAGAGTCTCATCCTCAGCAGAAGGTGGGGCCCCTGTCGGCACTCAACGCGGCTCGTCTA
GTGATGCTGATGTATGTGCTCTCTCAGAACTCCTGCAACACTTGGACTCCCCCGAGTATT
TGGACACCCTCCACCAACTTGCAGATCAATTACTCCAAGAAATTGATGGAAAAACTGCAG
GAGAAGCTGCCGATGATAAAGCCAGCAGCAATAGACATTGTCCTGAAAGAAATGAAGTGG
TGGGGTCCACAGCAGAACAATTGGAATCCTGTCAAAACAGACACTATTAAAGAAGAAAAC
GGGGACAAGGGTGACTTATTCTATGCAAGCTACGAAGCAAATGATTGTGTGACAATTGAA
GTTCCTTGCGAGGAACAAACAGAAGAATCGGCTCCAATAAAATTGGACACTGATTTTAAT
AAATTTACGGACGACTGTTTGGATGTCACATTGGAATCTGATATGAAGTTATTGTCACCT
CTGCCTATGTCAATAAAATCTGTGGATGAAAATGTATTGGCAGTGTCCCCGTCACACAGT
AACTTGAGCTCTGACATGGGCTACGAGTCACTCTCCTCCCCGCTCAGTGAACCTGAGTCT
ATGGATCTGTCAGATTTTTGGTGTGAATCATTCCCGGAACTGTTCCCGGACCTGGTGTAA

Protein sequence:

MSAPIIITVPNNYLAVDDVESKVVLDVSPSPPSRKRRLDHLTWEEKMQRKKLKNRVAAQT
SRDRKKAKMDEMEGRIKHFMDLNERLLGEVENLKAMNERLLSENSALREAARSVAVAPRP
AESHPQQKVGPLSALNAARLVMLMYVLSQNSCNTWTPPSIWTPSTNLQINYSKKLMEKLQ
EKLPMIKPAAIDIVLKEMKWWGPQQNNWNPVKTDTIKEENGDKGDLFYASYEANDCVTIE
VPCEEQTEESAPIKLDTDFNKFTDDCLDVTLESDMKLLSPLPMSIKSVDENVLAVSPSHS
NLSSDMGYESLSSPLSEPESMDLSDFWCESFPELFPDLV