New model in OGS2.0 | DPOGS200164  |
---|---|
Genomic Position | scaffold2269:- 4866-6644 |
See gene structure | |
CDS Length | 1020 |
Paired RNAseq reads   | 2914 |
Single RNAseq reads   | 10508 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002915 (5e-46) |
Best Drosophila hit   | X box binding protein-1 (5e-20) |
Best Human hit | X-box-binding protein 1 isoform XBP1(S) (4e-12) |
Best NR hit (blastp)   | PREDICTED: similar to X box binding protein-1 CG9415-PA [Tribolium castaneum] (4e-27) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC012878 [Tribolium castaneum] (2e-25) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus GO:0045449 regulation of transcription GO:0042803 protein homodimerization activity GO:0043565 sequence-specific DNA binding GO:0046983 protein dimerization activity GO:0006355 regulation of transcription, DNA-dependent GO:0030968 endoplasmic reticulum unfolded protein response GO:0034976 response to endoplasmic reticulum stress GO:0010506 regulation of autophagy GO:0007040 lysosome organization |
InterPro families    | IPR004827 Basic-leucine zipper (bZIP) transcription factor IPR011700 Basic leucine zipper |
Orthology group | MCL16146 |
Nucleotide sequence:
ATGAGCGCTCCGATAATCATAACTGTGCCTAATAATTATCTGGCGGTGGACGATGTGGAG
TCGAAGGTGGTTCTCGATGTGTCTCCTAGTCCACCGTCCAGGAAAAGGAGGCTGGACCAT
CTAACATGGGAAGAAAAGATGCAAAGGAAGAAACTCAAGAACAGAGTTGCAGCTCAGACA
TCGCGCGACCGGAAGAAGGCGAAGATGGATGAAATGGAGGGTCGTATCAAGCACTTCATG
GACTTAAACGAGCGGCTTCTTGGTGAGGTGGAGAACCTAAAGGCGATGAACGAGCGGCTT
CTGAGTGAGAACTCAGCCCTGCGCGAGGCGGCGAGGAGCGTCGCGGTGGCCCCGAGACCA
GCAGAGTCTCATCCTCAGCAGAAGGTGGGGCCCCTGTCGGCACTCAACGCGGCTCGTCTA
GTGATGCTGATGTATGTGCTCTCTCAGAACTCCTGCAACACTTGGACTCCCCCGAGTATT
TGGACACCCTCCACCAACTTGCAGATCAATTACTCCAAGAAATTGATGGAAAAACTGCAG
GAGAAGCTGCCGATGATAAAGCCAGCAGCAATAGACATTGTCCTGAAAGAAATGAAGTGG
TGGGGTCCACAGCAGAACAATTGGAATCCTGTCAAAACAGACACTATTAAAGAAGAAAAC
GGGGACAAGGGTGACTTATTCTATGCAAGCTACGAAGCAAATGATTGTGTGACAATTGAA
GTTCCTTGCGAGGAACAAACAGAAGAATCGGCTCCAATAAAATTGGACACTGATTTTAAT
AAATTTACGGACGACTGTTTGGATGTCACATTGGAATCTGATATGAAGTTATTGTCACCT
CTGCCTATGTCAATAAAATCTGTGGATGAAAATGTATTGGCAGTGTCCCCGTCACACAGT
AACTTGAGCTCTGACATGGGCTACGAGTCACTCTCCTCCCCGCTCAGTGAACCTGAGTCT
ATGGATCTGTCAGATTTTTGGTGTGAATCATTCCCGGAACTGTTCCCGGACCTGGTGTAA
Protein sequence:
MSAPIIITVPNNYLAVDDVESKVVLDVSPSPPSRKRRLDHLTWEEKMQRKKLKNRVAAQT
SRDRKKAKMDEMEGRIKHFMDLNERLLGEVENLKAMNERLLSENSALREAARSVAVAPRP
AESHPQQKVGPLSALNAARLVMLMYVLSQNSCNTWTPPSIWTPSTNLQINYSKKLMEKLQ
EKLPMIKPAAIDIVLKEMKWWGPQQNNWNPVKTDTIKEENGDKGDLFYASYEANDCVTIE
VPCEEQTEESAPIKLDTDFNKFTDDCLDVTLESDMKLLSPLPMSIKSVDENVLAVSPSHS
NLSSDMGYESLSSPLSEPESMDLSDFWCESFPELFPDLV