DPGLEAN16626 in OGS1.0

New model in OGS2.0  DPOGS215281
Genomic Position  scaffold316:+ 150810-163122
CDS Length  1896
Paired RNAseq reads  5407
Single RNAseq reads  12198
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001460 (1e-53)
Best Drosophila hit  Cyclic-AMP response element binding protein A, isoform A (3e-06)
Best Human hit  cyclic AMP-responsive element-binding protein 3-like protein 3 (2e-13)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL003646 [Aedes aegypti] (4e-55)
Best NR hit (blastx)  AGAP001464-PA [Anopheles gambiae str. PEST] (4e-23)
Gene Ontology terms
GO:0043565 sequence-specific DNA binding
GO:0046983 protein dimerization activity
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0003677 DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0016020 membrane
GO:0005783 endoplasmic reticulum
GO:0016021 integral to membrane
GO:0006350 transcription
InterPro families
IPR004827 Basic-leucine zipper (bZIP) transcription factor
IPR008917 Eukaryotic transcription factor, Skn-1-like, DNA-binding
IPR011616 bZIP transcription factor, bZIP-1
Orthology group  MCL18984

Nucleotide sequence:

ATGGATAGTTTTCCTCCCGAGCACATGATGTGCGATCCCTGGCAGGGGACGGAAGATCTG
GAGAATATTTTCTCTCTGGATCAGAGTTCTTTGGACTTCCTGGAGAATGCTTTGCCGGAT
TTCAATTTAACAACGGACAATGCACAGCCAGAGGGACTTAGTAGTTCATGTTCTGATAGC
GGGTTATCAAGCGACCATGCTGAGCTCGACTTCGAGCAGCAGTTGTCGCCGAACCTCATC
CAGAGTACTGATTATGAGGATTTACCAACAACTATTCTAGAGCCGCTGAGTCCTTGTAAT
ACGAGTGACGTCATCATCCAGGACAACCAGACATTAGACATGCTGGACTTCGAACAGAAC
GTTGTCCCTGGATTCATTAACACAACATTCCAGAGCCCCGGTAAAGGCGGCAGAAAACGT
CGTTTCTCATCAACTCAAACAGTCGTTCAACCCAAGGTTCAGAAACAGACGATAAAGTTG
CCAGCGCCAGCGGGCAACAACAAACCGCAGCTGGTTGTGAAGGCGCAACCCCAGAAACCA
TTAAAAGTAACCAACATCCAAGTCATAAACCCTCAGACTAAGGTCTACTCTAAACCAGTG
GAAAGTGTAGCGCCCCAACGCAGAGTGATCCGCGTGGCTCCGATGGCCGGAAACCCTAGA
TCTATATTACTACCCGTAACATTCAAAGATATGAAAGATTTGAAATCGATCAAAATCATA
AATGCATCAGATTTGAAGAACTCGCCCAATATAAAGTTAGCTGCGGCTAATCTACTGTCG
CAGAGCAAACTGCAAGATCTCAAGATTGAAACACGCGGTGATGATTACGAACACAACGCT
AAATACGACGACTCGGCCAGCGATCACAGTGACGACGACGACGAGAAAGAGACGCAAATA
AATGACGGTAGGAACGGATATCCCCGTCTCGTTCTAACGGCCGAAGAGCGCCGACTGCTG
GCGAAGGAGGGCATCCAGCTGCCGAACAGTTACCCACTAACGAAGCACGAGGAAAGGGAG
CTGAAGAGGATCAGACGCAAAATACGCAACAAGATATCAGCACAGGATTCAAGGAAAAGG
AAGAAGGAATACGTCGATGGGCTCGAAGACAGGGTAAAACAATGCACAGCCGAAAATCAG
ACATTGTTGAGACGGATAAAGATGCTGCAGTCGCAGAATCAATCTCTCAGTCAGCAACTG
AAGAGGTTACAGAGCGTGTTGACCGGAGCGTCGTCGTCGGGTCGAGCTCAGCCGGCCACG
TGTCTGCTGGTGTTGTTACTGTCCGTGGCGTTGGTAGCGCTGCCGTCCGTCAGGGACGAG
GTCCGCCGTCGACCAGCCACCACCACCAGCAGCGCCACCACCACACCACCCTCGCCGGCT
ATCACACGAGCCCTGCTGTCCGCTACACACAAAATGGTGTTCGATGAGACGGTCATAGAT
GACGGGGAGTTCAATATGGACGAGCTGATAACGTTCAACAAGGCGCACTCCGACCACGAC
TACCAGGTGGTGAAGAACAGCGACAGACGGACACACAACGGATACATCGACCTCCCGATA
GACGAGGACTGGCCGCCCAAGAAGAAGCGGATGAAAAAGATTGAGTTCGACTACGGCGAC
GGCAAGGATTACATACCGATAGTCAAAGACGAGAACTACGAGAACATACAGCAGACCGGC
AGCGCTGTGGGCCACGACGTCCAGATAGGTGACAACTACCTGACGAACACGCTGCTGTCC
ACTGGCCGGAAACTCGGCGAGCTCTTGGACATATTCCCTCCCATACCCGTCAAGAACGAA
GACATACTGGTCGAGGAAGTAGCGGACTTCGACGAGAGGCACAACGTCACCGAAGTCAAA
AGTTTCGTAGTAAATGGGACCTTAAATGAATTCTAG

Protein sequence:

MDSFPPEHMMCDPWQGTEDLENIFSLDQSSLDFLENALPDFNLTTDNAQPEGLSSSCSDS
GLSSDHAELDFEQQLSPNLIQSTDYEDLPTTILEPLSPCNTSDVIIQDNQTLDMLDFEQN
VVPGFINTTFQSPGKGGRKRRFSSTQTVVQPKVQKQTIKLPAPAGNNKPQLVVKAQPQKP
LKVTNIQVINPQTKVYSKPVESVAPQRRVIRVAPMAGNPRSILLPVTFKDMKDLKSIKII
NASDLKNSPNIKLAAANLLSQSKLQDLKIETRGDDYEHNAKYDDSASDHSDDDDEKETQI
NDGRNGYPRLVLTAEERRLLAKEGIQLPNSYPLTKHEERELKRIRRKIRNKISAQDSRKR
KKEYVDGLEDRVKQCTAENQTLLRRIKMLQSQNQSLSQQLKRLQSVLTGASSSGRAQPAT
CLLVLLLSVALVALPSVRDEVRRRPATTTSSATTTPPSPAITRALLSATHKMVFDETVID
DGEFNMDELITFNKAHSDHDYQVVKNSDRRTHNGYIDLPIDEDWPPKKKRMKKIEFDYGD
GKDYIPIVKDENYENIQQTGSAVGHDVQIGDNYLTNTLLSTGRKLGELLDIFPPIPVKNE
DILVEEVADFDERHNVTEVKSFVVNGTLNEF
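
Consistency check (illustrative sketch): the listed CDS length of 1896 nt corresponds to 632 codons, which would match a 631-residue protein plus one stop codon. The short Python function below is one way to check that relationship for any CDS/protein pair copied from a record like this one. The function name `cds_matches_protein` and the choice of the three standard stop codons are assumptions made for illustration; they are not part of the database.

```python
# Hypothetical helper, not part of the source database.
# Assumes the standard stop codons and an ATG start codon.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_matches_protein(cds: str, protein: str) -> bool:
    """Return True if the CDS length is consistent with the protein length.

    Whitespace and line breaks are stripped, so sequences can be pasted
    as multi-line blocks. The check covers length and start/stop codons
    only; it does not translate the sequence.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return (
        len(cds) % 3 == 0                     # whole number of codons
        and cds.startswith("ATG")             # start codon
        and cds[-3:] in STOP_CODONS           # terminal stop codon
        and len(cds) // 3 - 1 == len(protein) # codons minus stop = residues
    )


# Toy example: two coding codons plus a stop codon, two residues.
print(cds_matches_protein("ATGGAT\nTAG", "MD"))
```

Pasting the nucleotide and protein blocks from this record into the two arguments would apply the same check to this gene model.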