DPGLEAN17956 in OGS1.0

New model in OGS2.0DPOGS205178 
Genomic Positionscaffold347:+ 121080-130834
See gene structure
CDS Length1221
Paired RNAseq reads  268
Single RNAseq reads  948
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000874 (4e-55)
Best Drosophila hit  Cyclic-AMP response element binding protein A, isoform A (8e-25)
Best Human hitcyclic AMP-responsive element-binding protein 3-like protein 1 (1e-21)
Best NR hit (blastp)  Cyclic AMP-dependent transcription factor ATF-6 beta, putative [Pediculus humanus corporis] (2e-64)
Best NR hit (blastx)  AGAP011038-PA [Anopheles gambiae str. PEST] (2e-45)
GeneOntology terms












  
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0003702 RNA polymerase II transcription factor activity
GO:0035293 chitin-based larval cuticle pattern formation
GO:0007431 salivary gland development
GO:0009953 dorsal/ventral pattern formation
GO:0006366 transcription from RNA polymerase II promoter
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007435 salivary gland morphogenesis
GO:0042803 protein homodimerization activity
GO:0043565 sequence-specific DNA binding
GO:0046983 protein dimerization activity
GO:0008363 larval chitin-based cuticle development
InterPro families

  
IPR004827 Basic-leucine zipper (bZIP) transcription factor
IPR008917 Eukaryotic transcription factor, Skn-1-like, DNA-binding
IPR011616 bZIP transcription factor, bZIP-1
Orthology groupMCL16069

Nucleotide sequence:

ATGACCTTGTTTGGCGGACGTTTAGTGTTTAGCTGTTATTCACTCTCGGCCAAGGTGAGG
GCCGTGGATAGACGCGAATTAATACCTCATGTTATTGACCTCGTGTTGTTTGCTGTGAAC
TCTTCAATACGTATGGATATGCAGGACGTGTTAGACGAAGATTCAGAGATGAGCGACTGG
CTGATCGAGAGGGATTCCAAGCTGGGTGTAGTGCTTCATGATCGTCTGATGACGGACGCC
GCGCTAGGAGCAGCTCCCATCAAGACTGAACACTCATATAGCCTCCACTCCGATGTGGAA
TCAGCACCGCCCTCGCCACACCACACCAAAGTTGACGATATGGAAGACGAATGCTATCCA
GCCATTCCAGCGAGTGCGTGGCGGAGTCGCCGGCGCTCCAGTGATTCCAGTCCGGAGGTG
AAGGCCGAGCCCAAGTCTGAACCAGAGTCGCCAGCCTCCTCGTGTCCTCCGTCGCCAACA
CCCACTATGTCGACCGTCGACTATGTCATAGACCACACAAGAACATTAAAGAAACTAGTT
CTTATAAGAGTTGCCAGTAAGATCTGGCCGTCAATCCACATGCCGCAAGCTGTGCTGCAG
AAAGTTGGTGCCGCCGGCGTGCCGCAAATACTTCTGAGTGCGGCACCACGTATCGCAAAC
TCGCTCAACAACAATAAACTGTCTATCAAAGTCACCTCATCAGGGACATCAGGTTTCAAC
CTACCCCCAACTCCTCCGTCATCATTGTCGTCGTCTGACAGTGAGGGTGCTCTGTCGCCA
TCTCACGAGCCCCCCGCCCCCATCACCCCGGCACCCCCGCGCCGCTCTCACCTATACGTG
TCACACCACTCCAGGCAGCCTATCAACACACCCCTCATCAGCAGCCAGCCGAAAGGATCC
ACTGGCACATTAGTTTTAACAGAGGAAGAGAAACGCACGTTACTAGCCGAAGGTTACCCG
GTGCCGACACGTCTGCCGCTCACCAAGGCCGAGGAGAAGTCGCTGAAGAAAGTCAGGAGG
AAAATTAAGAATAAGATTTCTGCACAAGAAAGTAGACGCAAGAAGAAAGAATACATGGAC
CAATTAGAGAGGAAAGTAGAAATATTAGTATCAGAGAACACAGACTACAGGAAGAGGGTC
GAGACCTTGGAGAGCACCAATGCTAATCTATTGAGCCAGCTGGCAGCCCTGCAGGCGATG
GTGAGGGCATCCAGGAAGTGA

Protein sequence:

MTLFGGRLVFSCYSLSAKVRAVDRRELIPHVIDLVLFAVNSSIRMDMQDVLDEDSEMSDW
LIERDSKLGVVLHDRLMTDAALGAAPIKTEHSYSLHSDVESAPPSPHHTKVDDMEDECYP
AIPASAWRSRRRSSDSSPEVKAEPKSEPESPASSCPPSPTPTMSTVDYVIDHTRTLKKLV
LIRVASKIWPSIHMPQAVLQKVGAAGVPQILLSAAPRIANSLNNNKLSIKVTSSGTSGFN
LPPTPPSSLSSSDSEGALSPSHEPPAPITPAPPRRSHLYVSHHSRQPINTPLISSQPKGS
TGTLVLTEEEKRTLLAEGYPVPTRLPLTKAEEKSLKKVRRKIKNKISAQESRRKKKEYMD
QLERKVEILVSENTDYRKRVETLESTNANLLSQLAALQAMVRASRK