DPGLEAN18677 in OGS1.0

New model in OGS2.0DPOGS203155 
Genomic Positionscaffold117:- 213388-215803
See gene structure
CDS Length1578
Paired RNAseq reads  493
Single RNAseq reads  1354
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011510 (6e-178)
Best Drosophila hit  CG13624, isoform F (7e-71)
Best Human hitluman-recruiting factor isoform 1 (2e-37)
Best NR hit (blastp)  conserved hypothetical protein [Pediculus humanus corporis] (2e-135)
Best NR hit (blastx)  conserved hypothetical protein [Pediculus humanus corporis] (3e-105)
GeneOntology terms



  
GO:0042803 protein homodimerization activity
GO:0005575 cellular_component
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families  IPR004827 Basic-leucine zipper (bZIP) transcription factor
Orthology groupMCL16312

Nucleotide sequence:

ATGTCGGATTCTATTTACACTTTTGACTTTCTTTTAGAACCGAGTTTAGAGATAAAACAA
GAGTCAAACTTCGAGTTGGGCGCGATGTCAGCATCGGTGCCAATACCACAGAGGCGCACA
GAACTTGCGGATTTTAATACTGACTTAGATTTGTGTCTCCAAGACAATCAGATTGGATCA
TTCCACAGTGTTCCTACATTACCCTACAACAAATTTAATTTCGAAGCTGATTCATCAAGA
ATGGAGCCCTTCAAAATGGAGGATGATGATATATTCCAAGTAGACAAAGCTGATTTAGTG
TTAGGTCCAACTTTGGCAGAGTTAAATGCAAACCCAGATACATCTTTGGATGATCTTAAT
TTCGATGATCTGCTCTTGCCAGAGGAGAGTCGGTACTGTTTACAGATAGGTGGAGCCATG
AGTGGTTCAAAGAACTCTCCAAATGTTTTCCAAACCAACACATTGACTTCCGAGAGCCCC
TGTAGCCCTTATGGCAGAGCTCAGTTGGCTTTCTCGCCATCTAGTCAGCATAGTTCAGCA
TCTTCTAGCTTTGTTCCACCAATGAATCAGTTACCGGAACTGCTCCTAAGAATGGATGGT
TACAGTGGCGAAATTGCTCTTGGACAATCTGTCCCAGCTTCATCTGTTCTGCCACCGTTC
CCACCTAGTGTTAAAACCAAAGCACAATTATCCTCATCGGCTCCCACACATTTAACTATG
GACCAGATATGGCAACGTCGGGAGCCAAGAAAACATCTACTATCCACCAGTTCTCTCGCT
GAAGCGGGATCTGTGTCTTCCTTGTCCGGGGGACTTCTTAGTCCAGGAACTGGAGATTTT
TCTCAAGACGAAGATGACAGAGATATAGAATCTGACGAAGACAGTGATAGATATGAGGAT
CTATCATCTGATGAATCAAATGATGAATGCCCCGAGCGTAAGGAAGCTCGTCAGGCCAAA
AAAGAAAAATACTTCTGGCAGTACAATGTACAGGCTAAAGGTCCAAAAGGTCAAAGATTA
ATACTAAAGAAAAAATCAGAAGATCCACATGTACTAAATCTAGTTACAGACCCTGTATTT
AGCCCCAACTGCAATGTGAAAGGCATAAAACACAGTGGGAAGGCAAGAAAAGGTGACGGT
AATGATTTAACACCGAATCCCCGAAAACTTTATCTCATCGGTTTGGAACTAGATAAATTA
GGAAAAATTATCAATGATATGATACCGGTCAGCGAATTACCATTTAATGTACGACCGAAA
ACCAGAAAAGAGAAAAATAAACTGGCATCGAGAGCTTGCAGGTTAAAGAAAAAAGCTCAG
CACGAGGCAAATAAATTAAAACTATATGGATTACAGCACGAACATAGACGACTCCTTAAT
GGAATAAATCAAGTGAAACAAATACTTTGCAATAGGGTAACAAATCCAGATAACAATGTA
GACTGGTCTTCACATGTACAGACTTTAGTTAATACAGCCACCGAGGTAAAAATAGCTGGC
AAAACATCAGAGTTTGTTAACAAAATAGTGAACAATGTGAAGTCTGGACAAAATAATGGC
GGTTTGAACGAAATATGA

Protein sequence:

MSDSIYTFDFLLEPSLEIKQESNFELGAMSASVPIPQRRTELADFNTDLDLCLQDNQIGS
FHSVPTLPYNKFNFEADSSRMEPFKMEDDDIFQVDKADLVLGPTLAELNANPDTSLDDLN
FDDLLLPEESRYCLQIGGAMSGSKNSPNVFQTNTLTSESPCSPYGRAQLAFSPSSQHSSA
SSSFVPPMNQLPELLLRMDGYSGEIALGQSVPASSVLPPFPPSVKTKAQLSSSAPTHLTM
DQIWQRREPRKHLLSTSSLAEAGSVSSLSGGLLSPGTGDFSQDEDDRDIESDEDSDRYED
LSSDESNDECPERKEARQAKKEKYFWQYNVQAKGPKGQRLILKKKSEDPHVLNLVTDPVF
SPNCNVKGIKHSGKARKGDGNDLTPNPRKLYLIGLELDKLGKIINDMIPVSELPFNVRPK
TRKEKNKLASRACRLKKKAQHEANKLKLYGLQHEHRRLLNGINQVKQILCNRVTNPDNNV
DWSSHVQTLVNTATEVKIAGKTSEFVNKIVNNVKSGQNNGGLNEI