DPGLEAN03513 in OGS1.0

New model in OGS2.0  DPOGS206613
Genomic Position  scaffold2204:- 12080-16249
CDS Length  1179
Paired RNAseq reads  2405
Single RNAseq reads  5792
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008330 (5e-44)
Best Drosophila hit  A3-3 (4e-18)
Best Human hit  jun dimerization protein 2 isoform b (1e-09)
Best NR hit (blastp)  PREDICTED: similar to AGAP001536-PA [Tribolium castaneum] (3e-40)
Best NR hit (blastx)  PREDICTED: similar to AGAP001536-PA [Tribolium castaneum] (2e-33)
GeneOntology terms
GO:0046983 protein dimerization activity
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007399 nervous system development
InterPro families
IPR004827 Basic-leucine zipper (bZIP) transcription factor
IPR011616 bZIP transcription factor, bZIP-1
IPR000837 Fos transforming protein
Orthology group  MCL16433

Nucleotide sequence:

ATGAACTCGACGAGCATTAGCTCAGCAGTGCCGACAATTAAATGTGAGGATACGTCTCCG
TCGCCGACAGCCGCCGTTGGTACTGACGGAGAATCAGGGGCATATCTTAGTGTTAACGTC
AACCTGAGCACTGCCATGATGAACCTCCTCGCTGCGGAAGGCGCCAACACCACACTGCGC
ACCCCAGAAATCGTCAACGACCTGATCACCATGTCAAACCCCATGGACCAATATAACTAC
GACAAAAATTCTAGCTTCAAGAATAGCAATGACTCGAACTCTTCAATGTCAAACAGCTCG
TCGGCTACTTCACCAGCCTCCGGGACCCCGCCCAGCATACAAAAGACTTGCTCGGAACTG
ATTAAGGCCGGTTTGAAGTTGTCCATAGAGTCGAAAAGGAAAATGTCGGGAAGCGACACG
GATGTCGGCATCAAGAGAATGAAGAAGGAGGAGAGCGATGATGATTACGACAGCAGTCAT
ACCCAGGTGTCTAGAAACGAGCTGACACCAGAGGACGAGGAGAGGAGGCGTCGGAGAAGA
GAAAGAAACAAAATAGCAGCTACCAAGTGCAGGATGAAGAAGAGGGAACGAACAGTGAAC
CTCGTTAATGAAAGTGAAGTGCTCGAAAACCAGAATATTGACCTCAAGGCGCAACTTAAG
GATCTCGAAGTTCAGAAGCGACAGTTGCTAGACATGTTATCGCAACACGCGTCCTCCTGC
GTACGGAACAACACTAACGCGAGAACGTCGCCAAACTTCAACATGATGAGGACGTTCGAA
TCACCGACAACATTCCCCGTCAACTACGACGCCCACTCGCCCTACATACGACCAGAATCA
GCCAACATACTAGCCTCGAGCTACACGTGTGCCACCCCACTCAACGAGACCATAGACACT
ATGTCTTTAGACGCGGCCTACATGACGCCGCAGAACATAGACGTCGAATACAACAGACCC
GACAGCGTCATCAGTCTGCCCCCTAATTCCGACAGTTACATCACCACGGATGGATACCTG
CCAAAAGCCACAGCGATCCTAGGTCCGATCGAACCGGAAACGGAATACTATGACAATGAG
ATCAATTACGTCACCCAACAATGTCACAGTTACCCCAACAACATACAGGACTCACAACAG
AAGTTAAACAACAGTCTGAACGACGGCTGTCTGGTCTAA

Protein sequence:

MNSTSISSAVPTIKCEDTSPSPTAAVGTDGESGAYLSVNVNLSTAMMNLLAAEGANTTLR
TPEIVNDLITMSNPMDQYNYDKNSSFKNSNDSNSSMSNSSSATSPASGTPPSIQKTCSEL
IKAGLKLSIESKRKMSGSDTDVGIKRMKKEESDDDYDSSHTQVSRNELTPEDEERRRRRR
ERNKIAATKCRMKKRERTVNLVNESEVLENQNIDLKAQLKDLEVQKRQLLDMLSQHASSC
VRNNTNARTSPNFNMMRTFESPTTFPVNYDAHSPYIRPESANILASSYTCATPLNETIDT
MSLDAAYMTPQNIDVEYNRPDSVISLPPNSDSYITTDGYLPKATAILGPIEPETEYYDNE
INYVTQQCHSYPNNIQDSQQKLNNSLNDGCLV
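
The CDS length and the two sequences in this record can be cross-checked with simple arithmetic: a CDS of 1179 nt would correspond to 393 codons, that is 392 residues plus a stop codon. The following is a minimal sketch of such a check, not part of the original annotation pipeline. The function name `check_cds` and the returned keys are hypothetical, and the sketch assumes the CDS includes its stop codon and uses the standard stop codons (TAA, TAG, TGA).

```python
# Hypothetical consistency check for a CDS / protein pair.
# Assumption: the CDS is given 5'->3' on the coding strand and
# includes the terminal stop codon.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str, protein: str) -> dict:
    """Return basic length and boundary checks for a CDS and its protein."""
    # Remove line breaks and other whitespace from the pasted sequences.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return {
        "cds_length": len(cds),
        # A complete CDS should consist of whole codons.
        "whole_codons": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # One codon per residue, plus one stop codon that is not translated.
        "protein_length_matches": len(protein) == len(cds) // 3 - 1,
    }
```

Passing the nucleotide and protein sequences listed above to `check_cds` would report the CDS length alongside the boolean checks, which can then be compared with the CDS Length field.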