New model in OGS2.0 | DPOGS206613  |
---|---|
Genomic Position | scaffold2204:- 12080-16249 |
See gene structure | |
CDS Length | 1179 |
Paired RNAseq reads   | 2405 |
Single RNAseq reads   | 5792 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008330 (5e-44) |
Best Drosophila hit   | A3-3 (4e-18) |
Best Human hit | jun dimerization protein 2 isoform b (1e-09) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP001536-PA [Tribolium castaneum] (3e-40) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP001536-PA [Tribolium castaneum] (2e-33) |
GeneOntology terms    | GO:0046983 protein dimerization activity GO:0043565 sequence-specific DNA binding GO:0006355 regulation of transcription, DNA-dependent GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0007399 nervous system development |
InterPro families    | IPR004827 Basic-leucine zipper (bZIP) transcription factor IPR011616 bZIP transcription factor, bZIP-1 IPR000837 Fos transforming protein |
Orthology group | MCL16433 |
Nucleotide sequence:
ATGAACTCGACGAGCATTAGCTCAGCAGTGCCGACAATTAAATGTGAGGATACGTCTCCG
TCGCCGACAGCCGCCGTTGGTACTGACGGAGAATCAGGGGCATATCTTAGTGTTAACGTC
AACCTGAGCACTGCCATGATGAACCTCCTCGCTGCGGAAGGCGCCAACACCACACTGCGC
ACCCCAGAAATCGTCAACGACCTGATCACCATGTCAAACCCCATGGACCAATATAACTAC
GACAAAAATTCTAGCTTCAAGAATAGCAATGACTCGAACTCTTCAATGTCAAACAGCTCG
TCGGCTACTTCACCAGCCTCCGGGACCCCGCCCAGCATACAAAAGACTTGCTCGGAACTG
ATTAAGGCCGGTTTGAAGTTGTCCATAGAGTCGAAAAGGAAAATGTCGGGAAGCGACACG
GATGTCGGCATCAAGAGAATGAAGAAGGAGGAGAGCGATGATGATTACGACAGCAGTCAT
ACCCAGGTGTCTAGAAACGAGCTGACACCAGAGGACGAGGAGAGGAGGCGTCGGAGAAGA
GAAAGAAACAAAATAGCAGCTACCAAGTGCAGGATGAAGAAGAGGGAACGAACAGTGAAC
CTCGTTAATGAAAGTGAAGTGCTCGAAAACCAGAATATTGACCTCAAGGCGCAACTTAAG
GATCTCGAAGTTCAGAAGCGACAGTTGCTAGACATGTTATCGCAACACGCGTCCTCCTGC
GTACGGAACAACACTAACGCGAGAACGTCGCCAAACTTCAACATGATGAGGACGTTCGAA
TCACCGACAACATTCCCCGTCAACTACGACGCCCACTCGCCCTACATACGACCAGAATCA
GCCAACATACTAGCCTCGAGCTACACGTGTGCCACCCCACTCAACGAGACCATAGACACT
ATGTCTTTAGACGCGGCCTACATGACGCCGCAGAACATAGACGTCGAATACAACAGACCC
GACAGCGTCATCAGTCTGCCCCCTAATTCCGACAGTTACATCACCACGGATGGATACCTG
CCAAAAGCCACAGCGATCCTAGGTCCGATCGAACCGGAAACGGAATACTATGACAATGAG
ATCAATTACGTCACCCAACAATGTCACAGTTACCCCAACAACATACAGGACTCACAACAG
AAGTTAAACAACAGTCTGAACGACGGCTGTCTGGTCTAA
Protein sequence:
MNSTSISSAVPTIKCEDTSPSPTAAVGTDGESGAYLSVNVNLSTAMMNLLAAEGANTTLR
TPEIVNDLITMSNPMDQYNYDKNSSFKNSNDSNSSMSNSSSATSPASGTPPSIQKTCSEL
IKAGLKLSIESKRKMSGSDTDVGIKRMKKEESDDDYDSSHTQVSRNELTPEDEERRRRRR
ERNKIAATKCRMKKRERTVNLVNESEVLENQNIDLKAQLKDLEVQKRQLLDMLSQHASSC
VRNNTNARTSPNFNMMRTFESPTTFPVNYDAHSPYIRPESANILASSYTCATPLNETIDT
MSLDAAYMTPQNIDVEYNRPDSVISLPPNSDSYITTDGYLPKATAILGPIEPETEYYDNE
INYVTQQCHSYPNNIQDSQQKLNNSLNDGCLV