New model in OGS2.0 | DPOGS215281  |
---|---|
Genomic Position | scaffold316:+ 150810-163122 |
See gene structure | |
CDS Length | 1896 |
Paired RNAseq reads   | 5407 |
Single RNAseq reads   | 12198 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001460 (1e-53) |
Best Drosophila hit   | Cyclic-AMP response element binding protein A, isoform A (3e-06) |
Best Human hit | cyclic AMP-responsive element-binding protein 3-like protein 3 (2e-13) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL003646 [Aedes aegypti] (4e-55) |
Best NR hit (blastx)   | AGAP001464-PA [Anopheles gambiae str. PEST] (4e-23) |
GeneOntology terms    | GO:0043565 sequence-specific DNA binding GO:0046983 protein dimerization activity GO:0003700 sequence-specific DNA binding transcription factor activity GO:0003677 DNA binding GO:0006355 regulation of transcription, DNA-dependent GO:0045449 regulation of transcription GO:0005634 nucleus GO:0016020 membrane GO:0005783 endoplasmic reticulum GO:0016021 integral to membrane GO:0006350 transcription |
InterPro families    | IPR004827 Basic-leucine zipper (bZIP) transcription factor IPR008917 Eukaryotic transcription factor, Skn-1-like, DNA-binding IPR011616 bZIP transcription factor, bZIP-1 |
Orthology group | MCL18984 |
Nucleotide sequence:
ATGGATAGTTTTCCTCCCGAGCACATGATGTGCGATCCCTGGCAGGGGACGGAAGATCTG
GAGAATATTTTCTCTCTGGATCAGAGTTCTTTGGACTTCCTGGAGAATGCTTTGCCGGAT
TTCAATTTAACAACGGACAATGCACAGCCAGAGGGACTTAGTAGTTCATGTTCTGATAGC
GGGTTATCAAGCGACCATGCTGAGCTCGACTTCGAGCAGCAGTTGTCGCCGAACCTCATC
CAGAGTACTGATTATGAGGATTTACCAACAACTATTCTAGAGCCGCTGAGTCCTTGTAAT
ACGAGTGACGTCATCATCCAGGACAACCAGACATTAGACATGCTGGACTTCGAACAGAAC
GTTGTCCCTGGATTCATTAACACAACATTCCAGAGCCCCGGTAAAGGCGGCAGAAAACGT
CGTTTCTCATCAACTCAAACAGTCGTTCAACCCAAGGTTCAGAAACAGACGATAAAGTTG
CCAGCGCCAGCGGGCAACAACAAACCGCAGCTGGTTGTGAAGGCGCAACCCCAGAAACCA
TTAAAAGTAACCAACATCCAAGTCATAAACCCTCAGACTAAGGTCTACTCTAAACCAGTG
GAAAGTGTAGCGCCCCAACGCAGAGTGATCCGCGTGGCTCCGATGGCCGGAAACCCTAGA
TCTATATTACTACCCGTAACATTCAAAGATATGAAAGATTTGAAATCGATCAAAATCATA
AATGCATCAGATTTGAAGAACTCGCCCAATATAAAGTTAGCTGCGGCTAATCTACTGTCG
CAGAGCAAACTGCAAGATCTCAAGATTGAAACACGCGGTGATGATTACGAACACAACGCT
AAATACGACGACTCGGCCAGCGATCACAGTGACGACGACGACGAGAAAGAGACGCAAATA
AATGACGGTAGGAACGGATATCCCCGTCTCGTTCTAACGGCCGAAGAGCGCCGACTGCTG
GCGAAGGAGGGCATCCAGCTGCCGAACAGTTACCCACTAACGAAGCACGAGGAAAGGGAG
CTGAAGAGGATCAGACGCAAAATACGCAACAAGATATCAGCACAGGATTCAAGGAAAAGG
AAGAAGGAATACGTCGATGGGCTCGAAGACAGGGTAAAACAATGCACAGCCGAAAATCAG
ACATTGTTGAGACGGATAAAGATGCTGCAGTCGCAGAATCAATCTCTCAGTCAGCAACTG
AAGAGGTTACAGAGCGTGTTGACCGGAGCGTCGTCGTCGGGTCGAGCTCAGCCGGCCACG
TGTCTGCTGGTGTTGTTACTGTCCGTGGCGTTGGTAGCGCTGCCGTCCGTCAGGGACGAG
GTCCGCCGTCGACCAGCCACCACCACCAGCAGCGCCACCACCACACCACCCTCGCCGGCT
ATCACACGAGCCCTGCTGTCCGCTACACACAAAATGGTGTTCGATGAGACGGTCATAGAT
GACGGGGAGTTCAATATGGACGAGCTGATAACGTTCAACAAGGCGCACTCCGACCACGAC
TACCAGGTGGTGAAGAACAGCGACAGACGGACACACAACGGATACATCGACCTCCCGATA
GACGAGGACTGGCCGCCCAAGAAGAAGCGGATGAAAAAGATTGAGTTCGACTACGGCGAC
GGCAAGGATTACATACCGATAGTCAAAGACGAGAACTACGAGAACATACAGCAGACCGGC
AGCGCTGTGGGCCACGACGTCCAGATAGGTGACAACTACCTGACGAACACGCTGCTGTCC
ACTGGCCGGAAACTCGGCGAGCTCTTGGACATATTCCCTCCCATACCCGTCAAGAACGAA
GACATACTGGTCGAGGAAGTAGCGGACTTCGACGAGAGGCACAACGTCACCGAAGTCAAA
AGTTTCGTAGTAAATGGGACCTTAAATGAATTCTAG
Protein sequence:
MDSFPPEHMMCDPWQGTEDLENIFSLDQSSLDFLENALPDFNLTTDNAQPEGLSSSCSDS
GLSSDHAELDFEQQLSPNLIQSTDYEDLPTTILEPLSPCNTSDVIIQDNQTLDMLDFEQN
VVPGFINTTFQSPGKGGRKRRFSSTQTVVQPKVQKQTIKLPAPAGNNKPQLVVKAQPQKP
LKVTNIQVINPQTKVYSKPVESVAPQRRVIRVAPMAGNPRSILLPVTFKDMKDLKSIKII
NASDLKNSPNIKLAAANLLSQSKLQDLKIETRGDDYEHNAKYDDSASDHSDDDDEKETQI
NDGRNGYPRLVLTAEERRLLAKEGIQLPNSYPLTKHEERELKRIRRKIRNKISAQDSRKR
KKEYVDGLEDRVKQCTAENQTLLRRIKMLQSQNQSLSQQLKRLQSVLTGASSSGRAQPAT
CLLVLLLSVALVALPSVRDEVRRRPATTTSSATTTPPSPAITRALLSATHKMVFDETVID
DGEFNMDELITFNKAHSDHDYQVVKNSDRRTHNGYIDLPIDEDWPPKKKRMKKIEFDYGD
GKDYIPIVKDENYENIQQTGSAVGHDVQIGDNYLTNTLLSTGRKLGELLDIFPPIPVKNE
DILVEEVADFDERHNVTEVKSFVVNGTLNEF