New model in OGS2.0 | DPOGS215362  |
---|---|
Genomic Position | scaffold2176:+ 27407-31747 |
See gene structure | |
CDS Length | 1254 |
Paired RNAseq reads   | 444 |
Single RNAseq reads   | 1937 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009565 (1e-27) |
Best Drosophila hit   | Sequence-specific single-stranded DNA-binding protein, isoform C (2e-55) |
Best Human hit | single-stranded DNA-binding protein 3 isoform a (1e-51) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (9e-121) |
Best NR hit (blastx)   | conserved hypothetical protein [Culex quinquefasciatus] (6e-65) |
GeneOntology terms    | GO:0003697 single-stranded DNA binding GO:0005634 nucleus GO:0045449 regulation of transcription GO:0030528 transcription regulator activity |
InterPro families    | IPR008116 Sequence-specific single-strand DNA-binding protein IPR007591 Single-stranded DNA-binding protein, SSDP IPR006594 LisH dimerisation motif |
Orthology group | MCL14924 |
Nucleotide sequence:
ATGTATGCCAAGGGCAAAAGCTCTGCGGTACCTTCGGACGCTCAGGCACGGGAGAAGTTG
GCCCTTTATGTGTATGAGTACTTACTGCACGTCGGGGCACAGAAAGCGGCGCAGACTTTC
CTTTCTGAAATACGATGGGAAAAGAACATAACACTCGGCGAGCCGCCCGGATTCCTGCAT
TCCTGGTGGTGTGTTTTCTGGGACCTGTACTGCGCCGCGCCTGAAAGGAGGGACACATGC
GAACACTCCTCTGAGGCTAAGGCATTCCATGACTATGGATTCGTCAATTCAGGTTATGGT
GTTAACGGCATCGGTCACAACGCAGGCCCGGCGCCGCCTAATGACGGTATGGGTGGCGGA
GGTATGCCACCAGGTTTCTTCCCCAACTCCTCACTCCGACCATCACCGCCAGCCCCACAT
CCTGGATCTCAGCCCTCACCGCATGGACCACAGCCACAGTTGATGGGGACAGGCCAGCCG
TTCATAGGACCCTGGTACTCGGGAGGACCAAGAACAGCCGTCAGAATGGGCATGGGAAAT
GATTTTAATGGTCCTCCGGGTCAAGGCATGATGTCGAACTCCTTGGAGCGAGGCAGCGGT
ATGCTGGGCGGGCCGCGCATGACCCCGCCCCGCCCCGGCATGGGACCCATGAGCCCTGGT
GCATATGCAGCCGGCATGCGTGGCCCACCGCCACAAGCCCCAGGTATGCCACCAATGGGT
ATGGGACCACGTGGCGCTTGGGCCGGCGGAAGTGGCGGCGCTGGTGGGGGATCCGCCCCC
CTCAACTACAGCGGAGGCTCGCCCGGCGCGTACGGGGCGCCTCCCGGGTCCAATGGACCC
CCAGGACCTCCGACTCCCATCATGCCAAGCCCACAGGACTCATCCAATTCGGGCGGTGAC
AACATGTACACATTGATGAAGCCGGTGGGCGCAGCCCTAGGGGCAGAGTTCCCGCTCGCC
GGCGAGCACGGGCCCTCGTCGCAGCACCTACCTCAGCCTCCCACTTCCGAAGGGCTAGGC
GGGGTGGACGGTATGAAGGCGTCCCCGGGCGGTGTCGGGGGCGGAGGCCCGGGGACTCCG
AGAGAGGACTCCGGCTCTGGAATGGGGGATTACAATTTAAGTTTCGGCGGACCGGGCGGC
GATCAGAACGACCAGACGGAGTCGGCGGCCATTCTCAAGATAAAGGAGAGCATGCAAGAG
GAGGCGAAGAGATTCGAGAAGGATCCGGACCATCCAGATTACTTTATGCAGTGA
Protein sequence:
MYAKGKSSAVPSDAQAREKLALYVYEYLLHVGAQKAAQTFLSEIRWEKNITLGEPPGFLH
SWWCVFWDLYCAAPERRDTCEHSSEAKAFHDYGFVNSGYGVNGIGHNAGPAPPNDGMGGG
GMPPGFFPNSSLRPSPPAPHPGSQPSPHGPQPQLMGTGQPFIGPWYSGGPRTAVRMGMGN
DFNGPPGQGMMSNSLERGSGMLGGPRMTPPRPGMGPMSPGAYAAGMRGPPPQAPGMPPMG
MGPRGAWAGGSGGAGGGSAPLNYSGGSPGAYGAPPGSNGPPGPPTPIMPSPQDSSNSGGD
NMYTLMKPVGAALGAEFPLAGEHGPSSQHLPQPPTSEGLGGVDGMKASPGGVGGGGPGTP
REDSGSGMGDYNLSFGGPGGDQNDQTESAAILKIKESMQEEAKRFEKDPDHPDYFMQ