DPGLEAN21100 in OGS1.0

New model in OGS2.0DPOGS215362 
Genomic Positionscaffold2176:+ 27407-31747
See gene structure
CDS Length1254
Paired RNAseq reads  444
Single RNAseq reads  1937
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009565 (1e-27)
Best Drosophila hit  Sequence-specific single-stranded DNA-binding protein, isoform C (2e-55)
Best Human hitsingle-stranded DNA-binding protein 3 isoform a (1e-51)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (9e-121)
Best NR hit (blastx)  conserved hypothetical protein [Culex quinquefasciatus] (6e-65)
GeneOntology terms


  
GO:0003697 single-stranded DNA binding
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0030528 transcription regulator activity
InterPro families

  
IPR008116 Sequence-specific single-strand DNA-binding protein
IPR007591 Single-stranded DNA-binding protein, SSDP
IPR006594 LisH dimerisation motif
Orthology groupMCL14924

Nucleotide sequence:

ATGTATGCCAAGGGCAAAAGCTCTGCGGTACCTTCGGACGCTCAGGCACGGGAGAAGTTG
GCCCTTTATGTGTATGAGTACTTACTGCACGTCGGGGCACAGAAAGCGGCGCAGACTTTC
CTTTCTGAAATACGATGGGAAAAGAACATAACACTCGGCGAGCCGCCCGGATTCCTGCAT
TCCTGGTGGTGTGTTTTCTGGGACCTGTACTGCGCCGCGCCTGAAAGGAGGGACACATGC
GAACACTCCTCTGAGGCTAAGGCATTCCATGACTATGGATTCGTCAATTCAGGTTATGGT
GTTAACGGCATCGGTCACAACGCAGGCCCGGCGCCGCCTAATGACGGTATGGGTGGCGGA
GGTATGCCACCAGGTTTCTTCCCCAACTCCTCACTCCGACCATCACCGCCAGCCCCACAT
CCTGGATCTCAGCCCTCACCGCATGGACCACAGCCACAGTTGATGGGGACAGGCCAGCCG
TTCATAGGACCCTGGTACTCGGGAGGACCAAGAACAGCCGTCAGAATGGGCATGGGAAAT
GATTTTAATGGTCCTCCGGGTCAAGGCATGATGTCGAACTCCTTGGAGCGAGGCAGCGGT
ATGCTGGGCGGGCCGCGCATGACCCCGCCCCGCCCCGGCATGGGACCCATGAGCCCTGGT
GCATATGCAGCCGGCATGCGTGGCCCACCGCCACAAGCCCCAGGTATGCCACCAATGGGT
ATGGGACCACGTGGCGCTTGGGCCGGCGGAAGTGGCGGCGCTGGTGGGGGATCCGCCCCC
CTCAACTACAGCGGAGGCTCGCCCGGCGCGTACGGGGCGCCTCCCGGGTCCAATGGACCC
CCAGGACCTCCGACTCCCATCATGCCAAGCCCACAGGACTCATCCAATTCGGGCGGTGAC
AACATGTACACATTGATGAAGCCGGTGGGCGCAGCCCTAGGGGCAGAGTTCCCGCTCGCC
GGCGAGCACGGGCCCTCGTCGCAGCACCTACCTCAGCCTCCCACTTCCGAAGGGCTAGGC
GGGGTGGACGGTATGAAGGCGTCCCCGGGCGGTGTCGGGGGCGGAGGCCCGGGGACTCCG
AGAGAGGACTCCGGCTCTGGAATGGGGGATTACAATTTAAGTTTCGGCGGACCGGGCGGC
GATCAGAACGACCAGACGGAGTCGGCGGCCATTCTCAAGATAAAGGAGAGCATGCAAGAG
GAGGCGAAGAGATTCGAGAAGGATCCGGACCATCCAGATTACTTTATGCAGTGA

Protein sequence:

MYAKGKSSAVPSDAQAREKLALYVYEYLLHVGAQKAAQTFLSEIRWEKNITLGEPPGFLH
SWWCVFWDLYCAAPERRDTCEHSSEAKAFHDYGFVNSGYGVNGIGHNAGPAPPNDGMGGG
GMPPGFFPNSSLRPSPPAPHPGSQPSPHGPQPQLMGTGQPFIGPWYSGGPRTAVRMGMGN
DFNGPPGQGMMSNSLERGSGMLGGPRMTPPRPGMGPMSPGAYAAGMRGPPPQAPGMPPMG
MGPRGAWAGGSGGAGGGSAPLNYSGGSPGAYGAPPGSNGPPGPPTPIMPSPQDSSNSGGD
NMYTLMKPVGAALGAEFPLAGEHGPSSQHLPQPPTSEGLGGVDGMKASPGGVGGGGPGTP
REDSGSGMGDYNLSFGGPGGDQNDQTESAAILKIKESMQEEAKRFEKDPDHPDYFMQ