DPGLEAN13540 in OGS1.0

Genomic Positionscaffold20:+ 336347-338588
See gene structure
CDS Length1518
Paired RNAseq reads  82
Single RNAseq reads  301
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012290 (2e-21)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  PREDICTED: similar to pol-like protein [Acyrthosiphon pisum] (1e-32)
Best NR hit (blastx)  PREDICTED: similar to pol-like protein [Acyrthosiphon pisum] (1e-31)
GeneOntology terms

  
GO:0006278 RNA-dependent DNA replication
GO:0003964 RNA-directed DNA polymerase activity
GO:0003723 RNA binding
InterPro families  IPR005135 Endonuclease/exonuclease/phosphatase
Orthology groupND

Nucleotide sequence:

ATGTTTGAACCTCCGGGGGATACCCCCCCGGACGATCGTGGGGCCTGTAGACCTCCACCC
CCTAAAAGACACTATTCCGGAGACGAGGTTTCGCGGAACAATACCAGCGAGGACAGTGAT
AAGTATTATAAACCATATTATAAACAAGGATACACACGTGTATTCCCTGAGAGTACCAGC
AAAGGTGAGTACTCTGTTTTTGTAGAAAGTACTAAAGACACCAAACTTGGGAACCAAAAT
CCAATAACACTTGGCAGCTTATTCAAAAACGAAGTCAAGGGCGTGAAAAATATCAAACGA
GTCAATGCAAATAAAGTGAGCGTATTGTTTGAACAAACAATAAATGCCAATGCGTTTCTT
AAAAATTCCGACTTTCATACTAAAAACGATTTTAAAGTTTACATACCAGCTAGGGCCGTG
GAAACAATAGGAGTGGTAAGATTTGTTCCCACAAGCATATCTAATGAAGAACTGTTTAAA
AAACTCTCCTCTTCACATGAAATTATTGGTGTCAGGCGTTTTATGAAAAGGGTGAACGGT
GAAAGAAAACCATTAGAAAAGGATATAGATATATGTTTATTGAACGAAACTTGGCTGAAG
GATGGAAATAGATTTAGGATTCCCAACTACAATATGATAAATCAGAATGGTGTTAATGGA
CGCGGAGGAGTTGCCATACTAATTAAAAATACATTTAAATACCAAATAGTACCCACAAAT
TTTTATAATTTCTTACAATCTGTGGCAATAGTTTTAAAAACAGACTTTGGTGAATTATCG
ATTCTCTGTGCATATAGCCCTCCTCGAGGAAGTAGATTTAAGTGTAGGAGACTGAAGCAA
ATAATCGATGATTTACCAACGCCAATGTTACTAGCAGGTGATTTGAATGCTCATCATGTA
GCATTTGGATGTCGCTCCACTAACTCGAGAGGCAATGATGTATACAATTTATTAGATGAA
TGTAACTTATGTATTCTTAATACAGGGGCTTACACTACTGTAGGGAATATATCCCATAAC
CAGTCAGCCATAGACATTGCTTGCGTCACTCCATCTATAGCCCCTTTCTGCCATTGGAGA
GTACATGACGACCCAATGGGCAGCTACCACTATCCCACTATTTGTGACATTTACCTAAAT
GCAGAAAAATATGAAGTTAATAATCCCTCTGACCGTTATTTGTACAAAAAAGCAAACTGG
ACTTTGTTCAATTCTGAAACAGGAAAAGCTTTTAAAGAATTTTCCATTAACACTTCAGAT
CCGCTTTCGTGCTATGATAAATTTATTAGCATTTTAAATATCATTAAAGATAAATGTATA
CCAAAAGCCACTCGCATATCTCAATACATTAACAAAAAACCAGTTCCATGGTGGGACAAT
GAATGCGCAGATGCTGTAAAGAAAAGTAAAGATGCACTTAATTATTACAGACTGCTGCCT
AGTACAGAAAACTTTATAAAATATAAAAGATTGGATGCTATTAAGAAAAAACTACTGAGG
CAAAAGAAAAAAGAATAG

Protein sequence:

MFEPPGDTPPDDRGACRPPPPKRHYSGDEVSRNNTSEDSDKYYKPYYKQGYTRVFPESTS
KGEYSVFVESTKDTKLGNQNPITLGSLFKNEVKGVKNIKRVNANKVSVLFEQTINANAFL
KNSDFHTKNDFKVYIPARAVETIGVVRFVPTSISNEELFKKLSSSHEIIGVRRFMKRVNG
ERKPLEKDIDICLLNETWLKDGNRFRIPNYNMINQNGVNGRGGVAILIKNTFKYQIVPTN
FYNFLQSVAIVLKTDFGELSILCAYSPPRGSRFKCRRLKQIIDDLPTPMLLAGDLNAHHV
AFGCRSTNSRGNDVYNLLDECNLCILNTGAYTTVGNISHNQSAIDIACVTPSIAPFCHWR
VHDDPMGSYHYPTICDIYLNAEKYEVNNPSDRYLYKKANWTLFNSETGKAFKEFSINTSD
PLSCYDKFISILNIIKDKCIPKATRISQYINKKPVPWWDNECADAVKKSKDALNYYRLLP
STENFIKYKRLDAIKKKLLRQKKKE