DPGLEAN05207 in OGS1.0

New model in OGS2.0: DPOGS209116
Genomic Position: scaffold2185:+ 7830-11365
CDS Length: 3402
Paired RNAseq reads: 101
Single RNAseq reads: 291
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA012290 (6e-26)
Best Drosophila hit: ND
Best Human hit: ND
Best NR hit (blastp): PREDICTED: similar to pol-like protein [Nasonia vitripennis] (1e-80)
Best NR hit (blastx): PREDICTED: similar to pol-like protein [Nasonia vitripennis] (1e-76)
GeneOntology terms:
GO:0006278 RNA-dependent DNA replication
GO:0003964 RNA-directed DNA polymerase activity
GO:0003723 RNA binding
InterPro families:
IPR000477 Reverse transcriptase
IPR002156 Ribonuclease H domain
IPR005135 Endonuclease/exonuclease/phosphatase
IPR012337 Ribonuclease H-like
Orthology group: MCL10203

Nucleotide sequence:

ATGCCAAACATCCTTCAATGGAACTGTCACAGCATTCACCATAAAAAAAATGATATTATT
TCTCTCATTAATAATTACAACCCCTCTATTTTTGCCATTCAGGAGACATGGCTCAAGCCT
GGCTCCAGCTTTAGGATTCCAGGTTATTCATGCCTCCGTGACGACAGGCCAGACGGTTAT
GCTGGTTCTGCCATTTTGATTTCTAGAAAAGTAATCTTTAACCAAATTTCTCTTCCTCCC
CACAGTGAAGGCATTTCTGCCGTTGCTGTCAGAGCTTTGAATATTTCTTCCCTCTCTATT
TATATCCCTAACCCTTCCCTTGCCCTACTCTCTGAGTTAAGGCATATTATCTCATTTCTT
CCCCCTCCTGTCCTTATCTTGGGTGATCTTAATATCCACCATATCTCTTGGGGTTGTTAC
TACTCTGACTCTATTTCTAACCCCTTTCTTGATCTCTGTGATGAACTAAATCTATGTGTC
TTGAATGATGGTTCCCCTACTCGTAGAGTGTCCCCGCATCAGAATCCAAATACTGCAGTT
GATCTTTCAATAACTTCCCCCTCCTTGGCTCCTAGTTTGACCTGGTCTGTTCTCCCCTCT
ACTCATGGTAGCGACCATTTTACCATATTACTCTCTACCCCCGACCCCGCCTCCCCAAAA
CCTAACCCCACTCTTCCCCCTCTCCTTAAATTTAATTTATTCAAAGCTGATTGGCCACTT
TTCAGATCCCTTCTGGATCAAGAAATCTCTCAAATTCCCTTAGTTAACTCGCACAACGTT
CTTGCTATGTACCAACTCTTCAAAAACGCCGTTATAGTATCCGCGACTAATTCTTTCCCT
CTTAAAAATTCAGTTACTGGTAAAATCTCTTCTCCTCCATGGTGGGATTCACAATGCACG
GAGTGGTGTGAGAAACGCTGGAATGCAGAAAGAATTTATTCAGAGACTATGAGTCCAGAG
AATTTCACCTCTTATCAGAGAGTGGCAGCCCAGACTATCAGGTTTCTGTCAGAAAAGAAA
AGATCTGGTTGGTTTCGCTTCTGTGAAGAACTCTCTCCTAGTACTCCCCCTACTAAGATT
TGGAAGAATTTGAGACGTTTTCGGAATTCTGTTTCTGGTGCTATCATTTCATCTAATGAC
TCCTCAGGCTGGATTGACCAGTTCTCCTTCAAATTAGCCCCTCCTTCAGTTCCTTCGTTG
GAAGAACTTTTCCCCCCCTTATCGCATTGTTATTCTTCTGACAAGTTTGAGTCCCCTTTC
TCTTGGGAGGAGCTTTCAACAGTCCTTGAGGGTCTCAAGGACTCATCGCCAGGAATAGAT
GCTAAATCTCAATTTGGCTTCCGCAAAGGATTTGGTACCATCGACAGCTTAAGTATAATT
CTGACCGACATCCGCATTGCTCTATCAAAAAATGAATGTGTGGTTGGCGTTTTTTTAGAT
ATTTCTTCAGCTTATGATAATGTTCTTCTTCCAATACTCAGGCAGAAAATGCTCCAGCTG
AGTATTCCTGCGAGGTTGCTAAACATTATCCTCAGTCTTCTGTCTTCTAGATCTGTTTCC
ATTCGCTCTCCTAACTATAATTCTTCTCCTAGACAAGTATGGAAAGGGCTTCCCCAAGGC
TCAGTCCTTAGCCCGTTACTCTTTAGTATGTACACATTCGACTTAGAACTCTCAGTCAAT
CCTTTTTGTGAAGTCCTCCAATATGCCGACGACTTGGCTCTTTATGTCTCCGCAAAGAAA
ATTGATGAGGCCTCTTCCCGTCTCAACTCAGCTGTAAGCTACCTTCAGGATTGGCTGCAT
AACCACGGGTTATCTCTATCTATTCCTAAAAGCAAAGTGGTAGTTTTTTCTCGTTTCAGA
TCTATTCCAGATATCTCTATTTCTTATAGACAACAAAAGTTTATGGTTAAGGATAAAGTC
AACTTTCTTGGGTTTACTTTGGACTCGAGGCTAACTGGCATCCAACATATAAATAATATT
ATGAAAAAATGTGAAAATAATATTAACATTTTGCGTTCTCTTTCTGGTGTTTGGTGGGGC
AGCCACCCCTATACTCAAAAAATTTTATACAATGCTATAATACGCAGTCATTTTGATTAT
GGATCCTTTCTCCTTGTCCCTTGTATTAAATCTGCCTTGTCTATTCTTGATAAAATTCAA
GCTAAATGCCTGAGAATAATTTGTGGGGCTATGAAATCATCTCCAATTAACGCTCTTCAG
GTAGAATGTGGTGAAGCCCCTCTGCATCTTAGAAGACAATACTTAAGTGACCGTTTCTTT
TTAAAAGTCATTCAATTTTCTAATCACCCCCTCATTCCTAAACTGAACTCTCTCTCTGAT
CTCATTCCTTCTAACAAGTATTGGTCCCATAAAGAATATCCTTGTCTCCTTACTAGTTTA
GTCAAATTCCTTCGTCTCCCTTGTCCCGTTCTACAGAACCAAATGTTCCCGCTTTTTGCC
ACTCCATATGATGTCCTTAACTTCCATCCTCAAATTTTGCTTGAGTTTGGCATTGATAAA
GGTTCCGCTATTGCAAACGTTCAATTTCAAAATTACGTAAAAGAGCACTGGTCTGATTGG
CTCTGCATTTATACGGATGCCTCAAAAATGGCCGACCAAAGTAATGCAGGTGCTGCTGTT
TGGATCCCAAAATACAACATTATTTTAAATTTTAAATTTCCCTCCGAAATATCAATCTTC
ACTGCGGAGTCTATAGCTATTTTGGAAGCAGTGAGTTTTGTCGAGTCCCATAAGCTGAAC
AATTCTATTATTTTTAGTGATTCTAAAAGTTGCCTCCAGGCTATTGCAAGGAACCCTTTT
ATATCCAAACATAACTATCCCTACATTTTAAAGATTAAAGATATTCTCTTCCGTTGTCAA
AGTAGTAATATCCAGGTCAGACTTGCGTGGATCCCTAGCCACTCTGGCATCCACGGCAAT
GAGACCGTAGACTACTATGCTAAAGATGCTACTAATACCGGCTGTATGGACCACTTCGGG
GTTTATCCCAATGATTTGATCCCCATTGCAAAACAGCGTTTTTTCTCATCCTGGACACAG
TATTGGCTAAAAACTAGCCGTTCCAAAGGGAAATACTATGCGGATGTCCAATCCTTGATT
CCATTTCGTCCTTGGTTTTGCAATTTTAAAAACTTACATAAGCGCGTTTCTAGTATAATC
TGTAGATTAAGGCTAGGTCACGCGTGTACCCCCGTCCACTTGGCTAAGCTACGCATAAAG
GATAGCTCTCTCTGTGAGTGCGGTCTTGATGAAGGATCCCCAGATCACATTTTTTTCACT
TGCCCTCGACTGTCTTCCTCTCTCTATGATCTCCTCCCTCCCGATATCCCCCCCATTGAC
TTCAAGTCCTTACTCTCTTTTGTTAATTCTATGTTCTTTTAA

Protein sequence:

MPNILQWNCHSIHHKKNDIISLINNYNPSIFAIQETWLKPGSSFRIPGYSCLRDDRPDGY
AGSAILISRKVIFNQISLPPHSEGISAVAVRALNISSLSIYIPNPSLALLSELRHIISFL
PPPVLILGDLNIHHISWGCYYSDSISNPFLDLCDELNLCVLNDGSPTRRVSPHQNPNTAV
DLSITSPSLAPSLTWSVLPSTHGSDHFTILLSTPDPASPKPNPTLPPLLKFNLFKADWPL
FRSLLDQEISQIPLVNSHNVLAMYQLFKNAVIVSATNSFPLKNSVTGKISSPPWWDSQCT
EWCEKRWNAERIYSETMSPENFTSYQRVAAQTIRFLSEKKRSGWFRFCEELSPSTPPTKI
WKNLRRFRNSVSGAIISSNDSSGWIDQFSFKLAPPSVPSLEELFPPLSHCYSSDKFESPF
SWEELSTVLEGLKDSSPGIDAKSQFGFRKGFGTIDSLSIILTDIRIALSKNECVVGVFLD
ISSAYDNVLLPILRQKMLQLSIPARLLNIILSLLSSRSVSIRSPNYNSSPRQVWKGLPQG
SVLSPLLFSMYTFDLELSVNPFCEVLQYADDLALYVSAKKIDEASSRLNSAVSYLQDWLH
NHGLSLSIPKSKVVVFSRFRSIPDISISYRQQKFMVKDKVNFLGFTLDSRLTGIQHINNI
MKKCENNINILRSLSGVWWGSHPYTQKILYNAIIRSHFDYGSFLLVPCIKSALSILDKIQ
AKCLRIICGAMKSSPINALQVECGEAPLHLRRQYLSDRFFLKVIQFSNHPLIPKLNSLSD
LIPSNKYWSHKEYPCLLTSLVKFLRLPCPVLQNQMFPLFATPYDVLNFHPQILLEFGIDK
GSAIANVQFQNYVKEHWSDWLCIYTDASKMADQSNAGAAVWIPKYNIILNFKFPSEISIF
TAESIAILEAVSFVESHKLNNSIIFSDSKSCLQAIARNPFISKHNYPYILKIKDILFRCQ
SSNIQVRLAWIPSHSGIHGNETVDYYAKDATNTGCMDHFGVYPNDLIPIAKQRFFSSWTQ
YWLKTSRSKGKYYADVQSLIPFRPWFCNFKNLHKRVSSIICRLRLGHACTPVHLAKLRIK
DSSLCECGLDEGSPDHIFFTCPRLSSSLYDLLPPDIPPIDFKSLLSFVNSMFF
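
Consistency check (illustrative sketch):

The CDS length and the two sequences in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original annotation. It assumes the standard genetic code (ATG start; TAA, TAG or TGA stop) and that the listed CDS includes the stop codon. The function names are hypothetical helpers introduced here for illustration.

```python
# Stop codons of the standard genetic code (assumption: no alternative code applies).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length: int) -> int:
    """Return the number of residues encoded by a CDS that includes its stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length // 3 - 1


def has_start_and_stop(first_codon: str, last_codon: str) -> bool:
    """Check that a CDS begins with ATG and ends with a standard stop codon."""
    return first_codon == "ATG" and last_codon in STOP_CODONS


# Values copied from this record: the CDS length, and the first and last
# lines of the nucleotide sequence.
cds_length = 3402
first_line = "ATGCCAAACATCCTTCAATGGAACTGTCACAGCATTCACCATAAAAAAAATGATATTATT"
last_line = "TTCAAGTCCTTACTCTCTTTTGTTAATTCTATGTTCTTTTAA"

print(expected_protein_length(cds_length))              # 1133
print(has_start_and_stop(first_line[:3], last_line[-3:]))  # True
```

Under these assumptions, 3402 nt corresponds to 1134 codons, that is 1133 residues plus one stop codon, which matches the length of the protein sequence listed above.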