New model in OGS2.0 | DPOGS209116  |
---|---|
Genomic Position | scaffold2185:+ 7830-11365 |
See gene structure | |
CDS Length | 3402 |
Paired RNAseq reads   | 101 |
Single RNAseq reads   | 291 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012290 (6e-26) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | PREDICTED: similar to pol-like protein [Nasonia vitripennis] (1e-80) |
Best NR hit (blastx)   | PREDICTED: similar to pol-like protein [Nasonia vitripennis] (1e-76) |
GeneOntology terms    | GO:0006278 RNA-dependent DNA replication GO:0003964 RNA-directed DNA polymerase activity GO:0003723 RNA binding |
InterPro families    | IPR000477 Reverse transcriptase IPR002156 Ribonuclease H domain IPR005135 Endonuclease/exonuclease/phosphatase IPR012337 Ribonuclease H-like |
Orthology group | MCL10203 |
Nucleotide sequence:
ATGCCAAACATCCTTCAATGGAACTGTCACAGCATTCACCATAAAAAAAATGATATTATT
TCTCTCATTAATAATTACAACCCCTCTATTTTTGCCATTCAGGAGACATGGCTCAAGCCT
GGCTCCAGCTTTAGGATTCCAGGTTATTCATGCCTCCGTGACGACAGGCCAGACGGTTAT
GCTGGTTCTGCCATTTTGATTTCTAGAAAAGTAATCTTTAACCAAATTTCTCTTCCTCCC
CACAGTGAAGGCATTTCTGCCGTTGCTGTCAGAGCTTTGAATATTTCTTCCCTCTCTATT
TATATCCCTAACCCTTCCCTTGCCCTACTCTCTGAGTTAAGGCATATTATCTCATTTCTT
CCCCCTCCTGTCCTTATCTTGGGTGATCTTAATATCCACCATATCTCTTGGGGTTGTTAC
TACTCTGACTCTATTTCTAACCCCTTTCTTGATCTCTGTGATGAACTAAATCTATGTGTC
TTGAATGATGGTTCCCCTACTCGTAGAGTGTCCCCGCATCAGAATCCAAATACTGCAGTT
GATCTTTCAATAACTTCCCCCTCCTTGGCTCCTAGTTTGACCTGGTCTGTTCTCCCCTCT
ACTCATGGTAGCGACCATTTTACCATATTACTCTCTACCCCCGACCCCGCCTCCCCAAAA
CCTAACCCCACTCTTCCCCCTCTCCTTAAATTTAATTTATTCAAAGCTGATTGGCCACTT
TTCAGATCCCTTCTGGATCAAGAAATCTCTCAAATTCCCTTAGTTAACTCGCACAACGTT
CTTGCTATGTACCAACTCTTCAAAAACGCCGTTATAGTATCCGCGACTAATTCTTTCCCT
CTTAAAAATTCAGTTACTGGTAAAATCTCTTCTCCTCCATGGTGGGATTCACAATGCACG
GAGTGGTGTGAGAAACGCTGGAATGCAGAAAGAATTTATTCAGAGACTATGAGTCCAGAG
AATTTCACCTCTTATCAGAGAGTGGCAGCCCAGACTATCAGGTTTCTGTCAGAAAAGAAA
AGATCTGGTTGGTTTCGCTTCTGTGAAGAACTCTCTCCTAGTACTCCCCCTACTAAGATT
TGGAAGAATTTGAGACGTTTTCGGAATTCTGTTTCTGGTGCTATCATTTCATCTAATGAC
TCCTCAGGCTGGATTGACCAGTTCTCCTTCAAATTAGCCCCTCCTTCAGTTCCTTCGTTG
GAAGAACTTTTCCCCCCCTTATCGCATTGTTATTCTTCTGACAAGTTTGAGTCCCCTTTC
TCTTGGGAGGAGCTTTCAACAGTCCTTGAGGGTCTCAAGGACTCATCGCCAGGAATAGAT
GCTAAATCTCAATTTGGCTTCCGCAAAGGATTTGGTACCATCGACAGCTTAAGTATAATT
CTGACCGACATCCGCATTGCTCTATCAAAAAATGAATGTGTGGTTGGCGTTTTTTTAGAT
ATTTCTTCAGCTTATGATAATGTTCTTCTTCCAATACTCAGGCAGAAAATGCTCCAGCTG
AGTATTCCTGCGAGGTTGCTAAACATTATCCTCAGTCTTCTGTCTTCTAGATCTGTTTCC
ATTCGCTCTCCTAACTATAATTCTTCTCCTAGACAAGTATGGAAAGGGCTTCCCCAAGGC
TCAGTCCTTAGCCCGTTACTCTTTAGTATGTACACATTCGACTTAGAACTCTCAGTCAAT
CCTTTTTGTGAAGTCCTCCAATATGCCGACGACTTGGCTCTTTATGTCTCCGCAAAGAAA
ATTGATGAGGCCTCTTCCCGTCTCAACTCAGCTGTAAGCTACCTTCAGGATTGGCTGCAT
AACCACGGGTTATCTCTATCTATTCCTAAAAGCAAAGTGGTAGTTTTTTCTCGTTTCAGA
TCTATTCCAGATATCTCTATTTCTTATAGACAACAAAAGTTTATGGTTAAGGATAAAGTC
AACTTTCTTGGGTTTACTTTGGACTCGAGGCTAACTGGCATCCAACATATAAATAATATT
ATGAAAAAATGTGAAAATAATATTAACATTTTGCGTTCTCTTTCTGGTGTTTGGTGGGGC
AGCCACCCCTATACTCAAAAAATTTTATACAATGCTATAATACGCAGTCATTTTGATTAT
GGATCCTTTCTCCTTGTCCCTTGTATTAAATCTGCCTTGTCTATTCTTGATAAAATTCAA
GCTAAATGCCTGAGAATAATTTGTGGGGCTATGAAATCATCTCCAATTAACGCTCTTCAG
GTAGAATGTGGTGAAGCCCCTCTGCATCTTAGAAGACAATACTTAAGTGACCGTTTCTTT
TTAAAAGTCATTCAATTTTCTAATCACCCCCTCATTCCTAAACTGAACTCTCTCTCTGAT
CTCATTCCTTCTAACAAGTATTGGTCCCATAAAGAATATCCTTGTCTCCTTACTAGTTTA
GTCAAATTCCTTCGTCTCCCTTGTCCCGTTCTACAGAACCAAATGTTCCCGCTTTTTGCC
ACTCCATATGATGTCCTTAACTTCCATCCTCAAATTTTGCTTGAGTTTGGCATTGATAAA
GGTTCCGCTATTGCAAACGTTCAATTTCAAAATTACGTAAAAGAGCACTGGTCTGATTGG
CTCTGCATTTATACGGATGCCTCAAAAATGGCCGACCAAAGTAATGCAGGTGCTGCTGTT
TGGATCCCAAAATACAACATTATTTTAAATTTTAAATTTCCCTCCGAAATATCAATCTTC
ACTGCGGAGTCTATAGCTATTTTGGAAGCAGTGAGTTTTGTCGAGTCCCATAAGCTGAAC
AATTCTATTATTTTTAGTGATTCTAAAAGTTGCCTCCAGGCTATTGCAAGGAACCCTTTT
ATATCCAAACATAACTATCCCTACATTTTAAAGATTAAAGATATTCTCTTCCGTTGTCAA
AGTAGTAATATCCAGGTCAGACTTGCGTGGATCCCTAGCCACTCTGGCATCCACGGCAAT
GAGACCGTAGACTACTATGCTAAAGATGCTACTAATACCGGCTGTATGGACCACTTCGGG
GTTTATCCCAATGATTTGATCCCCATTGCAAAACAGCGTTTTTTCTCATCCTGGACACAG
TATTGGCTAAAAACTAGCCGTTCCAAAGGGAAATACTATGCGGATGTCCAATCCTTGATT
CCATTTCGTCCTTGGTTTTGCAATTTTAAAAACTTACATAAGCGCGTTTCTAGTATAATC
TGTAGATTAAGGCTAGGTCACGCGTGTACCCCCGTCCACTTGGCTAAGCTACGCATAAAG
GATAGCTCTCTCTGTGAGTGCGGTCTTGATGAAGGATCCCCAGATCACATTTTTTTCACT
TGCCCTCGACTGTCTTCCTCTCTCTATGATCTCCTCCCTCCCGATATCCCCCCCATTGAC
TTCAAGTCCTTACTCTCTTTTGTTAATTCTATGTTCTTTTAA
Protein sequence:
MPNILQWNCHSIHHKKNDIISLINNYNPSIFAIQETWLKPGSSFRIPGYSCLRDDRPDGY
AGSAILISRKVIFNQISLPPHSEGISAVAVRALNISSLSIYIPNPSLALLSELRHIISFL
PPPVLILGDLNIHHISWGCYYSDSISNPFLDLCDELNLCVLNDGSPTRRVSPHQNPNTAV
DLSITSPSLAPSLTWSVLPSTHGSDHFTILLSTPDPASPKPNPTLPPLLKFNLFKADWPL
FRSLLDQEISQIPLVNSHNVLAMYQLFKNAVIVSATNSFPLKNSVTGKISSPPWWDSQCT
EWCEKRWNAERIYSETMSPENFTSYQRVAAQTIRFLSEKKRSGWFRFCEELSPSTPPTKI
WKNLRRFRNSVSGAIISSNDSSGWIDQFSFKLAPPSVPSLEELFPPLSHCYSSDKFESPF
SWEELSTVLEGLKDSSPGIDAKSQFGFRKGFGTIDSLSIILTDIRIALSKNECVVGVFLD
ISSAYDNVLLPILRQKMLQLSIPARLLNIILSLLSSRSVSIRSPNYNSSPRQVWKGLPQG
SVLSPLLFSMYTFDLELSVNPFCEVLQYADDLALYVSAKKIDEASSRLNSAVSYLQDWLH
NHGLSLSIPKSKVVVFSRFRSIPDISISYRQQKFMVKDKVNFLGFTLDSRLTGIQHINNI
MKKCENNINILRSLSGVWWGSHPYTQKILYNAIIRSHFDYGSFLLVPCIKSALSILDKIQ
AKCLRIICGAMKSSPINALQVECGEAPLHLRRQYLSDRFFLKVIQFSNHPLIPKLNSLSD
LIPSNKYWSHKEYPCLLTSLVKFLRLPCPVLQNQMFPLFATPYDVLNFHPQILLEFGIDK
GSAIANVQFQNYVKEHWSDWLCIYTDASKMADQSNAGAAVWIPKYNIILNFKFPSEISIF
TAESIAILEAVSFVESHKLNNSIIFSDSKSCLQAIARNPFISKHNYPYILKIKDILFRCQ
SSNIQVRLAWIPSHSGIHGNETVDYYAKDATNTGCMDHFGVYPNDLIPIAKQRFFSSWTQ
YWLKTSRSKGKYYADVQSLIPFRPWFCNFKNLHKRVSSIICRLRLGHACTPVHLAKLRIK
DSSLCECGLDEGSPDHIFFTCPRLSSSLYDLLPPDIPPIDFKSLLSFVNSMFF