DPGLEAN16155 in OGS1.0

Genomic Positionscaffold9804:+ 811-3410
See gene structure
CDS Length2298
Paired RNAseq reads  13
Single RNAseq reads  61
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006276 (2e-06)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase [Bombyx mori] (1e-176)
Best NR hit (blastx)  endonuclease-reverse transcriptase [Bombyx mori] (2e-164)
GeneOntology terms



  
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families
  
IPR005135 Endonuclease/exonuclease/phosphatase
IPR000477 Reverse transcriptase
Orthology groupMCL10020

Nucleotide sequence:

ATGGGTGCGACAAAAAAGCCACGGCCCCCAGTTCTCGGCGGTCCCGCTATTCTTGGTCAT
GGTGGCGACCATGGTAACGGCGGGACGGGGGGTGCGAAGAATCTCCGGGCTACGGGAGGC
CACCAAAATAAACTGACCCTGGCGACGTACAACGGACGCACGTTACGGCTTGACGAGCAT
CTGGCGCAACTCGAGGAGGAACTTGGGAAGATTAGGTGGCACATACTAGGGTTAAGTGAA
GTCCGAAGATCGGGGGAGGACACCGTAACTCTTGAATCGGGCCACCTAATGTACTTTCGC
GAGGGAGACCAACAATCCCAAGGTGGCGTGGGGTTTTTGGTAAATAAGTCCCTATTCGAT
AGCGTTGTGGAAATGTCTAGTGTGTCGAACAGGGTAGCGTACCTCATAGTAATGCTCACC
GAGAGATTCAGCCTCAAGGTGGTGCAAGTGTACGCTCCGACCTCGACACACTCGGATGGT
GAAGTGGAAGAAATGTTCGATGATATATCGAAGGCCCTCCACTACACTACTAAGACTCAC
TACAACGTTGTTATGGGAGACTTTAACGCTAAAGTGGGAGTACGGACTTGCAACGAATCG
GTAGTAGGATCCCATGGATTTGGAAGTAGGAATCATAGGGGGCAAATGCTCGTTAACTTC
CTGGAACGAGAGGGTCTCTTCTTGATGAACTCGTTCTTCAAGAAACAGCCCCAAAGGAAG
TGGACGTGGCAAAGCCCCGATACTATGACTAAAAATGAGATCGATTTCATCATGACGGAC
AAGAAACACATATTCAGAGACGTCTCCGTGATCAACAGTCAAATCTGGAGAACCGATTCA
AAAGTGTTCGTACAATCTCTTGGAAGAAGCCACCTGACGAAGTTGACTACAGCAAGTGGA
GAAGTCGTTGCTTCAATGCCGGCAGTCCTTTCAGAAGTGGAAGACTTCTATAGCCGGCTC
TACGCATCGCAGACATCTCGGCCTGATCCCGTGTGTGAGGATCCTAGAGCCACATTAACA
CGCCACTTTTCCGAAGACCTGCCAGAAGTCAGTATTGGCGAAATTGAGATCGCTCTTGGG
CAGCTTAAAAATGGAAAAGCCCCTGGAGAGGACGGCATAACAACAGAATTGTTGAATGCC
GGAGCTAAACCCGTACTGAGGGAGCTCCAAAAGCTGTTCAATTCTGTCCTCTTCGAAGGG
AGAACTCCGGAGGCGTGGAGTAGGAGTGTGGTCATCCTGTTCTTCAAAAAGGGAGACAAA
TCCCTGCTGAAGAACTATCGACCCATTTCCCTGCTAAGCCACGTATATAAGCTGTTTTCA
AGAGTGATCACAAATCGTCTTGCGCGAAGACTCGACGAATTCCAACCACCGGAACAGGCT
GGGTTTCGGAGCGGATACGGCACCATAGACCACATTCACACGGTGCGGCAGATTATACAG
AAGACCGAAGAGTATAATCAGCCCCTTTGTCTAGCATTCGTGGACTATGAGAAAGCTTTT
GACTCGATTGAAACTTGGTCTGTTCTGGAGTCCCTGCAACGTTGCCAAGTTGATTGGCGG
TATATCCAAGTGATAAGATGTCTCTACGAAGCCGCCACCATGTCCGTCCAAGTACAGAAT
CAGCAAACAAGTCCCATACCGTTGCATCGAGGAGTGAGACAGGGGGACGTTATCTCCCCG
AAACTGTTCACAAATGCAATGGAGGATATGTTTAAGACGCTGCGCTGGAAAGGACGAGGT
ATTAACATTAATGGCGAACACATCTCTCACTTGCGATTTGCAGACGACATCGTCATTATG
GCAGAAACGCTGCAGGACCTACAACAGATGCTTGATGACCTGGCTGACTCTTCTATACGC
ATCGGCCTACGGATGAACTTGGACAAAACCAAGGACATGTTCAATGAACATGTCCTACCG
GAACCGATTGCAGTCCACCGTGTCGTTCTCGAAGTTGTTCAAAAATATGTCTATCTGGGG
CAAGTATTGCAGTTGGGTAGAAACAATTTCGAGGACGAGTATATCCAAGTGATAAGATGT
CTCTACGAAGCCGCCACCATGTCCGTCCAAGTACAGAATCAGCAAACAAGTCCCATACCG
TTGCATCGAGGAGTGAGACAAGGGGACGTTATCTCCCCGAAATTGTTCACAAATGCAATG
GAGGATATGTTTAAGACGCTGCGCTGGAAAGGACGAGGTATTAACATTAATGGCGAACAC
ATCTCTCACTTGCGATTTGCAGACGACATNNAGTGGCGTGCTATTGGAGAGGCCTATGTC
CAGCAGTGTATAGGCTGA

Protein sequence:

MGATKKPRPPVLGGPAILGHGGDHGNGGTGGAKNLRATGGHQNKLTLATYNGRTLRLDEH
LAQLEEELGKIRWHILGLSEVRRSGEDTVTLESGHLMYFREGDQQSQGGVGFLVNKSLFD
SVVEMSSVSNRVAYLIVMLTERFSLKVVQVYAPTSTHSDGEVEEMFDDISKALHYTTKTH
YNVVMGDFNAKVGVRTCNESVVGSHGFGSRNHRGQMLVNFLEREGLFLMNSFFKKQPQRK
WTWQSPDTMTKNEIDFIMTDKKHIFRDVSVINSQIWRTDSKVFVQSLGRSHLTKLTTASG
EVVASMPAVLSEVEDFYSRLYASQTSRPDPVCEDPRATLTRHFSEDLPEVSIGEIEIALG
QLKNGKAPGEDGITTELLNAGAKPVLRELQKLFNSVLFEGRTPEAWSRSVVILFFKKGDK
SLLKNYRPISLLSHVYKLFSRVITNRLARRLDEFQPPEQAGFRSGYGTIDHIHTVRQIIQ
KTEEYNQPLCLAFVDYEKAFDSIETWSVLESLQRCQVDWRYIQVIRCLYEAATMSVQVQN
QQTSPIPLHRGVRQGDVISPKLFTNAMEDMFKTLRWKGRGININGEHISHLRFADDIVIM
AETLQDLQQMLDDLADSSIRIGLRMNLDKTKDMFNEHVLPEPIAVHRVVLEVVQKYVYLG
QVLQLGRNNFEDEYIQVIRCLYEAATMSVQVQNQQTSPIPLHRGVRQGDVISPKLFTNAM
EDMFKTLRWKGRGININGEHISHLRFADDXXWRAIGEAYVQQCIG