DPGLEAN00498 in OGS1.0

Genomic Position  scaffold4553:+ 798-2927
CDS Length  2130
Paired RNAseq reads  475
Single RNAseq reads  1253
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  ND
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  PREDICTED: similar to polyprotein [Strongylocentrotus purpuratus] (6e-82)
Best NR hit (blastx)  PREDICTED: similar to polyprotein [Strongylocentrotus purpuratus] (6e-81)
GeneOntology terms
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families  IPR000477 Reverse transcriptase
Orthology group  MCL10002

Nucleotide sequence:

ATGACGCGATTCATGCTCAGCAACAGGTATTTTAGGCCAAATCACCAGAAAGGGTTTTTA
CCCGGAATTTCTGGATGCCTAGAACACAACACTTTGCTGTCGGAGAGTTTAAAAGATGCT
AGGAAAAGCGAGAGGCAAATTACAGTTTGTTGGATAGACTTAGAGAATGCGTTCGGGTCG
ATACAACACGAATTGATGCTATTCGCGCTGAGATGGTACAACTTTCCGCCCCTAGTTACC
GATATGATCGCGTCGTACTACTCAAAATTAAAGTTTTCTATAACAACTAAAGAAGGCCAT
TCAAAAACTTTTAGTTACAATGTAGGACTATTTCAAGGTTGCTGTTTGTCTCCAATTGTA
TTTAATATTGTAATTAACATCTTAGTAGATAAATTAACCAGCAACGAGAAAAAATGGGGG
TATCGGTTCAAGTTTAATAATAAATACACGGAATCCATTTTAGCCTTCGCTGACGATCTC
GCAATACTGACACGTAACCCCAAACACTGTCAAGTACTATTAGATGAAGTGGATAGATTC
TGTGAGTGGACCGATGGATTGAGGACAAAACCAAGTAAATGTCACTGTCTGTGTCTCGGT
AGGCGGAGTACAAGATACACCTCATACGATCCGGGATTATCGTTAGGCGGTCAATGCATT
TCTACGGTTACAGAAAATGCACCATTTAAATTTCTCGGTCGGAAGATTGATAATATAGGT
CGTACTCCATCTTTAGAAGGTATAGTAGATAGCTTTTTAAACGATCTCAACAAGGTAGAT
AGCCAACAGATCAGTAACGTTAAAAAAGCTTGGATCTACGATAATTACCTAACTTCACGT
TTAAATTGGCCTTTCCTCGTTTATGATTTTAATAAAACCCTTTTGTCAAAGTTAGATGCA
GGCGTCATAAAGATGTTGAAGTTGTGGCTCGGGCTCGCGCTAACGGCTGATTCATCGGCT
TTATTTAGGGATCGCAATAGTTTTGGGATGAGTCTAAAAAGGCCATCGGAGCTCTACAAA
CACCTGAGAGTTTCCAAGAGATACATCCTGGGGAAATCCCAGGATGACGTCGTTACATCG
CTCCCAAAAGACAAAGATGCCCCAGAGCTAGAGTCAAGGCTTCAATTCCACAAGCAGTTT
ATGATAGGAGCGCAAAGTAACAGAGTAGGGTTAGGATCAAGTAGGAAGGTCCAAGATACG
GATATATTGAAGTCTTTTATTCGACAAGACGAGAATGATAAATATAAGATCCATGCAATA
AGTTTAGAAATGCAGAACGAGTGGTTAGACATAGGAGATTTTTGCATCCCATTAGCACTA
AAATGGCGCACCTTAATCCATGATTGGTCGCCAGCATTGCTAAAATTCTATCTCAATGCA
TTCCAGATGACTCTCCCAGATCAGAGTAATTTAGTAAGATGGGGTAAAGGTACCGAAAAG
ACTTGCTATATCTGTGGGAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAG
GTACTCCTCGATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTAGAAATCATACGT
GAAGCGGTTAGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCA
GTAGGTTTTGTGAGAGAGGGCACTAGGGCTATAAAAACAAATGTTAAGCCTTACTCCATC
CTTAAAGCGGCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATC
CCCGAGGATATTTGTGCGTCGGCCTCCAGACCGGACATATTCATGTATTCGCGAATCTTA
AAGCGCGTTGTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCAT
ACCATCAAGGTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTC
GTGGATTTATACGCGGTAGAAGTGGGAGCGAGAGGTATAACGGCTAAATCTCTCTACAAC
CTACTAAAAGACTTAGGCCTGTCCAGAACTCACATCAATTCGTTCTTGGAACGTACTTCG
AAGGCAGCCCTAGTAGGTTCTTTTCAAATATGGTTAGGTAGGGAGAGGAGCTTGGACAGT
GGAGGTGAGCGTTTAACGCGCGTTAGTTAG

Protein sequence:

MTRFMLSNRYFRPNHQKGFLPGISGCLEHNTLLSESLKDARKSERQITVCWIDLENAFGS
IQHELMLFALRWYNFPPLVTDMIASYYSKLKFSITTKEGHSKTFSYNVGLFQGCCLSPIV
FNIVINILVDKLTSNEKKWGYRFKFNNKYTESILAFADDLAILTRNPKHCQVLLDEVDRF
CEWTDGLRTKPSKCHCLCLGRRSTRYTSYDPGLSLGGQCISTVTENAPFKFLGRKIDNIG
RTPSLEGIVDSFLNDLNKVDSQQISNVKKAWIYDNYLTSRLNWPFLVYDFNKTLLSKLDA
GVIKMLKLWLGLALTADSSALFRDRNSFGMSLKRPSELYKHLRVSKRYILGKSQDDVVTS
LPKDKDAPELESRLQFHKQFMIGAQSNRVGLGSSRKVQDTDILKSFIRQDENDKYKIHAI
SLEMQNEWLDIGDFCIPLALKWRTLIHDWSPALLKFYLNAFQMTLPDQSNLVRWGKGTEK
TCYICGKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIREAVSLSVARAQKGITTNERS
VGFVREGTRAIKTNVKPYSILKAATDWTIMMDTCEKQYKIPEDICASASRPDIFMYSRIL
KRVVMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVVDLYAVEVGARGITAKSLYN
LLKDLGLSRTHINSFLERTSKAALVGSFQIWLGRERSLDSGGERLTRVS
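
Consistency check (illustrative sketch, not part of the original record):

The fields above can be cross-checked against each other. The short Python sketch below assumes that the genomic coordinates are 1-based and inclusive, that the CDS is a single exon, and that the standard genetic code (NCBI translation table 1) applies; none of these is stated in the record. The helper names span_length and translate are hypothetical. It checks that 798-2927 spans 2130 bases, and that the first and last rows of the nucleotide sequence translate to the start and end of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: the record does not name its table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the nucleotide sequence, copied from the record.
first_row = "ATGACGCGATTCATGCTCAGCAACAGGTATTTTAGGCCAAATCACCAGAAAGGGTTTTTA"
last_row = "GGAGGTGAGCGTTTAACGCGCGTTAGTTAG"

print(span_length(798, 2927))  # 2130, matching the CDS Length field
print(translate(first_row))    # MTRFMLSNRYFRPNHQKGFL
print(translate(last_row))     # GGERLTRVS*
print(2130 // 3 - 1)           # 709 residues once the stop codon is excluded
```

The span length equals the CDS length, which is what a single-exon model would give, and the 709 residues implied by a 2130-base CDS match the length of the protein sequence listed above.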