Genomic Position | scaffold4553:+ 798-2927 |
---|---|
See gene structure | |
CDS Length | 2130 |
Paired RNAseq reads   | 475 |
Single RNAseq reads   | 1253 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | ND |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | PREDICTED: similar to polyprotein [Strongylocentrotus purpuratus] (6e-82) |
Best NR hit (blastx)   | PREDICTED: similar to polyprotein [Strongylocentrotus purpuratus] (6e-81) |
GeneOntology terms    | GO:0003964 RNA-directed DNA polymerase activity GO:0006278 RNA-dependent DNA replication GO:0005622 intracellular GO:0008270 zinc ion binding GO:0003723 RNA binding |
InterPro families   | IPR000477 Reverse transcriptase |
Orthology group | MCL10002 |
Nucleotide sequence:
ATGACGCGATTCATGCTCAGCAACAGGTATTTTAGGCCAAATCACCAGAAAGGGTTTTTA
CCCGGAATTTCTGGATGCCTAGAACACAACACTTTGCTGTCGGAGAGTTTAAAAGATGCT
AGGAAAAGCGAGAGGCAAATTACAGTTTGTTGGATAGACTTAGAGAATGCGTTCGGGTCG
ATACAACACGAATTGATGCTATTCGCGCTGAGATGGTACAACTTTCCGCCCCTAGTTACC
GATATGATCGCGTCGTACTACTCAAAATTAAAGTTTTCTATAACAACTAAAGAAGGCCAT
TCAAAAACTTTTAGTTACAATGTAGGACTATTTCAAGGTTGCTGTTTGTCTCCAATTGTA
TTTAATATTGTAATTAACATCTTAGTAGATAAATTAACCAGCAACGAGAAAAAATGGGGG
TATCGGTTCAAGTTTAATAATAAATACACGGAATCCATTTTAGCCTTCGCTGACGATCTC
GCAATACTGACACGTAACCCCAAACACTGTCAAGTACTATTAGATGAAGTGGATAGATTC
TGTGAGTGGACCGATGGATTGAGGACAAAACCAAGTAAATGTCACTGTCTGTGTCTCGGT
AGGCGGAGTACAAGATACACCTCATACGATCCGGGATTATCGTTAGGCGGTCAATGCATT
TCTACGGTTACAGAAAATGCACCATTTAAATTTCTCGGTCGGAAGATTGATAATATAGGT
CGTACTCCATCTTTAGAAGGTATAGTAGATAGCTTTTTAAACGATCTCAACAAGGTAGAT
AGCCAACAGATCAGTAACGTTAAAAAAGCTTGGATCTACGATAATTACCTAACTTCACGT
TTAAATTGGCCTTTCCTCGTTTATGATTTTAATAAAACCCTTTTGTCAAAGTTAGATGCA
GGCGTCATAAAGATGTTGAAGTTGTGGCTCGGGCTCGCGCTAACGGCTGATTCATCGGCT
TTATTTAGGGATCGCAATAGTTTTGGGATGAGTCTAAAAAGGCCATCGGAGCTCTACAAA
CACCTGAGAGTTTCCAAGAGATACATCCTGGGGAAATCCCAGGATGACGTCGTTACATCG
CTCCCAAAAGACAAAGATGCCCCAGAGCTAGAGTCAAGGCTTCAATTCCACAAGCAGTTT
ATGATAGGAGCGCAAAGTAACAGAGTAGGGTTAGGATCAAGTAGGAAGGTCCAAGATACG
GATATATTGAAGTCTTTTATTCGACAAGACGAGAATGATAAATATAAGATCCATGCAATA
AGTTTAGAAATGCAGAACGAGTGGTTAGACATAGGAGATTTTTGCATCCCATTAGCACTA
AAATGGCGCACCTTAATCCATGATTGGTCGCCAGCATTGCTAAAATTCTATCTCAATGCA
TTCCAGATGACTCTCCCAGATCAGAGTAATTTAGTAAGATGGGGTAAAGGTACCGAAAAG
ACTTGCTATATCTGTGGGAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAG
GTACTCCTCGATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTAGAAATCATACGT
GAAGCGGTTAGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCA
GTAGGTTTTGTGAGAGAGGGCACTAGGGCTATAAAAACAAATGTTAAGCCTTACTCCATC
CTTAAAGCGGCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATC
CCCGAGGATATTTGTGCGTCGGCCTCCAGACCGGACATATTCATGTATTCGCGAATCTTA
AAGCGCGTTGTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCAT
ACCATCAAGGTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTC
GTGGATTTATACGCGGTAGAAGTGGGAGCGAGAGGTATAACGGCTAAATCTCTCTACAAC
CTACTAAAAGACTTAGGCCTGTCCAGAACTCACATCAATTCGTTCTTGGAACGTACTTCG
AAGGCAGCCCTAGTAGGTTCTTTTCAAATATGGTTAGGTAGGGAGAGGAGCTTGGACAGT
GGAGGTGAGCGTTTAACGCGCGTTAGTTAG
Protein sequence:
MTRFMLSNRYFRPNHQKGFLPGISGCLEHNTLLSESLKDARKSERQITVCWIDLENAFGS
IQHELMLFALRWYNFPPLVTDMIASYYSKLKFSITTKEGHSKTFSYNVGLFQGCCLSPIV
FNIVINILVDKLTSNEKKWGYRFKFNNKYTESILAFADDLAILTRNPKHCQVLLDEVDRF
CEWTDGLRTKPSKCHCLCLGRRSTRYTSYDPGLSLGGQCISTVTENAPFKFLGRKIDNIG
RTPSLEGIVDSFLNDLNKVDSQQISNVKKAWIYDNYLTSRLNWPFLVYDFNKTLLSKLDA
GVIKMLKLWLGLALTADSSALFRDRNSFGMSLKRPSELYKHLRVSKRYILGKSQDDVVTS
LPKDKDAPELESRLQFHKQFMIGAQSNRVGLGSSRKVQDTDILKSFIRQDENDKYKIHAI
SLEMQNEWLDIGDFCIPLALKWRTLIHDWSPALLKFYLNAFQMTLPDQSNLVRWGKGTEK
TCYICGKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIREAVSLSVARAQKGITTNERS
VGFVREGTRAIKTNVKPYSILKAATDWTIMMDTCEKQYKIPEDICASASRPDIFMYSRIL
KRVVMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVVDLYAVEVGARGITAKSLYN
LLKDLGLSRTHINSFLERTSKAALVGSFQIWLGRERSLDSGGERLTRVS