Genomic Position | scaffold1623:- 15674-22870 |
---|---|
CDS Length | 2172 |
Paired RNAseq reads   | 962 |
Single RNAseq reads   | 3534 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013472 (2e-07) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | endonuclease-reverse transcriptase [Bombyx mori] (6e-110) |
Best NR hit (blastx)   | endonuclease-reverse transcriptase [Bombyx mori] (7e-108) |
GeneOntology terms    | GO:0006278 RNA-dependent DNA replication GO:0003723 RNA binding GO:0003964 RNA-directed DNA polymerase activity |
InterPro families   | IPR000477 Reverse transcriptase |
Orthology group | MCL10014 |
Nucleotide sequence:
ATGGAGGACTTCGAGGTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAA
CGGTATGTAGATGACACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAAC
CATCTCAATTCTATCAATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCT
TTAGCTTTCCTTGATATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTT
TATAGGAAACCCACACATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATC
CAGTTAGCTACCGTTGGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGAC
CACCTAGAGGCCGAGCTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTG
CCTCGCCAGCATCGCAAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATA
CTACCATATGTGAAGGGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATT
AAAACTATTTACAAACCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAAC
ATTCCTTTACAACAAGCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATT
GGACAGACGAAGAGGAGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAAC
AGGCGCGCGTCGAAGTCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATT
CGTTTTGATAAACCTCAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGC
GAGGCTATTGAAATTAAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTATCA
AACACCTGGGACCCCGTTCTTAAAAATATAAAATCCCATGTCCGAAACCACACCGCAGGA
CCTCAAGACACCGTGAGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGA
AATCGATGGCGGCTATCGAGGGCCATTGAACCTCTTCTTCGCAGAGAACAGGCTGGATTC
CGACCCAACAGATCGTGCACCGACCAGATTATTACCTTACGCATAATCCTCGAACAAGCA
TCAGAATGGCAGAGGGAAATGTATTTGACCTTTGTGGATTTCGAAAAAGCTTTCGACACG
CTGAGATGGACAGGTATCTGGGAACGTTTACGTGAAGTTGGAGTCCCCGACAAAATAATC
AACCTGATAAAAGCCCTCTACAGGAAATATTCCTGTAAAGTAATTCACAACGGTCTTTTG
TCGGAGGACATACCAGTCAATGCAGGTGTTCGCCAGGGGTGTCTCCTTTCTCCTATTCTC
TTCCTTGTCGTTCTGGATGGTATCATGCAGAAAGTGACGAAAAGCAAGCGCCGCGGCATA
GAATGGGGACTGTCCAGCACTTTGGAAGATTTGGACTATGCCGATGACCTATGCCTGCTG
AGCCATACACACGCCAACATGCAAACCAAACTGGACGACCTACGACGAGAAGCATTAGAG
ATGGGGCTAAAAATAAACACGCGAAAGACCCAGGAGATGAGGTGCGGAGCAACAACCTCT
CTGCCGTTGCTCATTGGCACAGAGGCTATAGAAAAAATCCACAAATACACCTACCTAGGA
AGCATAGTATCGGAGAGTGGAGGTGCCGAAGAGGACATCGCTTCGAGAATCGCCAAATCA
AGAGCAGCCTTCGCGCAACTCCGTCCTGTATGGCAGTCGCGGAAACTAACCAGGAGAGTT
AAACTCAAAATATTCCGGTCCAACGTCAAATCCGTGCTGTTATACGGATGTGAGACGTGG
AAGGTTACTAAGGACATCTCGCATCGGCTTCAGGTCTTCGTCAATTGGTGTCTTCGCCGT
ATTCTCGGTATTTACTGGCCCGAGAAAATTTCCAACGTTAATCTCTGGGAACGCTGCGGT
GAGACACCGATTGACCTGCAAATCAAACGTCGCAAGTGGAAGTGGATCGGCCACGCGCTC
CGAAGGGATCCGGAACACATACCGAAACAAGCCCTAGACTGGACCCCTGAAGGAAAGCGG
AAACGACGAGATGGCGGCGTACTGTGGCTTCATCTAGGGGACATAGGACCAAGTAGAGAA
GAAGAGAATTAG
Protein sequence:
MEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSINSKIQCTIELEANNS
LAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATVGKSLLQRAQHLCDAD
HLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVKGVTDRIGNILKKVSI
KTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKRSIGTRVKEHISDIKN
RRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEIKKHPNFNREDGWNLS
NTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRLSRAIEPLLRREQAGF
RPNRSCTDQIITLRIILEQASEWQREMYLTFVDFEKAFDTLRWTGIWERLREVGVPDKII
NLIKALYRKYSCKVIHNGLLSEDIPVNAGVRQGCLLSPILFLVVLDGIMQKVTKSKRRGI
EWGLSSTLEDLDYADDLCLLSHTHANMQTKLDDLRREALEMGLKINTRKTQEMRCGATTS
LPLLIGTEAIEKIHKYTYLGSIVSESGGAEEDIASRIAKSRAAFAQLRPVWQSRKLTRRV
KLKIFRSNVKSVLLYGCETWKVTKDISHRLQVFVNWCLRRILGIYWPEKISNVNLWERCG
ETPIDLQIKRRKWKWIGHALRRDPEHIPKQALDWTPEGKRKRRDGGVLWLHLGDIGPSRE
EEN
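
The CDS length and the protein sequence in this record can be cross-checked with a little arithmetic: 2172 nucleotides are 724 codons, and dropping the terminal stop codon leaves 723 residues, which matches the protein listed above. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this gene; the helper names `translate` and `expected_protein_length` are hypothetical and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed to apply here.
# Codons are enumerated in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First 18 and last 12 nucleotides of the sequence in this record.
print(translate("ATGGAGGACTTCGAGGTG"))
print(translate("GAAGAGAATTAG"))
print(expected_protein_length(2172))
```

The first and last codons translate to the same residues that open and close the protein sequence, so the two listings are consistent with each other under this assumption.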