Genomic Position | scaffold40:+ 22089-63374 |
---|---|
See gene structure | |
CDS Length | 1353 |
Paired RNAseq reads   | 2 |
Single RNAseq reads   | 137 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003794 (4e-23) |
Best Drosophila hit   | CG42313 (1e-08) |
Best Human hit | ND |
Best NR hit (blastp)   | endonuclease-reverse transcriptase [Bombyx mori] (2e-126) |
Best NR hit (blastx)   | endonuclease-reverse transcriptase [Bombyx mori] (3e-122) |
GeneOntology terms    | GO:0003964 RNA-directed DNA polymerase activity GO:0006278 RNA-dependent DNA replication GO:0005622 intracellular GO:0008270 zinc ion binding GO:0003723 RNA binding |
InterPro families   | IPR000477 Reverse transcriptase |
Orthology group | MCL10020 |
Nucleotide sequence:
ATGAAAGGTGTTGATGATGGCGGCGATGCTGGTGTGCGCACGGTGGACGTTCAGGCCGTG
GAGGGGATGGTAGCGCAATTCCCATGCGACCTCGGCACTGCGGCTAATGACAAAGTATAC
ATGGTGTTCTGGTTCAGGGATGACGCTGGCATTCCATTATACAGGTCTGGCTGTTTAAAC
GCAGATGGAATACCCCTTAAAATCAGTCTACCTCCTAGTCACGCTGATGCTGACTGTGAC
AAGAACTTATCGACATGTGGATTAATTTTGCTTGTAGTTTTACCAAGGGACGACTGTATG
GAGTTCCTCAATAGAGGAGTGAATAAGTATTTGCAGGGTTGGCAACGCGCAATAATATCC
CTGGTATTGCAGGCCTCTATAGGCACCGGTGACAACTTAACATCCGGCGGACTGGGCGGT
ACAAATGTTTACAACCGTCGTGAACAAGTCGAGTCTCGTAAAGGCTTCAGCACCGTAGAC
CACATCATCATCGTTCGGCAGATTGTACAGAAAACCGACGAGTACAATCAGCCGCTGTGT
CTGGCTTTTGTGGACTACGAAAAAGCCTTCGACTCCATCGAAACTTGGGCGGGTCTGGAC
GCTCTGCAACGATGTGGTATAGATTGGCAATACATCGAGGTGCTGAAAAGCCAATATGAA
ACCGCCCTCATGACCGTCCAGCTCCAGGACCATAAGACCAATCCCATCGAGCTGCATCGA
GGTGTGAGACAGGGGGATGTTATATTCCCGAAGCTGTTCACCAACGCACTCGATGACGTC
TTTAAGACTCTGGACTGGGCTGGAAGGGGTATAACGGTGAACGGTGAGCACATCTCGCAC
CTTCGGTTTGCCGACGACATCGTTATAGAAGCAGAGTCGCTGGAGCAGTTAAGCGGGATG
CTGCATAGCCTTAATGAAGCCTCCGGTGGTCTTGGCATGAACCTGGATAAAACCAAAGCC
ATGTTCAATGAACATGTTCTGCTAAGTCCGATATATGTCGAAGGATCGATGCTTGAAGTT
GTTCAGGAGTATATCTACCTAGGGCAAGTAATCAAGCTCGGTAGAAACAACTTCGAGCAA
GAAGTCGACCGCAGGGTTCAGTTGGGTTCGGCAACATTTAGCAAACTCCGTCGAGTTTTC
TCTTCGCCTATATCGCAATGCCTGAAGACAAAAGTGTACGACCAGTGCGTCCTACCTGTT
ATGACTTACGGTGCTGAAACGTGGACATTGACGGTTGGACTGGTCCATAGATTTAAATTC
GCACAGCGGGTTATGGACCGGGCTATGCTCGGAGTTTCTTTGAAGGATAAAGTTCGCAAT
GAGGTCATCCGACAAAGAACCAAGGTAACATAG
Protein sequence:
MKGVDDGGDAGVRTVDVQAVEGMVAQFPCDLGTAANDKVYMVFWFRDDAGIPLYRSGCLN
ADGIPLKISLPPSHADADCDKNLSTCGLILLVVLPRDDCMEFLNRGVNKYLQGWQRAIIS
LVLQASIGTGDNLTSGGLGGTNVYNRREQVESRKGFSTVDHIIIVRQIVQKTDEYNQPLC
LAFVDYEKAFDSIETWAGLDALQRCGIDWQYIEVLKSQYETALMTVQLQDHKTNPIELHR
GVRQGDVIFPKLFTNALDDVFKTLDWAGRGITVNGEHISHLRFADDIVIEAESLEQLSGM
LHSLNEASGGLGMNLDKTKAMFNEHVLLSPIYVEGSMLEVVQEYIYLGQVIKLGRNNFEQ
EVDRRVQLGSATFSKLRRVFSSPISQCLKTKVYDQCVLPVMTYGAETWTLTVGLVHRFKF
AQRVMDRAMLGVSLKDKVRNEVIRQRTKVT