Genomic Position | scaffold33:- 245896-257062 |
---|---|
See gene structure | |
CDS Length | 2133 |
Paired RNAseq reads   | 407 |
Single RNAseq reads   | 1973 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002745 (6e-09) |
Best Drosophila hit   | CG42342, isoform D (1e-07) |
Best Human hit | ND |
Best NR hit (blastp)   | endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (3e-163) |
Best NR hit (blastx)   | endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (4e-158) |
GeneOntology terms    | GO:0003964 RNA-directed DNA polymerase activity GO:0006278 RNA-dependent DNA replication GO:0005622 intracellular GO:0008270 zinc ion binding GO:0003723 RNA binding |
InterPro families    | IPR008160 Collagen triple helix repeat IPR000477 Reverse transcriptase |
Orthology group | MCL10012 |
Nucleotide sequence:
ATGCCACATCAGTGGCGTTATAGTTACATTACCCCTATATACAAAGGCAGGGGCAGTGTT
CAAGATTGTGGTAGTTATAGGGGCGTTAAGATCATGAGTCACACCATGAAGCTCTTTGAG
CGTATGATCGACCTCAGGCTCCGCCGAGAGTGTACTGTCTCGGAATGTCAATATGGATTT
CAGCCAGGATCGGGCACCTTGGACGCCATCTTTGCCATCAGAACTCTGATGGAGGCATAC
AGGGAAAAAAGGAGAGCTCTGCATGTCGCATTCCTAGATCTGCAGAAGGCCTTTGACTGC
GTGCCTCGTCAATGTATCTGGTGGGCATTGCGATTCAAAGGGATCCCTGAGGCCTATATT
GACATCATCAGAGACATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGAT
ACAAAACCCTTTCCGATCTCAGTAGGGCTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTG
TTCAATGTAGTGCTGGACACTGTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATG
ATGTATGCCGATGACATAGCGCTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTG
AACCTCTGGAAGGGTACGCTTGAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAG
TACATGGCTTGCGGAAGCCCGGACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTT
AAGTCGGAAAAGTTCAGGTACCTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCAC
GATGTCCAAGCCCAGATCAGCGCTACTTGGGAGAAATGGCGTGATGTCACAGGTGTGGTC
TGCGACCGCAGAATACCGCCCAAGCTCAAGGGACTAATATATAAGAGCATAATCCGACCG
GGAGACAAGGGAGACAAGGGTGAACGTGGTTTCACGACGACACTGAAAGGCGATGCGTTC
CCAACTGGCATCATCGAGGGTCCACCAGGTCCCCCCGGGCCTCCCGGGGCGGAAGGTGCG
CGCGGCGAGCGCGGAGCGGGGGGTGCTCCCGGCCCCCCCGGGGAGCGCGGCGCGAGAGGC
AAGCGGGGCAAGCGGGTAACACCACCCACTTCTGAATACGACCGCTATTGTACACTCTGT
ACTGACGGACACTGTGCGGTAGGCAAGGAAGGTGCGTCAGGACCTCGCGGACCGCCTGGT
TCGGACGGCCGACCCGGGGTCGCCGGGGTTCCAGGCCCGCCGGGAAAACCGGGAGAAATT
GGACCAAAGGGTGAAAAGGGCGACTACGGTGACATGGGGTCCCCGGGCATGCTCGGAGCT
CCGGGACTTCCTGGACCCCCGGGATACCCAGGCCTTAAGGGGGAGAAAGGAGACAAGGGG
GACTCGCAGAAGTACCGGAAGCTGAGACGCAGGCAGGGAGACGGGACCGGGTACGAGCTT
TATGGACACGAACTGATGATGGGCCCCCCGGGCTCGCCGGGCCCCGCGGGTCCCCCGGGC
GTGGCGGGCCCGCCCGGTATCAAGGGCGACAAGGGCGAGCCCGGAACACGCGGCAAGACT
GGTGAGCGCGGAGAGAAAGGTGACCCAGGACCCATGGGACTCCCGGGCCCAGTAGGTCTC
CCGGGGGAGGCGGGCGAGCCGGGCCGGCCGGGCGATACGGGGCCGAGGGGACCGCCCGGG
CTCGACGGGATGAAGGGAGCGCAGGGCGAGCCGGGCAGCAAGGGTGAGCGAGGAGATCCT
GGACTACCCGGAACAGATGGAATTCCAGGACAAGAGGGTCCGAAGGGTGACAAAGGCTAT
AAAGGAGAACCCGGACCAGGCGGAAAACGCGGCCGTAAGGGTGACAAAGGTGACCGTGGG
GAGCAAGGAGTTCCGGGACTGGACGCACCCTGCCCGCTAGGACCAGACGGACTGCCACTG
CCGGGATGCGGCTGGCGACCCTCGAAGGAAGTGGCGCGGGAGGAGCGGCTGGGAGGAGGA
GGTGACGGGACGCGCTCGGAGGACGACGCGGAGGAAGAAGATGCGGAGCCAGAAGACGAG
GGCGGTGACTATGAAGGGAGAGACGACCTCGAGCCGCCGAGAGACTACGACGACTACACA
GACAACGCGCATCACGACTCGCACCGGGACTGA
Protein sequence:
MPHQWRYSYITPIYKGRGSVQDCGSYRGVKIMSHTMKLFERMIDLRLRRECTVSECQYGF
QPGSGTLDAIFAIRTLMEAYREKRRALHVAFLDLQKAFDCVPRQCIWWALRFKGIPEAYI
DIIRDMYRDSVSMVRTAVGDTKPFPISVGLHQGSALSPFLFNVVLDTVSANIQDQPPWLM
MYADDIALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSPDSCTIHIGPEPAV
KSEKFRYLGSILHESGGIDHDVQAQISATWEKWRDVTGVVCDRRIPPKLKGLIYKSIIRP
GDKGDKGERGFTTTLKGDAFPTGIIEGPPGPPGPPGAEGARGERGAGGAPGPPGERGARG
KRGKRVTPPTSEYDRYCTLCTDGHCAVGKEGASGPRGPPGSDGRPGVAGVPGPPGKPGEI
GPKGEKGDYGDMGSPGMLGAPGLPGPPGYPGLKGEKGDKGDSQKYRKLRRRQGDGTGYEL
YGHELMMGPPGSPGPAGPPGVAGPPGIKGDKGEPGTRGKTGERGEKGDPGPMGLPGPVGL
PGEAGEPGRPGDTGPRGPPGLDGMKGAQGEPGSKGERGDPGLPGTDGIPGQEGPKGDKGY
KGEPGPGGKRGRKGDKGDRGEQGVPGLDAPCPLGPDGLPLPGCGWRPSKEVAREERLGGG
GDGTRSEDDAEEEDAEPEDEGGDYEGRDDLEPPRDYDDYTDNAHHDSHRD