DPGLEAN04678 in OGS1.0

Genomic Positionscaffold1623:- 15674-22870
See gene structure
CDS Length2172
Paired RNAseq reads  962
Single RNAseq reads  3534
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013472 (2e-07)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase [Bombyx mori] (6e-110)
Best NR hit (blastx)  endonuclease-reverse transcriptase [Bombyx mori] (7e-108)
GeneOntology terms

  
GO:0006278 RNA-dependent DNA replication
GO:0003723 RNA binding
GO:0003964 RNA-directed DNA polymerase activity
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10014

Nucleotide sequence:

ATGGAGGACTTCGAGGTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAA
CGGTATGTAGATGACACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAAC
CATCTCAATTCTATCAATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCT
TTAGCTTTCCTTGATATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTT
TATAGGAAACCCACACATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATC
CAGTTAGCTACCGTTGGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGAC
CACCTAGAGGCCGAGCTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTG
CCTCGCCAGCATCGCAAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATA
CTACCATATGTGAAGGGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATT
AAAACTATTTACAAACCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAAC
ATTCCTTTACAACAAGCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATT
GGACAGACGAAGAGGAGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAAC
AGGCGCGCGTCGAAGTCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATT
CGTTTTGATAAACCTCAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGC
GAGGCTATTGAAATTAAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTATCA
AACACCTGGGACCCCGTTCTTAAAAATATAAAATCCCATGTCCGAAACCACACCGCAGGA
CCTCAAGACACCGTGAGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGA
AATCGATGGCGGCTATCGAGGGCCATTGAACCTCTTCTTCGCAGAGAACAGGCTGGATTC
CGACCCAACAGATCGTGCACCGACCAGATTATTACCTTACGCATAATCCTCGAACAAGCA
TCAGAATGGCAGAGGGAAATGTATTTGACCTTTGTGGATTTCGAAAAAGCTTTCGACACG
CTGAGATGGACAGGTATCTGGGAACGTTTACGTGAAGTTGGAGTCCCCGACAAAATAATC
AACCTGATAAAAGCCCTCTACAGGAAATATTCCTGTAAAGTAATTCACAACGGTCTTTTG
TCGGAGGACATACCAGTCAATGCAGGTGTTCGCCAGGGGTGTCTCCTTTCTCCTATTCTC
TTCCTTGTCGTTCTGGATGGTATCATGCAGAAAGTGACGAAAAGCAAGCGCCGCGGCATA
GAATGGGGACTGTCCAGCACTTTGGAAGATTTGGACTATGCCGATGACCTATGCCTGCTG
AGCCATACACACGCCAACATGCAAACCAAACTGGACGACCTACGACGAGAAGCATTAGAG
ATGGGGCTAAAAATAAACACGCGAAAGACCCAGGAGATGAGGTGCGGAGCAACAACCTCT
CTGCCGTTGCTCATTGGCACAGAGGCTATAGAAAAAATCCACAAATACACCTACCTAGGA
AGCATAGTATCGGAGAGTGGAGGTGCCGAAGAGGACATCGCTTCGAGAATCGCCAAATCA
AGAGCAGCCTTCGCGCAACTCCGTCCTGTATGGCAGTCGCGGAAACTAACCAGGAGAGTT
AAACTCAAAATATTCCGGTCCAACGTCAAATCCGTGCTGTTATACGGATGTGAGACGTGG
AAGGTTACTAAGGACATCTCGCATCGGCTTCAGGTCTTCGTCAATTGGTGTCTTCGCCGT
ATTCTCGGTATTTACTGGCCCGAGAAAATTTCCAACGTTAATCTCTGGGAACGCTGCGGT
GAGACACCGATTGACCTGCAAATCAAACGTCGCAAGTGGAAGTGGATCGGCCACGCGCTC
CGAAGGGATCCGGAACACATACCGAAACAAGCCCTAGACTGGACCCCTGAAGGAAAGCGG
AAACGACGAGATGGCGGCGTACTGTGGCTTCATCTAGGGGACATAGGACCAAGTAGAGAA
GAAGAGAATTAG

Protein sequence:

MEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSINSKIQCTIELEANNS
LAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATVGKSLLQRAQHLCDAD
HLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVKGVTDRIGNILKKVSI
KTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKRSIGTRVKEHISDIKN
RRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEIKKHPNFNREDGWNLS
NTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRLSRAIEPLLRREQAGF
RPNRSCTDQIITLRIILEQASEWQREMYLTFVDFEKAFDTLRWTGIWERLREVGVPDKII
NLIKALYRKYSCKVIHNGLLSEDIPVNAGVRQGCLLSPILFLVVLDGIMQKVTKSKRRGI
EWGLSSTLEDLDYADDLCLLSHTHANMQTKLDDLRREALEMGLKINTRKTQEMRCGATTS
LPLLIGTEAIEKIHKYTYLGSIVSESGGAEEDIASRIAKSRAAFAQLRPVWQSRKLTRRV
KLKIFRSNVKSVLLYGCETWKVTKDISHRLQVFVNWCLRRILGIYWPEKISNVNLWERCG
ETPIDLQIKRRKWKWIGHALRRDPEHIPKQALDWTPEGKRKRRDGGVLWLHLGDIGPSRE
EEN