DPGLEAN21669 in OGS1.0

Genomic Position  scaffold1223:- 17326-19834
CDS Length  1473
Paired RNAseq reads  672
Single RNAseq reads  1665
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007995 (8e-08)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (5e-159)
Best NR hit (blastx)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (1e-147)
GeneOntology terms
GO:0006278 RNA-dependent DNA replication
GO:0003723 RNA binding
GO:0003964 RNA-directed DNA polymerase activity
InterPro families  IPR000477 Reverse transcriptase
Orthology group  MCL10012

Nucleotide sequence:

ATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGATACAAAACCCTTTCCG
ATCTCAGTAGGGGTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTGTTCAATGTAGTGCTG
GACACTGTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATGATGTATGCCGATGAC
ATAGCGCTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTGAACCTCTGGAAGGGT
ACGCTTGAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAGTACATGGCTTGCGGA
AGCCCGGACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTTAAGTCGGAAAAGTTC
AGGTACCTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCACGATGTCCAAGCCCGG
ATCAGCGCTGCTTGGGCGAAATGGCGTGAGGTCACAGGTGTGGTCTGCGATCGCAGAATA
CCTACCAAGCTCAAGGGAATAATATACAAGAGCATAATCCGACCGGTTCTCTTATATGGA
AGCGAATGTTGGCCAACACTGTCCAGGCACACTCAGGAGCTTCACGTCACGGAGATGAAG
ATGCTGAGGTGGATGTGTGGCGTAACGCGGGCTGACCGTATACGTAACACATTTATCCGA
GGTAGTCTTGGAGTCCGTGACGTAGCGGATAAGCTTCAAGAGAGTCGCCTGAGATGGTAT
GGCCACGTTGCACGCCGGCCTGAGAATTACGTCGGAAAAATTTGCCTTGATATGTCGGTC
CCTGGAGCAAGACCCCCAGGACGCCCAAGAAAGCGATGGCTGGACACCGTGAAGCAGGAT
ATGAGAGCCAATGGACTTACCACCGCAGATGCTAAAGACCGTGCAAAGTGGAGGAGTTTG
AGCAGGAAGGGTTGGGGTTGGATAATATTGGAGAACTGTCAACCCAGAGAGTGTCCTACT
AAGTATTTTCCGATATTTGGTCTTGGGCATGCGGGTACGCGACCCCGGATCCTCGAACCA
CCACTTTCATACCTTTATCTCCAGTCCATGAAGGTGATGTCCTGTACTTCCCTCCCATTC
TCCTTCCTAATCTCTTTCCTCCCTTCTCTATCTTTCCCTTCCCTGGTTTTACCTATAAAT
TGGGGTTTTGGAGGACAAACGGCAGTCGTCCCGTTAAAACCAATTCTCGCCACTCATATG
ACTGACAAAGCTGAGCGATGTCTGGACACCGTGAAGCAGGATATGAGAGCCAATAGACTT
ACCACCGAAGATGCAAAAGACCGTTCAAAGTGGAGGAATTTGAGAACGAAGGCAGCCCCG
CCTAAAGCTGGGATAATTGCCAAGAAGAAGAATAGAAGAAGAATATTAGTGAACTGTGAG
AGTGAAAGTAGTAGCTGTGGTCAAGTAACTTCAGCTCGAGCATGGGCTGGTTTGACCGGG
GAAGTACCACTCTCTCACAGAAGATCGGCTTAA

Protein sequence:

MYRDSVSMVRTAVGDTKPFPISVGVHQGSALSPFLFNVVLDTVSANIQDQPPWLMMYADD
IALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSPDSCTIHIGPEPAVKSEKF
RYLGSILHESGGIDHDVQARISAAWAKWREVTGVVCDRRIPTKLKGIIYKSIIRPVLLYG
SECWPTLSRHTQELHVTEMKMLRWMCGVTRADRIRNTFIRGSLGVRDVADKLQESRLRWY
GHVARRPENYVGKICLDMSVPGARPPGRPRKRWLDTVKQDMRANGLTTADAKDRAKWRSL
SRKGWGWIILENCQPRECPTKYFPIFGLGHAGTRPRILEPPLSYLYLQSMKVMSCTSLPF
SFLISFLPSLSFPSLVLPINWGFGGQTAVVPLKPILATHMTDKAERCLDTVKQDMRANRL
TTEDAKDRSKWRNLRTKAAPPKAGIIAKKKNRRRILVNCESESSSCGQVTSARAWAGLTG
EVPLSHRRSA
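
The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original annotation: the function names are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive and that the CDS length counts the terminal stop codon (the nucleotide sequence above ends in TAA).

```python
def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of residues encoded by a CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # Assumption: when the CDS includes the stop codon, that codon
    # contributes no residue to the protein.
    return codons - 1 if includes_stop else codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


CDS_LENGTH = 1473                       # "CDS Length" field
GENE_START, GENE_END = 17326, 19834     # "Genomic Position" field

protein_len = expected_protein_length(CDS_LENGTH)
gene_span = span_length(GENE_START, GENE_END)

print(protein_len)             # residues expected in the protein sequence
print(gene_span)               # bases covered on scaffold1223
print(gene_span - CDS_LENGTH)  # bases in the span that are not coding
```

Under these assumptions the expected protein length is 490 residues, which matches the protein sequence listed above, and the genomic span is longer than the CDS, as expected for a gene model that includes non-coding sequence.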