DPGLEAN20101 in OGS1.0

Genomic Position  scaffold3139:- 2105-3820
CDS Length  1716
Paired RNAseq reads  314
Single RNAseq reads  794
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002632 (4e-37)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  endonuclease-reverse transcriptase [Bombyx mori] (1e-171)
Best NR hit (blastx)  endonuclease-reverse transcriptase [Bombyx mori] (1e-166)
GeneOntology terms
GO:0003723 RNA binding
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
InterPro families  IPR000477 Reverse transcriptase
Orthology group  MCL10254

Nucleotide sequence:

ATGTACAAGGCGACGAGGATCTTGACCGGAAAACGGGCCTTGAAGAAGAAGCCCCTGAAG
GACAAAAGGGGTGAACTGATTGCTACATCGGAGGGACAGCTCCAACGGTGGCGTGAACAT
TTTGAGGAAATTTTCCAAGTCTCAAATAGCCCCATAACACCAACCACAGCTGCCAACCTG
CCCGCTGTAAGCCTCGATATCGACGAGTCTCCCCCCACTGAAGAGGAAGTAGTGAAAGCC
ATCCAGTCGCTTAAGAACGGTAAAGCTCCTGGATACGACCTCATCACGGCGGAAATGCTA
AAAGCAGATATACCTACCGCTGCGAACGCGCTAACACCTCTTCTGCGGAACATTTGGTCA
GCCGAGGAGTTACCGGATGACTGGACTAAAGGGCTTCTCATCACCGTGCCTAAGAAGGGA
GATCTCAGTGAATGCGCCAACTGGAGAGGCATTACCTTGCTTTCTACTCCGTCTAAAGTG
CTGTGCAAGATTATCCTAGACAGACTATCGAGGGCCATTGAACCTCTCCTTCGCAGAGAA
CAGGCTGGATTCCGACCCAACAGATCGTGCACCGACCAGATTATTACCTTACGCATAATC
CTCGAACAAGCATCAGAATGGCAGAGGGAAATGTATTTGACCTTTGTGGATTTCGAAAAA
GCTTTCGACACGCTGAGATGGACAGGTATCTGGGAACGTTTACGTGAAGTTGGAGTCCCC
GACAAAATAATCAACCTGATAAAAGCCCTCTACAGGAAATATTCCTGTAAAGTAATTCAC
AACGGTCTTTTGTCGGAGGACATACCAGTCAATGCAGGTGTTCGCCAGGGGTGTCTCCTT
TCTCCTATTCTCTTCCTTGTCGTTCTGGATGGTATCATGCAGAAAGTGACGAAAAGCAAG
CGCCGCGGCATAGAATGGGGACTGTCCAGCACTTTGGAAGATTTGGACTATGCCGATGAC
CTATGCCTGCTGAGCCATACACACGCCAACATGCAAACCAAACTGGACGACCTACGACGA
GAAGCATTAGAGATGGGGCTAAAAATAAACACGCGAAAGACCCAGGAGATGAGGTGCGGA
GCAACAACCTCTCTGCCGTTGCTCATTGGCACAGAGGCTATAGAAAAAATCCACAAATAC
ACCTACCTAGGAAGCATAGTATCGGAGAGTGGAGGTGCCGAAGAGGACATCGCTTCGAGA
ATCGCCAAATCAAGAGCAGCCTTCGCGCAACTCCGTCCTGTATGGCAGTCGCGGAAACTA
ACCAGGAGAGTTAAACTCAAAATATTCCGGTCCAACGTCAAATCCGTGCTGTTATACGGA
TGTGAGACGTGGAAGGTTACTAAGGACATCTCGCATCGGCTTCAGGTCTTCGTCAATTGG
TGTCTTCGCCGTATTCTCGGTATTTACTGGCCCGAGAAAATTTCCAACGTTAATCTCTGG
GAACGCTGCGGTGAGACACCGATTGACCTGCAAATCAAACGTCGCAAGTGGAAGTGGATC
GGCCACGCGCTCCGAAGGGATCCGGAACACATACCGAGACAAGCCTTAGACTGGACCCCT
GAAGGAAAGCGGAAACGTGGTCGCCCTAAGCAGACCTGGCGGCGGACGATCATTGCAGAG
GTCAAAAACATCGGCAAGACTTGGAGCGACATAAAAGGCGAAGCTCAAGATCGAACGAGA
TGGCGACGTACTGTGGATGCCCTCTGCCCCATCTAG

Protein sequence:

MYKATRILTGKRALKKKPLKDKRGELIATSEGQLQRWREHFEEIFQVSNSPITPTTAANL
PAVSLDIDESPPTEEEVVKAIQSLKNGKAPGYDLITAEMLKADIPTAANALTPLLRNIWS
AEELPDDWTKGLLITVPKKGDLSECANWRGITLLSTPSKVLCKIILDRLSRAIEPLLRRE
QAGFRPNRSCTDQIITLRIILEQASEWQREMYLTFVDFEKAFDTLRWTGIWERLREVGVP
DKIINLIKALYRKYSCKVIHNGLLSEDIPVNAGVRQGCLLSPILFLVVLDGIMQKVTKSK
RRGIEWGLSSTLEDLDYADDLCLLSHTHANMQTKLDDLRREALEMGLKINTRKTQEMRCG
ATTSLPLLIGTEAIEKIHKYTYLGSIVSESGGAEEDIASRIAKSRAAFAQLRPVWQSRKL
TRRVKLKIFRSNVKSVLLYGCETWKVTKDISHRLQVFVNWCLRRILGIYWPEKISNVNLW
ERCGETPIDLQIKRRKWKWIGHALRRDPEHIPRQALDWTPEGKRKRGRPKQTWRRTIIAE
VKNIGKTWSDIKGEAQDRTRWRRTVDALCPI
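
Consistency check (illustrative sketch):

The fields of this record can be cross-checked with simple arithmetic. The minimal Python sketch below is not part of the database entry. It assumes the genomic coordinates are 1-based and inclusive, that the CDS uses the standard genetic code, and that the protein is 571 residues long (a figure counted from the listing above, not a field in the record). The helper names `translate` and `span_length` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


# Values taken from the record above.
start, end = 2105, 3820
cds_length = 1716
protein_length = 571  # assumption: counted from the protein listing

# The genomic span equals the CDS length, consistent with an intron-free gene.
span_matches_cds = span_length(start, end) == cds_length

# 571 residues plus one stop codon account for all 1716 nucleotides.
protein_matches_cds = cds_length == 3 * (protein_length + 1)

# First and last codons of the nucleotide listing.
first_residues = translate("ATGTACAAGGCGACGAGG")
last_residues = translate("CCCATCTAG")
```

Under these assumptions both checks hold, and the translated fragments agree with the start (MYKATR) and end (PI, followed by a stop codon) of the protein sequence shown above.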