DPGLEAN17435 in OGS1.0

Genomic Positionscaffold33:- 245896-257062
See gene structure
CDS Length2133
Paired RNAseq reads  407
Single RNAseq reads  1973
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002745 (6e-09)
Best Drosophila hit  CG42342, isoform D (1e-07)
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (3e-163)
Best NR hit (blastx)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (4e-158)
GeneOntology terms



  
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families
  
IPR008160 Collagen triple helix repeat
IPR000477 Reverse transcriptase
Orthology groupMCL10012

Nucleotide sequence:

ATGCCACATCAGTGGCGTTATAGTTACATTACCCCTATATACAAAGGCAGGGGCAGTGTT
CAAGATTGTGGTAGTTATAGGGGCGTTAAGATCATGAGTCACACCATGAAGCTCTTTGAG
CGTATGATCGACCTCAGGCTCCGCCGAGAGTGTACTGTCTCGGAATGTCAATATGGATTT
CAGCCAGGATCGGGCACCTTGGACGCCATCTTTGCCATCAGAACTCTGATGGAGGCATAC
AGGGAAAAAAGGAGAGCTCTGCATGTCGCATTCCTAGATCTGCAGAAGGCCTTTGACTGC
GTGCCTCGTCAATGTATCTGGTGGGCATTGCGATTCAAAGGGATCCCTGAGGCCTATATT
GACATCATCAGAGACATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGAT
ACAAAACCCTTTCCGATCTCAGTAGGGCTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTG
TTCAATGTAGTGCTGGACACTGTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATG
ATGTATGCCGATGACATAGCGCTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTG
AACCTCTGGAAGGGTACGCTTGAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAG
TACATGGCTTGCGGAAGCCCGGACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTT
AAGTCGGAAAAGTTCAGGTACCTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCAC
GATGTCCAAGCCCAGATCAGCGCTACTTGGGAGAAATGGCGTGATGTCACAGGTGTGGTC
TGCGACCGCAGAATACCGCCCAAGCTCAAGGGACTAATATATAAGAGCATAATCCGACCG
GGAGACAAGGGAGACAAGGGTGAACGTGGTTTCACGACGACACTGAAAGGCGATGCGTTC
CCAACTGGCATCATCGAGGGTCCACCAGGTCCCCCCGGGCCTCCCGGGGCGGAAGGTGCG
CGCGGCGAGCGCGGAGCGGGGGGTGCTCCCGGCCCCCCCGGGGAGCGCGGCGCGAGAGGC
AAGCGGGGCAAGCGGGTAACACCACCCACTTCTGAATACGACCGCTATTGTACACTCTGT
ACTGACGGACACTGTGCGGTAGGCAAGGAAGGTGCGTCAGGACCTCGCGGACCGCCTGGT
TCGGACGGCCGACCCGGGGTCGCCGGGGTTCCAGGCCCGCCGGGAAAACCGGGAGAAATT
GGACCAAAGGGTGAAAAGGGCGACTACGGTGACATGGGGTCCCCGGGCATGCTCGGAGCT
CCGGGACTTCCTGGACCCCCGGGATACCCAGGCCTTAAGGGGGAGAAAGGAGACAAGGGG
GACTCGCAGAAGTACCGGAAGCTGAGACGCAGGCAGGGAGACGGGACCGGGTACGAGCTT
TATGGACACGAACTGATGATGGGCCCCCCGGGCTCGCCGGGCCCCGCGGGTCCCCCGGGC
GTGGCGGGCCCGCCCGGTATCAAGGGCGACAAGGGCGAGCCCGGAACACGCGGCAAGACT
GGTGAGCGCGGAGAGAAAGGTGACCCAGGACCCATGGGACTCCCGGGCCCAGTAGGTCTC
CCGGGGGAGGCGGGCGAGCCGGGCCGGCCGGGCGATACGGGGCCGAGGGGACCGCCCGGG
CTCGACGGGATGAAGGGAGCGCAGGGCGAGCCGGGCAGCAAGGGTGAGCGAGGAGATCCT
GGACTACCCGGAACAGATGGAATTCCAGGACAAGAGGGTCCGAAGGGTGACAAAGGCTAT
AAAGGAGAACCCGGACCAGGCGGAAAACGCGGCCGTAAGGGTGACAAAGGTGACCGTGGG
GAGCAAGGAGTTCCGGGACTGGACGCACCCTGCCCGCTAGGACCAGACGGACTGCCACTG
CCGGGATGCGGCTGGCGACCCTCGAAGGAAGTGGCGCGGGAGGAGCGGCTGGGAGGAGGA
GGTGACGGGACGCGCTCGGAGGACGACGCGGAGGAAGAAGATGCGGAGCCAGAAGACGAG
GGCGGTGACTATGAAGGGAGAGACGACCTCGAGCCGCCGAGAGACTACGACGACTACACA
GACAACGCGCATCACGACTCGCACCGGGACTGA

Protein sequence:

MPHQWRYSYITPIYKGRGSVQDCGSYRGVKIMSHTMKLFERMIDLRLRRECTVSECQYGF
QPGSGTLDAIFAIRTLMEAYREKRRALHVAFLDLQKAFDCVPRQCIWWALRFKGIPEAYI
DIIRDMYRDSVSMVRTAVGDTKPFPISVGLHQGSALSPFLFNVVLDTVSANIQDQPPWLM
MYADDIALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSPDSCTIHIGPEPAV
KSEKFRYLGSILHESGGIDHDVQAQISATWEKWRDVTGVVCDRRIPPKLKGLIYKSIIRP
GDKGDKGERGFTTTLKGDAFPTGIIEGPPGPPGPPGAEGARGERGAGGAPGPPGERGARG
KRGKRVTPPTSEYDRYCTLCTDGHCAVGKEGASGPRGPPGSDGRPGVAGVPGPPGKPGEI
GPKGEKGDYGDMGSPGMLGAPGLPGPPGYPGLKGEKGDKGDSQKYRKLRRRQGDGTGYEL
YGHELMMGPPGSPGPAGPPGVAGPPGIKGDKGEPGTRGKTGERGEKGDPGPMGLPGPVGL
PGEAGEPGRPGDTGPRGPPGLDGMKGAQGEPGSKGERGDPGLPGTDGIPGQEGPKGDKGY
KGEPGPGGKRGRKGDKGDRGEQGVPGLDAPCPLGPDGLPLPGCGWRPSKEVAREERLGGG
GDGTRSEDDAEEEDAEPEDEGGDYEGRDDLEPPRDYDDYTDNAHHDSHRD