DPGLEAN18386 in OGS1.0

Genomic Positionscaffold7689:+ 1785-4742
See gene structure
CDS Length2958
Paired RNAseq reads  1994
Single RNAseq reads  4880
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014330 (7e-40)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (0.0)
Best NR hit (blastx)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (0.0)
GeneOntology terms



  
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families
  
IPR000477 Reverse transcriptase
IPR005135 Endonuclease/exonuclease/phosphatase
Orthology groupMCL10012

Nucleotide sequence:

ATGGTTGCAAAAGTTTATGGACTGGGTACGGCTAGGGCGTCGCCGATAAACGACGCGCAG
GCACATCGCCCTACACCTGCGATAAGTGGGCAAGGGCTACCGTGTCAAGTACGGGCACGG
CTAAAGACGTTAGTACCCCAAAGGAAACTCCGCTTTGCAACGTGGAATATTGGCTCTCTA
ACCGGACGATGCAGAGAACTCGCAGATGTACTCATGAGACGTCGGGTCCAGTGTGCGTTC
CTCCAGGAAACTCGCTGGAAGGGCAATAAGTCTCGGAACATCGGACAGGGTTACCGGCTG
ATATACACTGGGTCGCCTTCAGGTAAAGCTGGTGTAGCGGTAGTGCTATCTGAAGAGCTT
CGGAATGGTCTTCTTGAAGTTGATCGTCGCTGCGACCGCCTGATGCGGGTGCGGGTACTG
ATCGAGGGAGTGATTACTAACTTGATCAGTGCTTATACTCCTCAGGCTGGATGTAGCGGG
TCCGAAAAAGAGTCATTCTGGGAGCAATTTGAAGAGGTTCTACGTGCTATACCAGCTGCT
GAAGTAATCATAGTTGGGGGGGACTTAAATGGCCACGTAGGTAGGGCTGCGGAAACGTTT
GACCGTGTACACGGTGGTTTTGGTTATGGTCGCCGCAATGCAGAGGGAGAAAATATTCTC
AGAACCTGTATTGCGTCTGACTTAGCCGTCGTGAACACGTTCTTCCAAAAGACTCCACAG
CACCTTATCACGTATAAGAGCGGGTCCCACTCAACCCAAATAGATTATCTGCTGACCAGA
CGGTGTCATATCAGCAAGGTGACTAACTGTAAAGTCATTCCTGGTGAAAGCCTGACGGCC
CAACATCGACTTCTTGTCATGGACTATGTCGTTACCCCGAAAAAGAAAGTGGCCGAGAAA
CGTAAGCCTCGCATCAGGTGGTGGTTGCTGAATGGAACGATGCAGACCAGCTTTCGGGCA
GAGGTTGAGAGTCAAAATCTGTCGACTAATACCGAAACTGCTCAGGAAGTTTGGGATCGA
GCCCAGTCAGCAATTATCACAGCAGGTAAACGAGTTCTAGGCCTTTCTAAGGGAGGACGG
GTCATTGACAAGGAGACATGGTGGTGGAATGACGAAGTGCAGGAGGTGATTCGTGAAAAG
AAGACTGCCTTTAAGAAGTGGCAGCAATCAAACTCTCCTGAAGACAGACTAGAGTACATA
GCAGCGAAACGTGCCAGTAAAAGGGCTGTTGCCAGAGCCCGCAGTGATAGGTTATCACCA
TTATATGATACACTTGAAACTGCGGAGGGGCAGAAGCTCATTTACAAATTGGCACGAGCT
CGGGATAAGGCGACGCAAGATATCGCAAAATGTCTTAGCGTCAAAGATTCCCAAGGCACG
TTGCTGTGTAATCATGCCTCTGTGAAGGAGAGATGGAGATGCTACTTCAAGGAGTTGCTA
AATACTCAGCACCCGTGCAGTCTTCCAACCGAAATACCTCCTAATCTTGGACTTATTGCC
CCGATAACACCTGACGAAACTCGGAATTGTCTTCGACGCATGAAGAATCGGAAAGCGGTG
GGACCTGACGATATTCCGATCGAAGCGTGGAAATCATTGGGCTCTCTTGGTGTGCTCATA
CTGACGGACCTTTTTAACCGCGTCTTGAACACTGGGACTATGCCACATCAGTGGCGTTAT
AGTTACATTACCCCTATATACAAAGGCAGGGGCAGTGTTCAAGATTGTGGTAGTTATAGG
GGCGTTAAGATCATGAGTCACACCATGAAGCTCTTTGAGCGTATGATCGACCTCAGGCTC
CGCCGAGAGTGTACTGTCTCGGAATGTCAATATGGATTTCAGCCAGGATCGGGCACCTTG
GACGCCATCTTTGCCATCAGAACTCTGATGGAGGCATACAGGGAAAAAAGGAGAGCTCTG
CATGTCGCATTCCTAGATCTGCAGAAGGCCTTTGACTGCGTGCCTCGTCAATGTATCTGG
TGGGCATTGCGATTCAAAGGGATCCCTGAGGCCTATATTGACATCATCAGAGACATGTAC
CGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGATACAAAACCCTTTCCGATCTCA
GTAGGGCTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTGTTCAATGTAGTGCTGGACACT
GTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATGATGTATGCCGATGACATAGCG
CTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTGAACCTCTGGAAGGGTACGCTT
GAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAGTACATGGCTTGCGGAAGCCCG
GACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTTAAGTCGGAAAAGTTCAGGTAC
CTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCACGATGTCCAAGCCCGGATCAGC
GCTGCTTGGGCGAAATGGCGTGAGGTCACAGGTGTGGTCTGCGATCGCAGAATACCTACC
AAGCTCAAGGGAATAATATACAAGAGCATAATCCGACCGGTTCTCTTATATGGAAGCGAA
TGTTGGCCAACACTGTCCAGGCACACTCAGGAGCTTCACGTCACGGAGATGAAGATGCTG
AGGTGGATGTGTGGCGTAACGCGGGCTGACCGTATACGTAACACATTTATCCGAGGTAGT
CTTGGAGTCCGTGACGTAGCGGATAAGCTTCAAGAGAGTCGCCTGAGATGGTATGGCCAC
GTTGCACGCCGGCCTGAGAATTACGTCGGAAAAATTTGCCTTGACATGTCGGTCCCTGGA
GCAAGACCCCCAGGACGCCCAAGAAAGCGATGGCTGGACACCGTGAAGCAGGATATGAGA
GCCAATGGACTTACCACCGCGGATGCTAAAGACCGTGCAAAGTGGAGGAGTTTGAGCAGG
AAGGCAGACCCTGGCTAA

Protein sequence:

MVAKVYGLGTARASPINDAQAHRPTPAISGQGLPCQVRARLKTLVPQRKLRFATWNIGSL
TGRCRELADVLMRRRVQCAFLQETRWKGNKSRNIGQGYRLIYTGSPSGKAGVAVVLSEEL
RNGLLEVDRRCDRLMRVRVLIEGVITNLISAYTPQAGCSGSEKESFWEQFEEVLRAIPAA
EVIIVGGDLNGHVGRAAETFDRVHGGFGYGRRNAEGENILRTCIASDLAVVNTFFQKTPQ
HLITYKSGSHSTQIDYLLTRRCHISKVTNCKVIPGESLTAQHRLLVMDYVVTPKKKVAEK
RKPRIRWWLLNGTMQTSFRAEVESQNLSTNTETAQEVWDRAQSAIITAGKRVLGLSKGGR
VIDKETWWWNDEVQEVIREKKTAFKKWQQSNSPEDRLEYIAAKRASKRAVARARSDRLSP
LYDTLETAEGQKLIYKLARARDKATQDIAKCLSVKDSQGTLLCNHASVKERWRCYFKELL
NTQHPCSLPTEIPPNLGLIAPITPDETRNCLRRMKNRKAVGPDDIPIEAWKSLGSLGVLI
LTDLFNRVLNTGTMPHQWRYSYITPIYKGRGSVQDCGSYRGVKIMSHTMKLFERMIDLRL
RRECTVSECQYGFQPGSGTLDAIFAIRTLMEAYREKRRALHVAFLDLQKAFDCVPRQCIW
WALRFKGIPEAYIDIIRDMYRDSVSMVRTAVGDTKPFPISVGLHQGSALSPFLFNVVLDT
VSANIQDQPPWLMMYADDIALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSP
DSCTIHIGPEPAVKSEKFRYLGSILHESGGIDHDVQARISAAWAKWREVTGVVCDRRIPT
KLKGIIYKSIIRPVLLYGSECWPTLSRHTQELHVTEMKMLRWMCGVTRADRIRNTFIRGS
LGVRDVADKLQESRLRWYGHVARRPENYVGKICLDMSVPGARPPGRPRKRWLDTVKQDMR
ANGLTTADAKDRAKWRSLSRKADPG