DPGLEAN10078 in OGS1.0

Genomic Position  scaffold7074:- 394-3084
CDS Length  2691
Paired RNAseq reads  34
Single RNAseq reads  87
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001404 (4e-11)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  PREDICTED: similar to endonuclease and reverse transcriptase-like protein [Acyrthosiphon pisum] (2e-136)
Best NR hit (blastx)  PREDICTED: similar to endonuclease and reverse transcriptase-like protein [Acyrthosiphon pisum] (3e-123)
GeneOntology terms
GO:0006278 RNA-dependent DNA replication
GO:0003964 RNA-directed DNA polymerase activity
GO:0003723 RNA binding
InterPro families
IPR000477 Reverse transcriptase
IPR005135 Endonuclease/exonuclease/phosphatase
Orthology group  MCL10070

Nucleotide sequence:

ATGCTGAAGATGATACAGTGGAACGCCGACGGTCTTAGCCGTCCGAAGCAGAAGCTCCTA
CGGATGCTCCTCTCCGAGTACAAAATCGACATCGCCCTCATCTCAGAGACCCATCTCAGG
CCATCGGATTCCCTGAAGTTGCCTGGATTCGTGATCTACCGGAGTGATCAGATATCTCCC
GCCGGGGTAGCCTACAGGGGCTTGGCTGCTCTGGTAAAACGACAAGTCATCCACCAGCCG
CTACCGGCAGTATCCCTCCGCTCAGCATATGCCCTAGGCGTAGAGGTGTGTCTTGACCGT
CGTCCCGTCCGTGTGTTTGCCTTTTATAAGCCGCCGCTCGCTCGCTTGGAGGAGAACGAC
ATCCACGTCCTCTTAAACCAGGCGACCCCCACCATCATCGCCGGCGACTTCAACTGCAAG
CACACGGCATGGAACTCTACACATGATGACCCCAACGGAATCAGGCTATTCACCGACGCG
GAGGCGGAAGGGTATGTAGTACTAGGACCGGAGGTTCCAACGCACTACCCCTACCAGCAG
ACCGCCGTGCCTGACGTTATCGACCTTACGATCGCACATGGTCTGAACACAGACCCCTCG
ATCGACGTCCTGGACGACCATATGATATCAGACCACCAACCAGTCATGATGACCCTAGAC
CTGACCCCTATCCGGACTGGGTTTCCCGCCCCTCGACAAAGGCAGGACTGGAGAAAGTTC
GCCGACCATCTGACCGAACACTTGAGGTCCTTCCCCCTGAACAATCCCGACGATGTAGAC
CGCCTGGCCAACGAGCTCACCGCCTCAATCACTCGGGCGCTTGAGGTCTCGCGGCTGTGC
ACTACATCTCACCGGAAAAAACCGCAACTGCCTGCCAAGATCCGCACAATGATAGAGGAG
AAGAGGAGGCTAAGGAGGGAATATCAGCGCACTCGATGCCCCACCACGAAATCCCAACTA
AACGCTCTGGCAGCCAGAGTCTCAGCCGTGCTGGAGGACCACGCGGTGGACTCCTGGTAC
AGGGCCATAGAGCAAGCTGGGGAAGACTGGATGGGCATCCACCGCATTTGCCGCCAAGTA
GCCAAGAAGCCTGTCCCGATCCGGCCCTTGCTCGCGCGAGATGGCACTCCACGCTACCGC
GCGGCGGACCGAGCGGAGATCTTCGCCGATCACCTAGAGACCCAATTCCAGCCAAACCCC
TCTGGGAACACGCAGCATGCAGAGGAAGTTCAACAGCTGCTGAGGAGCTACCTTGGGCAG
TCGATAGCCCCAGAGGAAAATCCCATCGTCCTCACCCCTGGCCAAGTGCAGCGTACGATC
CGCCGCACAAAGCTTCGGAAGGCCCCCGGTCCTGACGGAATCACTAACGAAGCCCTTCGT
CACCTTCCCGCGCGTGGCGTAGCAGCGGTGACACGCTTGTACAATGGAGTCCTCAGGACC
GGCCACTTCCCTACCCAGTGGAAGCTCGGACGAATCATCATGCTGCCGAAACCAGGGAAA
AACATCTTGCTGCCAGGGAGCTATCGCCCCATCACGCTCCTGTCAACCATCTCAAAGGTC
TTCGAAAAACTGCTGCTGCTGCACCTAGCACCACATATCCAGCCACGCGACGAACAGTTT
GGCTTCAGAGCGGAGCATTCCACCACGCTTCAGCTCGCGAGGGTCCTGCACGCTCTGTCG
GTGGCTCTAAACAAGAACGAGTCAGCCGTCGCAGTGATGCTCGACATGGAGAAAGCTTTC
GACCGCGTATGGCACCCCGGCCTACTGTACAAGCTGGCCACATCCACTACTCCTCGCCGA
ATAGTCAGGATCGTGGCCACCTTCCTACAAGATAGGCGATTCCAGGTGTCGGTGGAAGGC
ACCTTATCTACGCAACGCCCCATTAGAGCCGGAGTGCCGCAGGGGAGCTGCCTGTCTCCG
ATATGCTACTCTAAGTACACGGACGACATCCCAGTGGCGGAAGGCGCAACCTTAGCGCTC
TACGCCGACGATGCTGCCTATATTACTACATCCTTGACTCCCGCCCACGCGGCGACAAAA
ATGCAGCGTCAGCTGGACCAGCTCCCTCAATGGCTGGATAAGTGGCGTCTCAGAGTAAAC
GTTTCGAAGACCCAAGCGATCTCCATGGGACGACGACACCCACCACAGCCCCCCTCCCTG
ATGGGACAACCCCTCCAATGGGCACGGACGGTCAAGTACCTGGGCGTGACCATCGACCGT
GGCCTATCAATGAAACAACATACGAAGGAAGTCGTTGAGAAATCGCGAGCCGCGCGTGCT
CTCCTGCGACCCGTTCTCCGATCCGATCTCCCCCTCCGAGCCAAGCTGGCCGTCTACAAG
ACATACATAAGATCGCACCTAACATACGCCGCTCCCGCCTGGTATGCCTTAGTCAAGGAA
CCAGAGCGTGGACGCCTGCGCGCCCAGCAATCACTAGTGCTGAGGACTCTTGTAGACGCG
CCGTGGTGCGTTCGGAACGCGACCATCGCCAGAGACCTACACATGGAGAGCCTCGACGAC
TTCATAACTCGACTGAGTCGGGCCATGTTTCAGCGCGCCGACGCGTCGACCTTCGCACAC
ATACGCGAAGTGGCCCCGTATCACAGACGCCCGCCAGACGGACACGCCCTGCCGCGAGAT
CTTCTTCCCGCGGCCCTACCCGGACTTACCCCACACAAACATCCGCCGTAA

Protein sequence:

MLKMIQWNADGLSRPKQKLLRMLLSEYKIDIALISETHLRPSDSLKLPGFVIYRSDQISP
AGVAYRGLAALVKRQVIHQPLPAVSLRSAYALGVEVCLDRRPVRVFAFYKPPLARLEEND
IHVLLNQATPTIIAGDFNCKHTAWNSTHDDPNGIRLFTDAEAEGYVVLGPEVPTHYPYQQ
TAVPDVIDLTIAHGLNTDPSIDVLDDHMISDHQPVMMTLDLTPIRTGFPAPRQRQDWRKF
ADHLTEHLRSFPLNNPDDVDRLANELTASITRALEVSRLCTTSHRKKPQLPAKIRTMIEE
KRRLRREYQRTRCPTTKSQLNALAARVSAVLEDHAVDSWYRAIEQAGEDWMGIHRICRQV
AKKPVPIRPLLARDGTPRYRAADRAEIFADHLETQFQPNPSGNTQHAEEVQQLLRSYLGQ
SIAPEENPIVLTPGQVQRTIRRTKLRKAPGPDGITNEALRHLPARGVAAVTRLYNGVLRT
GHFPTQWKLGRIIMLPKPGKNILLPGSYRPITLLSTISKVFEKLLLLHLAPHIQPRDEQF
GFRAEHSTTLQLARVLHALSVALNKNESAVAVMLDMEKAFDRVWHPGLLYKLATSTTPRR
IVRIVATFLQDRRFQVSVEGTLSTQRPIRAGVPQGSCLSPICYSKYTDDIPVAEGATLAL
YADDAAYITTSLTPAHAATKMQRQLDQLPQWLDKWRLRVNVSKTQAISMGRRHPPQPPSL
MGQPLQWARTVKYLGVTIDRGLSMKQHTKEVVEKSRAARALLRPVLRSDLPLRAKLAVYK
TYIRSHLTYAAPAWYALVKEPERGRLRAQQSLVLRTLVDAPWCVRNATIARDLHMESLDD
FITRLSRAMFQRADASTFAHIREVAPYHRRPPDGHALPRDLLPAALPGLTPHKHPP
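
Consistency check (illustrative):

The coordinates, CDS length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original annotation. It assumes the genomic coordinates are 1-based and inclusive, and that the CDS includes its terminal stop codon (the nucleotide sequence above ends in TAA). The function names `parse_position`, `span_length`, and `expected_protein_length` are hypothetical helpers introduced only for this example.

```python
import re


def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length, includes_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is in frame; one codon is subtracted when the
    stop codon is counted as part of the CDS.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


scaffold, strand, start, end = parse_position("scaffold7074:- 394-3084")
cds_length = span_length(start, end)
protein_length = expected_protein_length(cds_length)

print(scaffold, strand, cds_length, protein_length)
```

Under these assumptions the genomic span equals the listed CDS length of 2691, which is consistent with the coding sequence occupying the whole interval, and the expected protein length matches the 896 residues listed above.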