DPGLEAN18418 in OGS1.0

Genomic Position  scaffold1061:+ 37997-43278
CDS Length  5190
Paired RNAseq reads  51
Single RNAseq reads  147
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013961 (2e-17)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  reverse transcriptase [Papilio xuthus] (0.0)
Best NR hit (blastx)  reverse transcriptase [Papilio xuthus] (0.0)
GeneOntology terms
  GO:0006278 RNA-dependent DNA replication
  GO:0003964 RNA-directed DNA polymerase activity
  GO:0003723 RNA binding
InterPro families
  IPR013084 Zinc finger, CCHC retroviral-type
  IPR001878 Zinc finger, CCHC-type
  IPR000477 Reverse transcriptase
  IPR005135 Endonuclease/exonuclease/phosphatase
Orthology group  MCL10007

Nucleotide sequence:

ATGAAAGCGAGTTGTAAAAACGATTTACCCCAGGAGGGTACCAGCCGCGCCGCGGCTGGA
GAATCCCTCACCGGTTCGCCGGTCTGCCCCTCGTATTCTGGGGGGGGCAACAACCGCTTC
GGGAGTCCTTCTTCTAAGGACTCTCTGGGACGAGGTGCAGTGGGCACGGGACCTGGCTTG
GGAAAAGGACACCGATCAGCAAAAGATTTGGACAAGGTGATGGTGGTAAGCTCTGACGAG
GAGCCTGTAGACGCGTCGGCGGTGCGACCAGTGGCGCGCCAGCCACTAGCGTCCAACGGT
GAGAAGGGAAGAGCAACCAGAAGGAGCCCGCGTACAACGAGCGGAAGTGAGATGGAGACG
GAGGAGACGCGCTCCGCCTTCTTCGCCGGTACGCCGACGAGCCTGGCGCCTCTCCGGAAG
CGGCCAGCGACAAGAAGACAACCAGGCGGGAGTTCGTCTGGCGGAAGTGACAAGGCTTCC
TTCGCTACTGCGGTGAAGAGAGGACGGGCAGTTGAGGAAGGCGAATCGAACTCGGAGGAG
GAAAACGTGGCGAGGTCGACGCGTCGGGTCGAGGTGGCCCTTTCCTCGGTTAAGACGCTG
CCAGCCTCGTGCCTCGCGAAAGAGATGGAGAGGGCCCTGAGCGTCATAGTCGACGTGGCC
CTCAAATCCAAGAATCTGAAGGGCGGATGCGTCAGGGCATTGAAGACGTCGGCGGCACTC
CTGGGGGAGGCAAAAGAGATTCTCCTGCAGCGGACCAGCGGCGAGGAGAATGAGATTCTC
CGAGCCCGGCTAGAGGAAGAAAGGAAGAAGAGCTCGCTGCTGGAGAAGGAGCTGGGGCTC
CTGAGAGAGGGGCAGGCCCGCTTGCGGGCAGACATGGACCTGCTCGCCACTGCCCCGAAA
CCAGCACGAGACGAGAAGAGCGAGGAGGAGCTTCGCGGGTCCCTCATGAGGGACCTAGGT
GCCATGATGGACGCGAAGCTCCAGGGGATCGCAGACCGGCTGCTCCCCGAAAAGCGCCTG
AGGCCGCCCCTAGCGGCGGACAAGAGGCCACCCCCAGCGCCTGCGTCGGCTGCTGTGGCT
GAGCCGGCAGGTAGAGTGGCGAGCAGGAAAAAGAACGGTGCCACAAGAGAACAGGAGAAG
ACGGCGAGACCTCTACCCCCGCCGCCTCCATCCATGGACAAGACATGGACGGAGGTGGTA
AAGAAGAAGGGCAAGAAGAAGACCGGCCCGCCAGTAGCCCCACCCCAAGAGGGGCCGAAG
GCGAAGAAGCCGGCCAAGGAGAAGAAGAAGGCAAAGAAGAAGAAGAAGAGGAGCCTCAAG
AGCGTGAAGAATCCTGCGGTGGTAATTACGCTGCCGCGGGATTCAGAAAGGACGTACGAG
GACGTCCTCAAAACGGCGCGGGCGGGACTCAACCTGGCAGAGCTAGGACTGCCGATGGGT
CTCGTCTGCAAGAAGACGGTGACGGGAGCCCGCATGTTCGAGCTGCCGGAGAGCGCGGAG
GAGGCGAAAGCCGACCTCCTGGCGCAAAAACTCCGGGAGCTCGTCCCTGAGGTGAAAGTC
GCCAGACCGGTGAAAAGCGGGGAACTGCGAATCACCGGTTTGGACGACTCCGTCACTAAG
GAGGACGTCCTCGCTGCGGTCGCTCGGCAGGGCAATTGCTCTGTCGAGCAAATACGGGTA
GGGCCGATCCGCTCCCGCGGGTCCTTCAGCACCGGGGCCGCTTGGGTGAGGTGCCCAGTG
GAGGTTGCCAAGTGCCTGGCCGAAGCCGGCCGCCTTTTGGTCGGCTGGAGCTCGGCACGG
GTGACACCGCTGGAGCCACGCCCTATGCGGTGCTACCGGTGCCTGGAGGTGGGACATGCG
GGCCTCAGATGTCCGTCCAAAACCGACCGCAGCCGTCTGTGCTTCCGTTGCGGCGGTGAA
GGACACATCGCTGAGGCCTGCACGTCGGACGCGAAGTGTCCGATCTGTGCGGAGGCGGGA
GTCGCCTCCAACCACATGGTGGGGGGCAAACAGTGCCACCCACCGAAGCGTAGGAAGGGT
AAGGGACCCGGAAAATCCTCGGTCCCCCCGCCCTCTAAATCTGCCGCGCTGGAGTCCATG
GTGGAGTGGTCCATCGATGTGGCAGTTCTCTGCGAGCCGTATTACGTGCCCGATCGCTCG
AATTGGGTATGTGATGCGGAGCAGTCTGTGGCGGTAGTCATCTCACCGGGTGCGCGCTCC
TCTTCTCCTCTCTCTGTCATCAGAAGGGGAGAAGGGTATGTCGTGGCTCAGTGGGGCGAC
TTCATACTTGTGGGCGTCTACCTCGCTCCGAGGAAGCCCGTCGTGGAGATCGAAAGACTC
CTCGACGAGGTCGGGGCTGAGGTCAGACGTGCGTCGCCGAGACAAGCAATTGTCCTCGGT
GACTTTAATGCCCACAGTACGGTGTGGCAATCGCCCGCTACGGACCCTCGCGGGGAGGTG
GTGGAGGAGTGGGCTGCAGCGGCTGGCCTATCCCTCCTCAACCGGGGCAGTGCAGTCACG
TGCGTACGACCGCAGGGCGAATCCATAGTGGACCTTTCGTTCGCTGCTCCTGCGGTTGCA
ACGCGTGTACGTGACTGGCGAGTCCTGGATGAGGTTGAGACGCTGTCGGACCACTTCTTC
ATCAGGTTCGAGTTGGCCCCACAGGATGGCTCTTCGTTCCGCCGCCGACCCGTCGGGAAC
GCTTCGGCGTTCCCACGGTGGTCCCTGACCGAGCTCGACAGGGACATGGCGGTGGAGGCG
TCGATCGTCCAAGCTTGGACGATGCGAGTGGAGCAGCCCGTTTCGGTGGATGGGGAGGCG
CGCAAATTTCGCCACAGCCTGTGGCGGATTTGCGATGCTGCGATGCCTCGCGTCGCGCAA
CGCTCCCCCAGACGCCAGGTGTACTGGTGGACGTCAGAGATCGCGCAGCTGCGTGCGGCT
TGCGCGGTGGCGAGACGCCAGTATATCCGACAACGGAGGCGGCACCCTCGTAACGAAGCC
GTCGAATGTCGGTTTCGTGACGCCTTCAACGGAGCGAAGAGCGGGCTTCAGCTCGCAATC
TGCAGGTCGAAGGAGTCGGCTCGCGAGGAGTTGCTTGCGCGTCTGGACAATGACCCGTGG
GGGCGTCCGTATCTCGGTGCCCGGAATAAGATCCGGGCCCAGATGGCCCCGGTCACGGAG
AGTCTGGAGCCGGAGCTTCTGCGAAGCGTCGTGAGTGCCTTATTCCCTGCGGAGACGGCG
CACTCGATGCTAGCGGAGGCTGGCACTTCACGCGGGCCGGACCGGGTGGAATTTATTCCG
CCCGTCTCTCTGGAGGAACTCGAGGAGTCTCTGCGGCCTCTCAAAGCCAAGAAAACCGCC
CCTGGGCCGGACGGCGTCCCCGGACGTGTCCTGGCCCTGGCTCTGGGCGAGTTGGCCGAG
TGGTTCGTGGAGATCCTCAACGAGTGTTTGAGATCGGGTCGCTTTCCATCGTGTTGGAAA
GAAGGGCGACTCGTTCTCCTCCAGAAAGAAGGTCGACCTGCAGACTCGCCATCTGCATAC
CGTCCCATAGTGCTGCTAGACGACGCGGGGAAGCTCTTCGAGCGTATCCTAGCTACCCGT
GTCGTCGCACACTTAAGCAGCAACGGACCCGACCTGGCCGAGTGCCAGTATGGTTTTCGG
GGTGGCCGGTCTACGATCGATGCCATCTCGAAGCTGAGGGAACTCGCCGATGATGCCGTT
TCGGGGGGTGGGGTGATGTTGGCGGTGTCCCTGGACATCTCCAACGCATTCAACACCCTT
CCGTTCGGTGTCATTGAAGAGGCCCTCAGATACCATGGTCTGCCACTTTACATCCGGCAG
ACCATCGGGTCCTATCTCCGCGAACGAGAGATCTCGTTCGTGGGGAGAGACGGTAGGGTC
CATCGCCACGAGGTGCGCTGCGGGGTTCCGCAGGGGTCAGTCCTCGGGCCGCTCCTGTGG
AACTTGGGGTACGACTTCGTACTTCGCGGCGCCCTCCAGACCGGGCTGAGCGTCGTCTGC
TGCGGACGACACGCTCTCGTCGGGGCCCAAGGCGTGGACCTGGCAGAGGCGACGTTGCGA
GCTGAAGCGGGAGCCGCCTCGATAGTACGGCGCATCGAGATGCTCGGGCTGAGGGTGGGT
CTCGACAAAACCGAGGCCCTCCTGTTCCACGGCCCTCGAGCACGACCACCGCCGGGCGCC
AGCATCAACATCTGCGGCGTCCGCGTCGAGCTCAGTCCCCGGATGAAATATCTGGGGCTG
ACTCTGGACGGTAGGTGGAACTTCCGGGAGCACTTTCGCGGTCTCGTCCCGAAATTACTC
GGGACGGCGAACGCTCTCGGAAGGCTTCTGCCTAATCTTGGTGGTCCAAGCGCGACATGC
CGGCGCTTGTACACCGGCGTGTTGCGTTCGATGGCGCTGTACGGAGCACCAGTGTGGGCC
GGTGCCCTCTCGAGGCCGAACGTGGCGTCGCTGCACAGAGCGCAGCGCGTCATGGCTTTG
AGAGTGGTGCGTGGATACCGCACGGTCTCCCACGAGGCGGCTTGCGTGCTCGCTGGGACG
CCTCCCTGGGACTTGGTCGCCCAAGTTCTGGCGGAGGTGCACCAGTGGCGCGCACGAGCT
CGATCCCAGGGTGTGAATCCGCCCTGGGATCCGGTGGATGGTTGGCGACGCTCCGCACAC
GAGGAGCTACTCCGTCGATGGAGGCGGCGGCTCTCCGAGCCAGTGGCCGGGTTGCGTACC
GTGGAGGCGATTCGACCGCTCCTTAAGGAGTGGGTGGATCGCCGACACGGTTCCTTGACC
TTCCGGCTGGTGCAGATCCTCTCGGGGCACGGTAGCTTCGGGCGGTATCTGTGCTACATA
GTCGGAAGAGAGCCGACGGCGGCGTGTCACCATTGCAGTTGCGTGGATGATACGCCCGAC
CATACATTAGCGGAGTGCCCTTCGTGGGAAGCGGAGCGTCGCCAACTAATCACCGAAGTT
GGCGCGGACCTTTCGCTGCCGGCCGTAGTCAAGGCTATGGTAGGTAGTGAGAGGGCCTGG
GCGGCGGTGGTCTCTTTCTGTGAGGTTGTCATCTCGCGGAAGGAGACTGCCGAAAGGGGG
AGGGAAGATGATCCCTCCTCGGCGCCGATGCGCCGACGTAGGCTGGGTCGTAGGCAGCGG
GCTTATGCCCGCCGAATGCCTCCCCAGTGA

Protein sequence:

MKASCKNDLPQEGTSRAAAGESLTGSPVCPSYSGGGNNRFGSPSSKDSLGRGAVGTGPGL
GKGHRSAKDLDKVMVVSSDEEPVDASAVRPVARQPLASNGEKGRATRRSPRTTSGSEMET
EETRSAFFAGTPTSLAPLRKRPATRRQPGGSSSGGSDKASFATAVKRGRAVEEGESNSEE
ENVARSTRRVEVALSSVKTLPASCLAKEMERALSVIVDVALKSKNLKGGCVRALKTSAAL
LGEAKEILLQRTSGEENEILRARLEEERKKSSLLEKELGLLREGQARLRADMDLLATAPK
PARDEKSEEELRGSLMRDLGAMMDAKLQGIADRLLPEKRLRPPLAADKRPPPAPASAAVA
EPAGRVASRKKNGATREQEKTARPLPPPPPSMDKTWTEVVKKKGKKKTGPPVAPPQEGPK
AKKPAKEKKKAKKKKKRSLKSVKNPAVVITLPRDSERTYEDVLKTARAGLNLAELGLPMG
LVCKKTVTGARMFELPESAEEAKADLLAQKLRELVPEVKVARPVKSGELRITGLDDSVTK
EDVLAAVARQGNCSVEQIRVGPIRSRGSFSTGAAWVRCPVEVAKCLAEAGRLLVGWSSAR
VTPLEPRPMRCYRCLEVGHAGLRCPSKTDRSRLCFRCGGEGHIAEACTSDAKCPICAEAG
VASNHMVGGKQCHPPKRRKGKGPGKSSVPPPSKSAALESMVEWSIDVAVLCEPYYVPDRS
NWVCDAEQSVAVVISPGARSSSPLSVIRRGEGYVVAQWGDFILVGVYLAPRKPVVEIERL
LDEVGAEVRRASPRQAIVLGDFNAHSTVWQSPATDPRGEVVEEWAAAAGLSLLNRGSAVT
CVRPQGESIVDLSFAAPAVATRVRDWRVLDEVETLSDHFFIRFELAPQDGSSFRRRPVGN
ASAFPRWSLTELDRDMAVEASIVQAWTMRVEQPVSVDGEARKFRHSLWRICDAAMPRVAQ
RSPRRQVYWWTSEIAQLRAACAVARRQYIRQRRRHPRNEAVECRFRDAFNGAKSGLQLAI
CRSKESAREELLARLDNDPWGRPYLGARNKIRAQMAPVTESLEPELLRSVVSALFPAETA
HSMLAEAGTSRGPDRVEFIPPVSLEELEESLRPLKAKKTAPGPDGVPGRVLALALGELAE
WFVEILNECLRSGRFPSCWKEGRLVLLQKEGRPADSPSAYRPIVLLDDAGKLFERILATR
VVAHLSSNGPDLAECQYGFRGGRSTIDAISKLRELADDAVSGGGVMLAVSLDISNAFNTL
PFGVIEEALRYHGLPLYIRQTIGSYLREREISFVGRDGRVHRHEVRCGVPQGSVLGPLLW
NLGYDFVLRGALQTGLSVVCCGRHALVGAQGVDLAEATLRAEAGAASIVRRIEMLGLRVG
LDKTEALLFHGPRARPPPGASINICGVRVELSPRMKYLGLTLDGRWNFREHFRGLVPKLL
GTANALGRLLPNLGGPSATCRRLYTGVLRSMALYGAPVWAGALSRPNVASLHRAQRVMAL
RVVRGYRTVSHEAACVLAGTPPWDLVAQVLAEVHQWRARARSQGVNPPWDPVDGWRRSAH
EELLRRWRRRLSEPVAGLRTVEAIRPLLKEWVDRRHGSLTFRLVQILSGHGSFGRYLCYI
VGREPTAACHHCSCVDDTPDHTLAECPSWEAERRQLITEVGADLSLPAVVKAMVGSERAW
AAVVSFCEVVISRKETAERGREDDPSSAPMRRRRLGRRQRAYARRMPPQ
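
Consistency check (illustrative sketch, not part of the original record): the listed CDS length of 5190 nt equals 1730 codons, which is 1729 residues plus the terminal TGA stop codon seen at the end of the nucleotide sequence. The helper below is a hypothetical convenience function, not a MonarchBase tool. It assumes the two sequences are passed in as plain strings and checks only lengths and the start and stop codons. It does not translate the sequence.

```python
# Hypothetical helper for sanity-checking a CDS against its listed translation.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def check_cds(cds: str, protein: str) -> dict:
    """Return basic consistency checks between a CDS and a protein sequence."""
    # Remove line breaks and other whitespace left over from the record layout.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return {
        "cds_length": len(cds),
        "protein_length": len(protein),
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # One codon per residue, plus one stop codon.
        "length_matches": len(cds) == 3 * (len(protein) + 1),
    }


# Toy input: the first three codons of this record followed by its stop codon.
toy = check_cds("ATGAAAGCG\nTGA", "MKA")
print(toy)
```

To check this entry, paste the full nucleotide and protein sequences from the record into `check_cds`. If the record is internally consistent, `cds_length` should be 5190 and `length_matches` should be true.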