DPGLEAN20108 in OGS1.0

Genomic Positionscaffold5109:- 269-2752
See gene structure
CDS Length2484
Paired RNAseq reads  102
Single RNAseq reads  228
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001404 (3e-06)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  reverse transcriptase [Papilio xuthus] (0.0)
Best NR hit (blastx)  reverse transcriptase [Papilio xuthus] (0.0)
GeneOntology terms

  
GO:0006278 RNA-dependent DNA replication
GO:0003964 RNA-directed DNA polymerase activity
GO:0003723 RNA binding
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10007

Nucleotide sequence:

ATGGCGGTGGAGGCGTCGATCGTCCAAGCCTGGACCATGCGAGTTGAGCAGCCCGTTGAG
GTGGATGAGGAGGCGCGCAAGTTCCGCCGGAGTCTATGGCGGATTTGCGATGCGTCGATG
CCCCGTGTCGCGCAACGCCCTCCCAGACGCCAGGTGTATTGGTGGACGCCGGAGATCGCG
CAGCTGCGTGCAGTCTGCGTGGTGGCGAGGCGCCAGTACACCCGACAACGAAGGCAGCGT
CCTCGCAACGAGGCCGTCGAAAGTCGGCTTCGTGACGTCTACAACGAAGCGAAGAGCGAG
CTTCGGCAAGCTATCTGTAGGTCGAAGGATACGGCTCGTGAGGAGCTACTCGTGCGTCTA
GACAATGACCCGTGGGGGCGTCCGTATCTCGGTGCCCGGAATAAAACCCGGACCCAGACG
GCCCCGATCACGGAGAGTCTAGAGCCGGAGCTGCTGCGACGCGTCGTTTGTGCTCTCTTC
CCCGAGGAGGTCGCGCACTCGATGCCAACGGCAGGCACTTCTCGCGAGCTGGAACGGGCG
GTAACTATTGCGCCCGTTACTCTGGAGGAGCTCGAGAGGACTCTGTCACCGCTAAAGGCC
AAGAAAACCGCCCCCGGGCCGGACGGTGTCCCCGGACGCGTCCTGGCTCTCGCTCTGGGC
GAGTTGGCCGAGTGGTTCTTGGAGATCCTCAATGAGTGTTTGAGGACGGGCCGCTTTCCA
TCGGTTTGGAAAGAAGGACGGCTCGTTCTCCTCCAGAAGGTAGGTCGACCTGCAGACTCG
CCGTCTGCCTACCGTCCGATCGTGCTGCTAGACGACGCGGGTAAGCTCTTTGAGCGAATA
CTAGCCAACCGCGTCGTCGCTCACTTGCGCAGCGTAGGGCCCGATCTGGCCGAGTGCCAG
TATGGTTTTCGGGGTGAGCGTTCTACCATCGACGCCATCTCGAAGCTGAGGAGTCTTGCG
GATGATGCCGTGTCGAGGGGTGGGGTGCTGTTGGCGGTGTCCTTGGATATCTCCAACGCA
TTTAACACCCTTCCTTTCGGCGTCATTGAGGAGGCCCTCAGATACCATGGTCTGCCACTC
TACATCCGGCGGACCATCGGGTCTTATCTCCGCGGACGAGAGATCTCGTTCGTGGGGTGT
GACGGCCGGGTTCATCGCCATGAGGTGCGCTGCGGCGTTCCGCAGGGGTCGGTTCTTGGG
CCGCTCCTGTGGAACTTGGGCTACGACTTCGTGCTACGCGGTGCCCTCCTAACCGGGCTG
AGCGTCGTTTGCTACGCGGACGACACGCTCGTTGCAGCCCGAGGCGAGGACCTGGAAGAG
GCGACGGTGCTTGCTGAGGCGGGAGCCGCCTTGGTCGTGCGGCGCATCGAAATGCTCGGG
CTGAGGGTGGGCCTGGATAAAACAGAGGCCCTCCTGTTCCACAGCCCTCGAGCCAGACCG
CCGACGGGCGCCAGCATCAACATCTGCGGCGTCCGCGTCGAGCTCAGTTCCCGGATGAAA
TATCTGGGGCTGACTCTGGACGGAAGGTGGAGCTTCCGGGAGCACTTTCGCGGTCTAGTT
CCGAAACTCCTCGGGACGGCGAACGCGCTCGGAAAGCTTCTGCCAAATCTCGGTGGTCCC
AGCGCGACATGCCGGCGTCTGTACACCGGTGTGCTGCGCTCGATGGCGCTGTACGGAGCT
CCAGTGTGGGCCGGTGCCCTCACATCGCCGAACGTGACGGCGTTGCACAAAGTGCAGCGC
GTCATGGCGGTGAGGGTGGTACGGGGATACCGCACTGTCTCCCACGAGGCGGCTTGCGTG
CTGGCTGGGACGCCTCCTTGGGACTTGGAAGCTCAGGTCCTGGCGGAGGTTTACCAACAG
CGCGCACGAGCTCGTTCCCAGGGTGTGAATCCACCCCGGGAACAGGTGGAAAGTTGGCGT
CGCTCCGCGCAAGTGGCGCTCTTTCGTCGTTGGAAGCGACGGCTCTCTGTGCCAAAGGCC
GGGTTGCGCACCGTGGAGGCGGTTCGGCCGCTCCTCAGGGAGTGGGTGGATCGCCGACAT
GGTTCCTTAACCTTCCGGTTGGTGCAGATCCTTTCGGGACACGGCAGTTTCGGAAGGTAT
TTGTGCCACATAGCCGGGAGAGAGCCGACGTCGGCGTGTCATCACTGTACTTGTACGGAA
GACACTGCCGACCACACGCTGGCGGAGTGCCCTGCGTGGGAATCGGAGCGGCGCGAATTA
TCCACGGTGGTTGGCGCGAACCTCTCGTTGTCGGCCGTTGTTAAGGCAATGGTGGGTAGC
GGGAGGGCCTGGGCGGCGGTGGTCTCTTTCTGTGAGGTTGTCATCTCGCGGAAGGAGGCT
GCCGAAAGAGTGAGGGAAGACGATCCCTCCTCGATGCCGATGCGCCGACGGAGACCGGGT
CGTAGGCAGCGGGATATGCCCGCCGAATGCCTCCCCAATGAGAGGAGCCTGCGGGTGTCG
GAGGGGAAATCCGATGCCCGGTGA

Protein sequence:

MAVEASIVQAWTMRVEQPVEVDEEARKFRRSLWRICDASMPRVAQRPPRRQVYWWTPEIA
QLRAVCVVARRQYTRQRRQRPRNEAVESRLRDVYNEAKSELRQAICRSKDTAREELLVRL
DNDPWGRPYLGARNKTRTQTAPITESLEPELLRRVVCALFPEEVAHSMPTAGTSRELERA
VTIAPVTLEELERTLSPLKAKKTAPGPDGVPGRVLALALGELAEWFLEILNECLRTGRFP
SVWKEGRLVLLQKVGRPADSPSAYRPIVLLDDAGKLFERILANRVVAHLRSVGPDLAECQ
YGFRGERSTIDAISKLRSLADDAVSRGGVLLAVSLDISNAFNTLPFGVIEEALRYHGLPL
YIRRTIGSYLRGREISFVGCDGRVHRHEVRCGVPQGSVLGPLLWNLGYDFVLRGALLTGL
SVVCYADDTLVAARGEDLEEATVLAEAGAALVVRRIEMLGLRVGLDKTEALLFHSPRARP
PTGASINICGVRVELSSRMKYLGLTLDGRWSFREHFRGLVPKLLGTANALGKLLPNLGGP
SATCRRLYTGVLRSMALYGAPVWAGALTSPNVTALHKVQRVMAVRVVRGYRTVSHEAACV
LAGTPPWDLEAQVLAEVYQQRARARSQGVNPPREQVESWRRSAQVALFRRWKRRLSVPKA
GLRTVEAVRPLLREWVDRRHGSLTFRLVQILSGHGSFGRYLCHIAGREPTSACHHCTCTE
DTADHTLAECPAWESERRELSTVVGANLSLSAVVKAMVGSGRAWAAVVSFCEVVISRKEA
AERVREDDPSSMPMRRRRPGRRQRDMPAECLPNERSLRVSEGKSDAR