DPGLEAN08188 in OGS1.0

Genomic Positionscaffold40:+ 22089-63374
See gene structure
CDS Length1353
Paired RNAseq reads  2
Single RNAseq reads  137
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003794 (4e-23)
Best Drosophila hit  CG42313 (1e-08)
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase [Bombyx mori] (2e-126)
Best NR hit (blastx)  endonuclease-reverse transcriptase [Bombyx mori] (3e-122)
GeneOntology terms



  
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10020

Nucleotide sequence:

ATGAAAGGTGTTGATGATGGCGGCGATGCTGGTGTGCGCACGGTGGACGTTCAGGCCGTG
GAGGGGATGGTAGCGCAATTCCCATGCGACCTCGGCACTGCGGCTAATGACAAAGTATAC
ATGGTGTTCTGGTTCAGGGATGACGCTGGCATTCCATTATACAGGTCTGGCTGTTTAAAC
GCAGATGGAATACCCCTTAAAATCAGTCTACCTCCTAGTCACGCTGATGCTGACTGTGAC
AAGAACTTATCGACATGTGGATTAATTTTGCTTGTAGTTTTACCAAGGGACGACTGTATG
GAGTTCCTCAATAGAGGAGTGAATAAGTATTTGCAGGGTTGGCAACGCGCAATAATATCC
CTGGTATTGCAGGCCTCTATAGGCACCGGTGACAACTTAACATCCGGCGGACTGGGCGGT
ACAAATGTTTACAACCGTCGTGAACAAGTCGAGTCTCGTAAAGGCTTCAGCACCGTAGAC
CACATCATCATCGTTCGGCAGATTGTACAGAAAACCGACGAGTACAATCAGCCGCTGTGT
CTGGCTTTTGTGGACTACGAAAAAGCCTTCGACTCCATCGAAACTTGGGCGGGTCTGGAC
GCTCTGCAACGATGTGGTATAGATTGGCAATACATCGAGGTGCTGAAAAGCCAATATGAA
ACCGCCCTCATGACCGTCCAGCTCCAGGACCATAAGACCAATCCCATCGAGCTGCATCGA
GGTGTGAGACAGGGGGATGTTATATTCCCGAAGCTGTTCACCAACGCACTCGATGACGTC
TTTAAGACTCTGGACTGGGCTGGAAGGGGTATAACGGTGAACGGTGAGCACATCTCGCAC
CTTCGGTTTGCCGACGACATCGTTATAGAAGCAGAGTCGCTGGAGCAGTTAAGCGGGATG
CTGCATAGCCTTAATGAAGCCTCCGGTGGTCTTGGCATGAACCTGGATAAAACCAAAGCC
ATGTTCAATGAACATGTTCTGCTAAGTCCGATATATGTCGAAGGATCGATGCTTGAAGTT
GTTCAGGAGTATATCTACCTAGGGCAAGTAATCAAGCTCGGTAGAAACAACTTCGAGCAA
GAAGTCGACCGCAGGGTTCAGTTGGGTTCGGCAACATTTAGCAAACTCCGTCGAGTTTTC
TCTTCGCCTATATCGCAATGCCTGAAGACAAAAGTGTACGACCAGTGCGTCCTACCTGTT
ATGACTTACGGTGCTGAAACGTGGACATTGACGGTTGGACTGGTCCATAGATTTAAATTC
GCACAGCGGGTTATGGACCGGGCTATGCTCGGAGTTTCTTTGAAGGATAAAGTTCGCAAT
GAGGTCATCCGACAAAGAACCAAGGTAACATAG

Protein sequence:

MKGVDDGGDAGVRTVDVQAVEGMVAQFPCDLGTAANDKVYMVFWFRDDAGIPLYRSGCLN
ADGIPLKISLPPSHADADCDKNLSTCGLILLVVLPRDDCMEFLNRGVNKYLQGWQRAIIS
LVLQASIGTGDNLTSGGLGGTNVYNRREQVESRKGFSTVDHIIIVRQIVQKTDEYNQPLC
LAFVDYEKAFDSIETWAGLDALQRCGIDWQYIEVLKSQYETALMTVQLQDHKTNPIELHR
GVRQGDVIFPKLFTNALDDVFKTLDWAGRGITVNGEHISHLRFADDIVIEAESLEQLSGM
LHSLNEASGGLGMNLDKTKAMFNEHVLLSPIYVEGSMLEVVQEYIYLGQVIKLGRNNFEQ
EVDRRVQLGSATFSKLRRVFSSPISQCLKTKVYDQCVLPVMTYGAETWTLTVGLVHRFKF
AQRVMDRAMLGVSLKDKVRNEVIRQRTKVT