DPGLEAN18884 in OGS1.0

Genomic Positionscaffold2467:+ 8053-42031
See gene structure
CDS Length1449
Paired RNAseq reads  10
Single RNAseq reads  76
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011957 (4e-29)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase [Bombyx mori] (5e-75)
Best NR hit (blastx)  endonuclease-reverse transcriptase [Bombyx mori] (6e-72)
GeneOntology terms



  
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10020

Nucleotide sequence:

ATGCAGAGCGATAAGGATCAGAATGATTTGCACCCCGAGGAGAATGTTCTTGGAAGAGAC
GCTGCAAATAAAGTTACAGACATTGAGAACAACACTGTAGACAGGAACAAAGAACATACG
GCGAAAAATGTGAATGAATCGAATGAAAAAGGAAATAGCGTTTTAAGCAAATACGGCATC
ACATCTTATAAAAGACCGGAAATGTATGTGTCAAAGAAAGATAAACATTTTGATCCCGTC
AAAAGATATGTGCCCATAATTCATCTACAAGTGAACCAACCGACGTTAAGGGACTACGTT
CTTTACTCGATAGTTGGTAACTTCATCCGGGGCAACTTCACTCCGACCAGCTTCTCTGAG
AGACCTTGCAGGGACGAAGAGCCTCTCCCCTACTTCAAGCCTAAACACGCGAGAGCCGTG
CGTTCGAAGCACTCGAAGACCAAATGGCATGATGAAAGTGACAGTTATAGGACGAGGGGT
GACAGTGATTTCGATGGTTACTATTTTAGACCAAAATTTCTATCGCGAGTGGAGCGTTCG
ATTTACAAGTTCGGCAAAATGAGATACAGGTTCCTACCAGCAGCGATCGTGAGCGGACCA
TTGCGAGACAGGAGGTCCCTCACGTTGCAATGGTCTTTGAATGACGCTTCACATTTAGAG
AGAAAACAAACTAGTGTCGTCTTTAGATTACGTTATAGGAAAACAACGAGTGCCACCATA
TACCAAGTGTTCTTTACTAAAACTTTAAGATTAGGGTTGCTGCAAAGTGCTGGATGTAAA
CCGGATTCCCAGCCCCCGGTCAGTCAGGAGAGGTCGGAGCGCCACTGGGTTCTTTTAGTG
GGTATACCGGAAACCTATTTCTCAGGGAGTCCCACATACACCTCTCCGTCCCCTGAAAGG
GGATGCTACGGCTATAGACAACCTATTGGACAACCAATCAACCAGCCGAGAGCCAACCTG
ACCCGACACTTTACCAAGGATATCCCGGACATCACCCTGTCGGAGATTGAGATGGCCCTA
AAACAGCTAAAGAACAACAAGAAACCGGGTGAGGACGGAATTACTTCTGAACTTCTGAAA
GCGTGTGGATCACCGGTACTTAAAGTCCTTCAGAAGATCTTCAATTCTGTCTTGTTTGAA
GGTACCACGCCTGAGGCATGGAACAAGAGCGTAGTGGCGTTGTTCTTCAAAAAAGGTGAT
AACACCCTATTGAAGAGCTACACACCCATCTCGTTCTTGAGCCACGTTTACAAACTGTTT
TCTAGGGTCATTGTAAATCATCTCGAACGCAGGTTTGATGACTTCCAGCCTTCCGAACAA
GCCGGGTTCCGAAAAGGCTATAGTACCGTAGACCACATACATACGTTGCGCCAGGTTATA
CAGAAGACTGAGAAGTATAACTTGCCGCTCTGTCTAGCGTTTCTGGACCACGAAAAAGCC
TTTGATTAG

Protein sequence:

MQSDKDQNDLHPEENVLGRDAANKVTDIENNTVDRNKEHTAKNVNESNEKGNSVLSKYGI
TSYKRPEMYVSKKDKHFDPVKRYVPIIHLQVNQPTLRDYVLYSIVGNFIRGNFTPTSFSE
RPCRDEEPLPYFKPKHARAVRSKHSKTKWHDESDSYRTRGDSDFDGYYFRPKFLSRVERS
IYKFGKMRYRFLPAAIVSGPLRDRRSLTLQWSLNDASHLERKQTSVVFRLRYRKTTSATI
YQVFFTKTLRLGLLQSAGCKPDSQPPVSQERSERHWVLLVGIPETYFSGSPTYTSPSPER
GCYGYRQPIGQPINQPRANLTRHFTKDIPDITLSEIEMALKQLKNNKKPGEDGITSELLK
ACGSPVLKVLQKIFNSVLFEGTTPEAWNKSVVALFFKKGDNTLLKSYTPISFLSHVYKLF
SRVIVNHLERRFDDFQPSEQAGFRKGYSTVDHIHTLRQVIQKTEKYNLPLCLAFLDHEKA
FD