DPGLEAN18182 in OGS1.0

Genomic Positionscaffold5341:+ 2593-4719
See gene structure
CDS Length2127
Paired RNAseq reads  5
Single RNAseq reads  23
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011163 (2e-10)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  endonuclease-reverse transcriptase [Bombyx mori] (0.0)
Best NR hit (blastx)  endonuclease-reverse transcriptase [Bombyx mori] (0.0)
GeneOntology terms



  
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003723 RNA binding
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10020

Nucleotide sequence:

ATGGAAGTTTTAAATGTAAACTTTTCTTCAGACCATAGACCAGTAAGAACAACATTAATA
TTAACAAAATGTAAGAAAACCAGATCAAGCTACAAAAATTACACATACTCGAAATTAAAA
ACAGAAGAAATTTCGAAATACTCAAAACAGATTACTTCCTACTTACCTGAACTCGATGAC
TGCTCTACATACTCTGTACAAACATTTCACGATAAGATAGTAAAAGCTATCTCTGAAAGC
CTTAAGATTGCTCGGATAACAAAAGGGAACACACAAAACAGGCATAATATTCTATCTGAG
CAAACACTGTCACTAATGAAAAGAAGACAAGATCTACAAAATACACCAAATAAAACCAGA
TCCATGAAAAACGAGCTGTCTGCACTATACAAGTTAACAAAGAAACTGATAAAACAAGAC
TACACCAGATACAGATATAATATCCTTGGAAAACACCTTACCCAAACAAATAGTTCAAAG
AAAGCTTTTAAAGAATTACAAACATATAAAACATGGATAGATGGACTAAACCAGAGAAAT
AAAACACTGAACAATCGAAATGATGTAGTAAACCTTGCAACAGATTTTTATAAAAAATTG
TATAGTGCTCCAGATTCAAATTGTTTCGAGACAGAACAGGAAATCATAAATTCAGCTAAA
CATCAAGAAGAAGATCACGGACCACAGCTCGACGTCGTAGACGTTGTAGAAGCGAATCAA
ATGTTAAAAGCAGAAAAAAGCCCTGGTCAAGATAATATAACCAACGAAGCCATAAAAAAC
GCATCAGCTCAACTGGCTCTGCCACTAACTGCATTGTTCAATCGCATACTAGGGACCACC
GAAATTCCAACCCAATGGTCTAATTCGGAGATAATACTGTTATACAAAAAAGGAGACCCA
CGCGATATAGGCAACTATAGGCCTATAAGTCTATTACCATGTTTATATAAACTATTCTCT
ATGCTCATCAACAAAAAAATTAGAAACACCTTAGATGCTGAACAACCCATAGAACAAGCA
GGATTCAGAAAGGGCTTCTCAACTATAGACCACATTCATACCATAGAACTCCTGATCGAA
AAGTATCAAGAATATCAGAAACCGCTATACATTGCTTACATAGATTATAAGAAAGCTTTT
GATACTGTGTCACATACGAGCATATGGAAAACACTGGTAGACCAAGGAGTAGAGACAATG
TATATACAGATAATTAAAAAGATATACGAAAACAGCGCAGCTAGAGTAAAACTAGACAGG
CCTGGCCCAAGCTTTCCTATCAGAAGAGGGGTAAAACAGGGCGACCCACTGTCCCTACAA
CTATTTATTGCTCTCCTAGAATCAATAATAAGAAACCTAGATTGGAAACAGTATGGATTA
AACATTAATGGCAAATACTTAAACCACCTTCGCTTCGCTGATGATCTAATTCTACTGTCA
GAAACAGACACACAGCTACAACTCATGATACACTCACTGAATAAATCCAGTAAACAAGTC
GGTTTAAAAATGAATCTAACTAAAACCATGGTCATGACTAACAGTATACAGAGGAAAATA
GTAGTTGACGACGAAATATTAAAATACACTGATAAATACATTTACCTGGGAAAACAAATA
GGCTTTAACAGAAACAACAGTGAACTAGAAATAGAACGAAGGATACAAAACACCTGGAAT
AAATATTGGGGTTTGAAAGAAATTTTTAAAAGCGATATGGCAGTACAAATAAAAACAAGA
GTAATGAATACATGTCTTTTGCCAGGCCTAACATACGGCTGTCAAACCTGGAAATTTACG
AACAAAACCAAAAACAAAATTATCAGCTGCCAACGTGCCTTGGAACGCAGTATGCTAAAC
ATCAAAAAAATTCAAAAGATTAGACATGATAAAATAAGAAACACATCAAAGGCTACTGAT
GCCCTAGAACAGACCTCAAGACTAAAATGGAAATGGGCAGGACATGTTGCGCGACTACAA
GATGAGAGGTGGACAAAGAAGGTCACCCTGTGGGGCGGACCGAAAGGAAAGCGTCACAGA
GGAAGACCCCACGCGAGATGGAAGTACGACATCACTCGGATTGCGGGCTCAAAATGGACC
GAAATAGCCATGGACAGAGACAAATGA

Protein sequence:

MEVLNVNFSSDHRPVRTTLILTKCKKTRSSYKNYTYSKLKTEEISKYSKQITSYLPELDD
CSTYSVQTFHDKIVKAISESLKIARITKGNTQNRHNILSEQTLSLMKRRQDLQNTPNKTR
SMKNELSALYKLTKKLIKQDYTRYRYNILGKHLTQTNSSKKAFKELQTYKTWIDGLNQRN
KTLNNRNDVVNLATDFYKKLYSAPDSNCFETEQEIINSAKHQEEDHGPQLDVVDVVEANQ
MLKAEKSPGQDNITNEAIKNASAQLALPLTALFNRILGTTEIPTQWSNSEIILLYKKGDP
RDIGNYRPISLLPCLYKLFSMLINKKIRNTLDAEQPIEQAGFRKGFSTIDHIHTIELLIE
KYQEYQKPLYIAYIDYKKAFDTVSHTSIWKTLVDQGVETMYIQIIKKIYENSAARVKLDR
PGPSFPIRRGVKQGDPLSLQLFIALLESIIRNLDWKQYGLNINGKYLNHLRFADDLILLS
ETDTQLQLMIHSLNKSSKQVGLKMNLTKTMVMTNSIQRKIVVDDEILKYTDKYIYLGKQI
GFNRNNSELEIERRIQNTWNKYWGLKEIFKSDMAVQIKTRVMNTCLLPGLTYGCQTWKFT
NKTKNKIISCQRALERSMLNIKKIQKIRHDKIRNTSKATDALEQTSRLKWKWAGHVARLQ
DERWTKKVTLWGGPKGKRHRGRPHARWKYDITRIAGSKWTEIAMDRDK