DPGLEAN00336 in OGS1.0

New model in OGS2.0  DPOGS205257
Genomic Position  scaffold1065:+ 23986-34850
CDS Length  1503
Paired RNAseq reads  8
Single RNAseq reads  25
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012973 (7e-12)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  PREDICTED: transposon Ty3-I Gag-Pol polyprotein-like, partial [Xenopus (Silurana) tropicalis] (9e-48)
Best NR hit (blastx)  PREDICTED: transposon Ty3-I Gag-Pol polyprotein-like, partial [Xenopus (Silurana) tropicalis] (9e-45)
GeneOntology terms
GO:0003677 DNA binding
GO:0003723 RNA binding
GO:0003964 RNA-directed DNA polymerase activity
GO:0006278 RNA-dependent DNA replication
InterPro families  IPR000477 Reverse transcriptase
Orthology group  MCL15204

Nucleotide sequence:

ATGTCAGTTATACCGAGAGCTTCAAAGCCGGAGAAAAGAACAGTGGAATTGTCTTTAAGA
TTTGGAAAATTTCTATTGCTGAGTGAACAAAACGTGTCTGAACAATCCCGAGGGATATCC
GGAGTAGATAACTGGCTACATTCCTACCACTCCCTCCTAACAACTCGGCTTCTGCTACAA
ATAACCTGTTTACCTTTCGGTCTTATACCGGTTCCAAGAACGTTCGCATCGGTAACCAAT
TGGATAGCAGAATTGTTGAGAAATCACGGCATCAGATGCGTAGTATATCTCGACGATTTT
TTGCGTGCGAACCAGTCCAAATCGGCTTTACAGAACGACATAGCAGGGGCGCTAAAGATG
ATGAGGACCCTAGGCTGGATGATAAATTTTCAGAAATCGGTCTTAGCCCCGACACAATGC
CTCGAGTTTCTCGGCATAACGTGGGACACAAAACGTAACACAAAGTCCCTGTCGGGGCAG
AAGTGCTTAACGCTACGCAAGGCACTGTATCTGCTCAAACAGAGCAAATGGTCTCTCCGA
CAATACCAGTCCATTATGGGGAGACTCAAATTTGCGAGTTTTGTAACTCGCAGGGGTCGG
CTTCACTGTCGAACACTGCAGTACTACAGTCGACAGTTGCCGAAAACTCACCCGCACCGC
CGCGTGAGTATTCCCCAACCAGTACAGCCGGAATTGGAATGGTGGCTGGAAGAAATAGGA
GGGTCAATGCCGATTCAAATACCTCAATTGACCAACCTTTTGACTACAAATGCATCCAAC
ACAGGCTGGGGTGCACAGCTCAACGAAATCAGCATATCAAGAACGTGGACGAAACCAGCT
ATTCAACTCGATCAGGATGGCTTGCAAAACTCACAAATACTTCTTCAGACCGACAACCGG
ACCGTTGTATCATACATAAACAAAGAAGGAGGTACTCAGTCTCTGAAGCTTCTGGAACAA
ACTCGGCGGCTTCTATCGGTTCTAGACAAAGTGAACATGCATCTAATAGCACAATATATA
CCGGGCAGATACAATGTAGAAGTCGATGCATTGTCCCGTCAAAAGGCTTGCCCCGAATGG
CACTTGATAACAGAAGCAACCACGAAAATATTTCAAATGTGGGGGTGTCCAGAAATAGAT
TTTTTCGCATCGAAAACGGCCCACGTAGTTCGGACATATGTAACAAAAGACATTCAAGAT
CTAGATGCTTTCTATCACAATGCCTTTTGTCGCTCTTGGGATTACAATCTGGCATGGCTA
TTCCCACCATCCAATTTAATCCCTAGGGTACTTGCTCACTTGAACCAGGCCAAAGGACTC
CATGTTATAATAGCTCCGAAATGGCAAAAGGTGTTTTGCCAATCGGATCTTCAGAACCGA
GCCTTATATCTAATTCCAGATCTGAACCGTGTACTTCTAGACACGCGAACAGGAACACAC
CCACCGGAAGTTCAAAAATTACAACTGGGAGCTTGGCTGATTTCGGGTGGCAGGAAATTT
TAA

Protein sequence:

MSVIPRASKPEKRTVELSLRFGKFLLLSEQNVSEQSRGISGVDNWLHSYHSLLTTRLLLQ
ITCLPFGLIPVPRTFASVTNWIAELLRNHGIRCVVYLDDFLRANQSKSALQNDIAGALKM
MRTLGWMINFQKSVLAPTQCLEFLGITWDTKRNTKSLSGQKCLTLRKALYLLKQSKWSLR
QYQSIMGRLKFASFVTRRGRLHCRTLQYYSRQLPKTHPHRRVSIPQPVQPELEWWLEEIG
GSMPIQIPQLTNLLTTNASNTGWGAQLNEISISRTWTKPAIQLDQDGLQNSQILLQTDNR
TVVSYINKEGGTQSLKLLEQTRRLLSVLDKVNMHLIAQYIPGRYNVEVDALSRQKACPEW
HLITEATTKIFQMWGCPEIDFFASKTAHVVRTYVTKDIQDLDAFYHNAFCRSWDYNLAWL
FPPSNLIPRVLAHLNQAKGLHVIIAPKWQKVFCQSDLQNRALYLIPDLNRVLLDTRTGTH
PPEVQKLQLGAWLISGGRKF