DPGLEAN20107 in OGS1.0

Genomic Position  scaffold5109:- 3480-11171
CDS Length  5772
Paired RNAseq reads  103
Single RNAseq reads  311
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013961 (3e-07)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  reverse transcriptase [Papilio xuthus] (0.0)
Best NR hit (blastx)  reverse transcriptase [Bombyx mori] (0.0)
GeneOntology terms
GO:0006278 RNA-dependent DNA replication
GO:0003964 RNA-directed DNA polymerase activity
GO:0003723 RNA binding
InterPro families
IPR001878 Zinc finger, CCHC-type
IPR013084 Zinc finger, CCHC retroviral-type
IPR000477 Reverse transcriptase
IPR005135 Endonuclease/exonuclease/phosphatase
Orthology group  MCL10007

Nucleotide sequence:

ATGGCCTGCACGCCCATCAACATTCTTCAGGCGAACCTGAACCACTCCCCTGGTCCGCAG
GATCTTTGCCTGCAGGCCATGGCGGAGTGGTCCATCGAGGCGGCAGTGCTCTGCGAGCCG
TACTTCGTCCCCGATCGTCCGAATTGGGTGACGGATACGGATCGGTCTGTGGCGGTGGTC
GTCTCATCGGGTGTGCGCTCCTCTCCTCTTGCCGTTGTCGGTAGGGGGGAAGGGTATGTC
GCTGCTCGGTGGGGCGATCTCGTACTGGTGGGCGTCTATCTCGCCCCGAGGAAGCCCGTC
GCAGAGGTAGAAAAACTTCTCGACGAGGTCGGGGCGGAAGTGAGACGTGCGTCGCCGAGG
CAAGTCATCGTTCTCGGCGACTTCAATGCCCACAGTACGGCGTGGCAGTCGCGCGTCACG
GACTCTCGCGGAGAGGTGGTAGAGGAGTGGGCCACAGCGTTCGGCCTGTCACTTCTCAAC
CGGGGAAATGCCGTTACGTGCGTACGACCGCAGGGTGAGTCCGTAGTGGACTTATCGTTC
GCTACTCCCGCGGTCGCAACGCGAGTACGTAACTGGCGAGTTCTGGTTGAGGAGGAGACG
TTGTCGGACCACTCCTTTATCAGGTTCGAGTTGGCTCCACAAGGAGGCTCTTCGTTTCGC
CGTCGCGCCGCCGGGAGCACGTCTGCGTTCCCGCGGTGGTCTCTGACCAAGCTGGACAGA
GACATGGCGGTGGAGGCATCGATCGTCCAGGCCTGGACCATGCGAGTCGATCAGCCCGTT
GAGGTGGATGAGGAGGCGCGCAAGTTCCGCCGGTCCCTATGGCGGATCTGCGATGCGTCG
ATGCCCCGCGTCACGCAACGCCCTCCCAGACGCCAGGTGTATTGGTGGACGCCGGAGATC
GCGCAGCTACGCTCAGTCTGCGTGGTGGCGAGACGCCGGTACACCCGACAACGGAGGCAG
CATCCTCGCAACGAGGCCGATGAAAGTCGGCTTCGCGATGCCTACAAGGAAGCGAAGAGC
GAGCTTCGGCACGCTATTTGCAGGTCCAAAGAGTCGGCCCGCGAGGAGTTACTCGCGCGT
CTAGATGGAGATCCGTGGGGGCGTCCGTATCTCGGTGCTCGGAACAAGATCCGAGCCCAG
ACGGCCCCGGTCACGGAGAGTCTAGAGCCGGAGTTTCTGCGAAGCGTCGTGTGCGCCTTG
TTCCCCATGGAGGCCGCACACACGATGCCAGCGGACACTTCACGCGAGCCGGCCCGGTCG
GAGGTCATTCCGGCCGTCTCTCTGGAGGAGCTTGAGAGGTCTCTGTCGCCGCTGAAGGCC
AAACAAACCGCCCCCGGACCGGACGGCGTCCCCGGACGCGTCCTGGCCCTGGCCCTGGGC
GAGTTGGCCGAGTGGTACTTGGAGATCCTCAATGAGTGTCTAAGAACGGGTCGCTTTCCA
TCGTCTTGGAAAGAGGGGAGACTCGTTCTACTCCAGAAGGGAGGACGACCTGCAGACTCG
CCGTCTGCGTACCGTCCGATAGTGTTGCTAGACGACGCGGGGAAGCTCTTCGAGCGGATC
CTCGCGACTCGTGTCGTCCGACACATCAGCAGCACGGGGCCCGATCTGGCTCAGTCCCAG
TATGGTTTTCGGGGTGGAAGGTCTACCATAGACGCCATCCTGAAACTGAGGAGTCTCGCA
GATGATGCCGTGTCTAGGGGCGGAGTTGCGTTGGCGGTGTCCCTCGACATCACCAACGCG
TTCAACTCGCTACCCTTTGGCGTCATCGAAGAGGCCCTCAGGTACCATGGTCTGCCGGAT
TACATTCGGCGGACCATCGGGTCCTACCTCCGCGGACGGGAGATCTCGTTCGTGGGGGGC
GACGGTAGGTTGCATCGTCACGAGGTGCGCTGCGGTGTTCCGCAGGGGTCGGTCCTCGGG
CCTCTCCTGTGGAACTTGGGGTACGATTTCGTGCTCCGCGGCGCCCTCCCGACCGGGCTG
AGCGTCGTCTGCTACGCGGACGACACGCTCGTGTTAGCCCGAGGCGATGACCTGGCAGAG
GCGAAGGCGCGTGCCGAGGCGGGAGCCGCCTTGATAGTACGGCGCATCCAGATGCTCGGG
CTGAGGGTGGGCCTGGAGAAGACAGAGGCCCTCCTGTTCCACGGCCCTCGAGCAGGACCT
CCGACGGACGCCAGCATCAACATCTGCGGCGTCCGCGTCGAGCTCAGTCCCCGGATGAAG
TATCTGGGGCTGACTCTGGACGGAAGGTGGAACTTCCGAGAGCATTTTCGCGGGTTAGTC
CCGAAATTACTCGGGACGGCGAACGCACTCGGAAGACTTCTGCCGAATCTCGGTGGTCCG
AGCGCGACATGTCGGCGCCTGTACACCGGCGTGTTGCGTTCGATGGCGCTGTACGGAGCT
CCAGTGTGGGCCGGTGCCCTCACGAGGCCGAACGTGGCTGCGCTGCACAGAGTGCAGCGC
GTCATGGCTGTGAGGGTGGTACGGGGATACTGCACTGTCTCCCACGAGGCGGCTTGCGTG
CTCGCTGGAACGCCTCCCTGGGACTTGGACGCCCAGGTTCTGGCGGAGGTTTACCAGCAG
CGCGCTCGAGCTCGATCCCGAGGTACGAGTCCGGACCGGGATCGGGCGGCGGTGGTGGAG
GGTTGGCGGCGCTCCGCGCTCGCGTTGATCTTCCGTCGATGGAGGCGGCGGCTCTCTGAG
CCAGTGGCCGGGTTGCGCACCGTGGAGGCGATTCGTCCGCTCCTCAAGGAGTGGGTGGAC
CGTCGACACGGTTCCTTGACCTTCCGGTTGGTGCAGATCCTCTCGGGGCATGGCAGCTTC
GGGCGGTATCTGTGCCACATAGCCGGGAGAGAGCCGACGGCGGCGTGTCACCATTGCAGT
TGCATGGATGATACGCCCGACCATACTTTGGCGGAGTGTCCAGCGTGGGCGACGGAGCGT
CGTCAACTCACCACCGTGGTTGGCGCGGATCTCTCGTTGCCGGCAGTAGTTCAGGCTATG
GTCGGTAGCGAGAGGGCCTGGGCGGCGGTGGTCTGTTTCTGTGAGGTGGTCATCTCGCAG
AAGGAGGCCGCCGAACGAGTGAGGGAGGACAATCCCTCCTCGGCGCCCATGCGCCGACGA
AGGCTGGGTCGAAGGCAGCGGGCCTACGCCCGCGAAATGCCTCCCCAGGGCTGGCGCGCC
GATCTTGGTGTGCCAGTCATCTTGCTCCCCCGTCCGCTGAAAAGGGGACTCCGGGGGCAG
GATCCTTCACTCTGCCGAGTGGACTATATTCTGTGTTCGGCACAATCCCGCTCCCCAGTT
CCTCTGTACCAGAAGATCCCGGGGACGTGGGAACAAGGAGTGGTAGAGCCTGCTGCGGTC
TTCGGACTCCGGCTAGTAATGGGCTGGCTCGACCGGGTGTTGGTACGGGCGCGCCACTTC
TGGCGTTCCGCCTTCCGGTCCCCAGCTCTCCTGTATCAGGAGACTCCGGGGACACCGGAT
CTGTCACGTGTGACATTGCCGAGTGGACTCGTCTGCTCTCGGCACAACCCGCTCCCCAGC
TCCCCTGTATCAGGGGACTCCGGGGACGCGGGATCGGGCGGTGGGCGGTCTCCCAGACCA
CCCGCCGGTCAAACCGTAACCCTCATTGAGAGGACACGGGGGCTTCGGCCGCGAGGAGTA
AACCTCTTTAAAAAATCCCCAAATTCCCCTAGTTGCGGGGCGCGGCTAGGGGGTGCTTCT
CCGGGAGCGGCTGGGGGGTATCTCAGCCACCTAGCGACCAAGGGGTTCCTGCCCCCGAGG
CAGGAGGGTACCGGTCGCGATGCGGCCGGAGAATCCCTCACCGGTTCGCCGGTCAGCCCC
TCGTATTCTGGGGGGGGCAAAAACCGCCTTCGGGGTCCATTGGTGGACGCCCGACGTAGA
GGTGCAGTGGGCACGGACCTAGGCTCAGACAACGGACAAGATTCTGCTAGTGATTTGGAT
ATAGCTGGTAAGAGTAAAGAAGCGGCGGGCACCTCACGTGTACGCGGACGAGGGCGACCT
CCTACTACCGGGGCGTACGTAGGATTAGCAAAGGCCAAGGAGGCTCTCAACCGCCAGAAG
CGTGAGGAGCTCTTGCTGGAGACTGAGGAAGAAGTGACGCGTACGGCACGCAAAACGGAA
GCGGCTTTGCCCTCTGTCAAGAAGATGCCTATCGCTTTACTCAGCAAAGAGGTGGACAAG
GCTGTCGAAGCGATCCTTGAGGTCGCCATTAAGTCTAAAAACCTTAAAGGCGGCTGCATC
AGGTCCTTAAAGGCTTCGTCGGCCCTAATCAGGGAGGCAAAGGAAGTTCTTCTTTGCCGA
ACTAGCTCGGAGGAGATTGCCGTATTGCAGTCTCAGCTGGAGGAGGAGAAGAGAAAGAAC
GCTCGCCTTCAGCAAGAACTGGTTGAGCTGAGAGAGGGGCAAGCTCGCTTGCGAGCGGAC
ATGGACCAGTTGGCCATTGCCCCTCCCACGGTTGTAGCGGAGAGGAGTTTCGAGGAGCTT
CGTGCCGGAATGTTGCTTGACATAGGCAATATGATGGATGCGAAGCTCCGCGACGTCGAG
AATAAACTCCTTAAGGATAGGCGCACGCGGCAACCAACTGCTTCTGACAATAGGCCTTCA
GTACCAACGTCAGAAGTAGCGGAACACTACGCGGACCGTGAACCGATCACAGCGTCGGCA
CCTGAAGCCAGTAACAAAACGGCTTCAGGCAAACAGCAAGGCGCCATTAAACCAGGGCAA
CCAACATCCGCCCGGTTCTTACCCCCGCCACCGGCTTCCATGGCCGAAAGGTGGACGGAG
GTTGTTAAAAGGAAGGGCAAGAGCAAGGAACAACCCGGCCCTCGCCCCAAGAGTGCTGCT
GTCCCATCTCGGACGACAAAAGGAGCGGCTGCAGTTGCTCAGTCTGTGCCCCTACAAGAC
CGGGGGCACAAAAAAGTAAAGAAGAAGGGAAGGAGGAGAGAAATTCGTCCCCAACGCTCG
CCGGCAGTGGTCATCACACTGTCACGCGAGACACAAGAGCGCGGGATAAAATACGAGGAC
GTCCTCAAGAAAGCGCGGGCGAGACTCGATTTGGCGGAGATAGGGTTGCCTATGGGTCTC
GTCTGCAAGAAGACGGCAACGGGGGCCCGTATGTTCGAGCTCCCGGAGAGTGCGGGTGAG
GGAACAGCTGATCTCCTCGCACGGAAACTCCAAGAGCTGGCTCCTGAAGCGAAGATCGCC
AGGCCCATACAGTGTGCGGAACTGCGCATTTCAGGTCTGGACGACTCGGTCGTGAAGGAG
GACGTCCTCGCAGCCGTTGCACGGCAGGGAAGCTGCTCAGCCGAGCATATCAAGGTCGGG
CAGGTGCGGTTTCGAGGAGACTTCGGCACCGGCGCAGTCTGGGTGAAGTGCCCTCTCAAA
GCCGCCAAGACCCTCGCAGATGCTGGTCGGCTGTTAGTCGGTTGGTGCTCGGCGAGGGTG
CAGACACTCGAGCCTCGCCCAATGCGATGCTTCCGGTGCCTGGAGATAGGACACACTGGC
ATGCGGTGCCCGTCAACCACCGACCGCAGCGGTCTGTGCTTCCGCTGCGGCGGTGAGGGG
CATACGGCCAGTGACTGCAGGAAGGAGCCGCATTGCCTGGTCTGCGCAGCAGCTGGAGCT
CCCGCAAACCATATGGTGGGTGGGAAGAACTGTCACCCGCCTAAGAGAAGGAAGAAGGGT
AAGGGGCCCATAAACACCTCGGCCCCGACCACTAAACCTGCCGGTACGGAGGAAGCTACA
ACAACACTCTAA

Protein sequence:

MACTPINILQANLNHSPGPQDLCLQAMAEWSIEAAVLCEPYFVPDRPNWVTDTDRSVAVV
VSSGVRSSPLAVVGRGEGYVAARWGDLVLVGVYLAPRKPVAEVEKLLDEVGAEVRRASPR
QVIVLGDFNAHSTAWQSRVTDSRGEVVEEWATAFGLSLLNRGNAVTCVRPQGESVVDLSF
ATPAVATRVRNWRVLVEEETLSDHSFIRFELAPQGGSSFRRRAAGSTSAFPRWSLTKLDR
DMAVEASIVQAWTMRVDQPVEVDEEARKFRRSLWRICDASMPRVTQRPPRRQVYWWTPEI
AQLRSVCVVARRRYTRQRRQHPRNEADESRLRDAYKEAKSELRHAICRSKESAREELLAR
LDGDPWGRPYLGARNKIRAQTAPVTESLEPEFLRSVVCALFPMEAAHTMPADTSREPARS
EVIPAVSLEELERSLSPLKAKQTAPGPDGVPGRVLALALGELAEWYLEILNECLRTGRFP
SSWKEGRLVLLQKGGRPADSPSAYRPIVLLDDAGKLFERILATRVVRHISSTGPDLAQSQ
YGFRGGRSTIDAILKLRSLADDAVSRGGVALAVSLDITNAFNSLPFGVIEEALRYHGLPD
YIRRTIGSYLRGREISFVGGDGRLHRHEVRCGVPQGSVLGPLLWNLGYDFVLRGALPTGL
SVVCYADDTLVLARGDDLAEAKARAEAGAALIVRRIQMLGLRVGLEKTEALLFHGPRAGP
PTDASINICGVRVELSPRMKYLGLTLDGRWNFREHFRGLVPKLLGTANALGRLLPNLGGP
SATCRRLYTGVLRSMALYGAPVWAGALTRPNVAALHRVQRVMAVRVVRGYCTVSHEAACV
LAGTPPWDLDAQVLAEVYQQRARARSRGTSPDRDRAAVVEGWRRSALALIFRRWRRRLSE
PVAGLRTVEAIRPLLKEWVDRRHGSLTFRLVQILSGHGSFGRYLCHIAGREPTAACHHCS
CMDDTPDHTLAECPAWATERRQLTTVVGADLSLPAVVQAMVGSERAWAAVVCFCEVVISQ
KEAAERVREDNPSSAPMRRRRLGRRQRAYAREMPPQGWRADLGVPVILLPRPLKRGLRGQ
DPSLCRVDYILCSAQSRSPVPLYQKIPGTWEQGVVEPAAVFGLRLVMGWLDRVLVRARHF
WRSAFRSPALLYQETPGTPDLSRVTLPSGLVCSRHNPLPSSPVSGDSGDAGSGGGRSPRP
PAGQTVTLIERTRGLRPRGVNLFKKSPNSPSCGARLGGASPGAAGGYLSHLATKGFLPPR
QEGTGRDAAGESLTGSPVSPSYSGGGKNRLRGPLVDARRRGAVGTDLGSDNGQDSASDLD
IAGKSKEAAGTSRVRGRGRPPTTGAYVGLAKAKEALNRQKREELLLETEEEVTRTARKTE
AALPSVKKMPIALLSKEVDKAVEAILEVAIKSKNLKGGCIRSLKASSALIREAKEVLLCR
TSSEEIAVLQSQLEEEKRKNARLQQELVELREGQARLRADMDQLAIAPPTVVAERSFEEL
RAGMLLDIGNMMDAKLRDVENKLLKDRRTRQPTASDNRPSVPTSEVAEHYADREPITASA
PEASNKTASGKQQGAIKPGQPTSARFLPPPPASMAERWTEVVKRKGKSKEQPGPRPKSAA
VPSRTTKGAAAVAQSVPLQDRGHKKVKKKGRRREIRPQRSPAVVITLSRETQERGIKYED
VLKKARARLDLAEIGLPMGLVCKKTATGARMFELPESAGEGTADLLARKLQELAPEAKIA
RPIQCAELRISGLDDSVVKEDVLAAVARQGSCSAEHIKVGQVRFRGDFGTGAVWVKCPLK
AAKTLADAGRLLVGWCSARVQTLEPRPMRCFRCLEIGHTGMRCPSTTDRSGLCFRCGGEG
HTASDCRKEPHCLVCAAAGAPANHMVGGKNCHPPKRRKKGKGPINTSAPTTKPAGTEEAT
TTL