DPGLEAN06880 in OGS1.0

New model in OGS2.0  DPOGS213238
Genomic Position  scaffold517:- 22121-31220
CDS Length  2547
Paired RNAseq reads  654
Single RNAseq reads  1562
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009517 (0.0)
Best Drosophila hit  topoisomerase 3beta (0.0)
Best Human hit  DNA topoisomerase 3-beta-1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to prokaryotic DNA topoisomerase [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to prokaryotic DNA topoisomerase [Nasonia vitripennis] (0.0)
GeneOntology terms
GO:0003916 DNA topoisomerase activity
GO:0003677 DNA binding
GO:0000166 nucleotide binding
GO:0016853 isomerase activity
GO:0005524 ATP binding
GO:0006268 DNA unwinding involved in replication
GO:0006265 DNA topological change
GO:0003917 DNA topoisomerase type I activity
GO:0003676 nucleic acid binding
GO:0006259 DNA metabolic process
GO:0005694 chromosome
GO:0007059 chromosome segregation
GO:0000793 condensed chromosome
InterPro families
IPR000380 DNA topoisomerase, type IA
IPR013824 DNA topoisomerase, type IA, central region, subdomain 1
IPR013826 DNA topoisomerase, type IA, central region, subdomain 3
IPR023406 DNA topoisomerase, type IA, active site
IPR023405 DNA topoisomerase, type IA, core domain
IPR013497 DNA topoisomerase, type IA, central
IPR006171 Toprim domain
IPR003601 DNA topoisomerase, type IA, domain 2
IPR003602 DNA topoisomerase, type IA, DNA-binding
Orthology group  MCL13223

Nucleotide sequence:

ATGAAGACAGCATTAATGGTGGCTGAAAAGCCGTCCCTGGCTCAAAATCTAGCAAATATT
CTCAGTAATGGAAAATGCAATACCAACAAGGGCTCTAATTCAGCTTGCGCAGTTCATGAG
TGGACAGGTACCTTCAAAAACGAACCTGTGAAATTTAAAATGACTTCAGTGTGTGGTCAT
GTGATGAGCTTAGATTTCACTGGCAAATATAATAATTGGGATAAAGTAGATCCCGTTGAA
CTGTTCATATGTCCTACAGAGAAGAAGGAAGCAATGCCAAGACTTAGGATTCCCGCTTTC
CTAGCACAGGAGGCTAGAGGATGTGATTATCTCATTCTTTGGTTGGATTGTGATAAAGAA
GGGGAAAATATATGTTTTGAGGTTATGTCCTGCGTTCAAAACTACATGAAAGGTGACGTA
TACTCACCAGCAGTGACATTTCGGGCGCGATTTTCAGCCATCACAGATAAAGATATTAAA
ACAGCCATGATGAATCTGGTTAGACCAAATGAAAGCGAATCTCGAAGTGTTGACGCCAGA
CAGGAACTAGATTTGCGTATCGGATGTGCCTTCACGAGATTCCAGACGAAGTATTTTCAA
GGTCGCTACGGTGATTTGGACGCGTCTCTCATATCGTACGGTCCCTGCCAGACTCCGACA
CTCGGATTCTGTGTCCAACGCCACGATGACATCCAGACCTTCAAACCGGAAACCTATTGG
GTGTTGAGAGTGACCGCCTCCACCTCCGAGGGCAGAGAGCTCCCGCTTGAATGGAAACGT
GTCAGGAGCTTCGAAAAGGACATAGCTAACATGTTTCTGGTCGGCATCAAGGAATTCAAA
GAGGCCACAGTTGTTAATATCCAAGCTAAAGAGAAGATAAAGTCCAGACCGACCGCTCTC
AACACTGTTGAGTTGATGAGGGTGGCCAGTGCTGGTCTCGGTATGGGACCACATCACGCT
ATGCAGATTGCTGAACGTCTGTACACTCAAGGTTATATATCATATCCTAGAACAGAGACG
ACTAGTTATGGAGAGAATTTTGATCTCATTGGTAGTCTTCGTCAACAACAGAATTCTAAC
AAGTGGGGTTCTGAGGTACGAGCTTTACTGGCTAATGGTATCAATAAGCCCAAGAAGGGC
CACGACGCGGGTGACCATCCACCGATCACTCCTATGAAGCCTGCCTCCGAATCCGAGCTG
GAGGGTGACATGTGGCGTATATACGACTACATCACGCGGCATTTCATAGCGACACTGTCG
CGCGACTGCCGCTACCTCAGCACGACCCTTACCTTCAGCGTGGGCTCCGAGACGTTCTAT
TACACTGGCAATACTCTGGTCGACGCTGGCTACACTGAGATCATGCATTGGCAGGCTTTC
GGTAAGGATGAGTTCGTCCCAGTACTGAAGGTGGACGAGGTGCTTCGGGCACACGACCAC
CGCCTCGTGGAGTGTCAGACCTCGCCCCCGGACTACCTCACCGAGTCTGAGGTGATAACT
CTGATGGAGAAGCACGGGATCGGCACGGACGCGTCCATACCTGTCCACATCAATAACATC
TGTCAGAGGAACTACGTGAGCGTCGGCAGCGGGCGGCGGCTCGTGCCCACCAGCCTGGGC
GTCGTGCTCGTACATGGATATCAGAAGATCGACCCGGAGCTAGTGTTACCGACGATGCGA
TCGGCCGTCGAGGAACAGCTCAACCTCATCGCAATCGGTCGAGCCGATTTCCACGCGGTG
TTGACTCACACCACGGAGATCTTCAGGCGGAAGTTCCAATACTTCGTGAGGTCCATAGAG
GCCATGGACCAACTGTTCGAGGTCAGCTTTTCGTCGCTCAAGACCAGCGGCAAGGCGCTG
TCCCGCTGCGGCAAGTGCAGGAGATACATGAGATACATACAGGCGAAGCCCGCCCGCCTG
CACTGCTCCCACTGTGACGACACCTACACGCTGCCCCAGCACGGCACGGTCCGCATTTAC
CGCGAGCTGAAGTGTCCTCTGGACGACTTCGAGCTGCTGTCCTGGTCCACCGGCAGCAAA
GGGAAGAGCTTCCCGCTCTGCCCTTACTGCTACAATCACCCACCATTCAGGGATATGAAG
AAGGGCTTCGGCTGTAACTCCTGCACTCACCCCACTTGTCCCTACGGCGTGAACTCCACC
GGCGTCTCCGGCTGTGTCGAATGTGATGGAGTTTTAGTTTTGGATCCCTCGGCGCCGAAG
TGGAAGCTGGCGTGTAACCGTTGTGACGTCATCATAAACGTGTTCGAGGACGCGAGCCGC
GTGTCCGTGTGCGAGGCGGCGTGCGCGTGCGGCGCTCAGTTAGTGTGCGTCGAGTACCGC
GCCGAGCGGACCAAGCTGCCGGCCGCGCTCACCGAGATGACCGCCTGCCTTTACTGCGAG
CCGGCTTTCAGCGCGCTTGTGGAGAAGCATCGTGCGGTGGCGCCCCGGAGCGGAGGATCG
CGAGGACGGAGCGCCAGGGGCAGAGGGAAACATCGCAACAAACAACCCAAAGACAAAATG
GCCCAATTAGCGGCGTATTTCGTATAA

Protein sequence:

MKTALMVAEKPSLAQNLANILSNGKCNTNKGSNSACAVHEWTGTFKNEPVKFKMTSVCGH
VMSLDFTGKYNNWDKVDPVELFICPTEKKEAMPRLRIPAFLAQEARGCDYLILWLDCDKE
GENICFEVMSCVQNYMKGDVYSPAVTFRARFSAITDKDIKTAMMNLVRPNESESRSVDAR
QELDLRIGCAFTRFQTKYFQGRYGDLDASLISYGPCQTPTLGFCVQRHDDIQTFKPETYW
VLRVTASTSEGRELPLEWKRVRSFEKDIANMFLVGIKEFKEATVVNIQAKEKIKSRPTAL
NTVELMRVASAGLGMGPHHAMQIAERLYTQGYISYPRTETTSYGENFDLIGSLRQQQNSN
KWGSEVRALLANGINKPKKGHDAGDHPPITPMKPASESELEGDMWRIYDYITRHFIATLS
RDCRYLSTTLTFSVGSETFYYTGNTLVDAGYTEIMHWQAFGKDEFVPVLKVDEVLRAHDH
RLVECQTSPPDYLTESEVITLMEKHGIGTDASIPVHINNICQRNYVSVGSGRRLVPTSLG
VVLVHGYQKIDPELVLPTMRSAVEEQLNLIAIGRADFHAVLTHTTEIFRRKFQYFVRSIE
AMDQLFEVSFSSLKTSGKALSRCGKCRRYMRYIQAKPARLHCSHCDDTYTLPQHGTVRIY
RELKCPLDDFELLSWSTGSKGKSFPLCPYCYNHPPFRDMKKGFGCNSCTHPTCPYGVNST
GVSGCVECDGVLVLDPSAPKWKLACNRCDVIINVFEDASRVSVCEAACACGAQLVCVEYR
AERTKLPAALTEMTACLYCEPAFSALVEKHRAVAPRSGGSRGRSARGRGKHRNKQPKDKM
AQLAAYFV