New model in OGS2.0 | DPOGS213238  |
---|---|
Genomic Position | scaffold517:- 22121-31220 |
See gene structure | |
CDS Length | 2547 |
Paired RNAseq reads   | 654 |
Single RNAseq reads   | 1562 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009517 (0.0) |
Best Drosophila hit   | topoisomerase 3beta (0.0) |
Best Human hit | DNA topoisomerase 3-beta-1 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to prokaryotic DNA topoisomerase [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to prokaryotic DNA topoisomerase [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0003916 DNA topoisomerase activity GO:0003677 DNA binding GO:0000166 nucleotide binding GO:0016853 isomerase activity GO:0005524 ATP binding GO:0006268 DNA unwinding involved in replication GO:0006265 DNA topological change GO:0003917 DNA topoisomerase type I activity GO:0003676 nucleic acid binding GO:0006259 DNA metabolic process GO:0005694 chromosome GO:0007059 chromosome segregation GO:0000793 condensed chromosome |
InterPro families    | IPR000380 DNA topoisomerase, type IA IPR013824 DNA topoisomerase, type IA, central region, subdomain 1 IPR013826 DNA topoisomerase, type IA, central region, subdomain 3 IPR023406 DNA topoisomerase, type IA, active site IPR023405 DNA topoisomerase, type IA, core domain IPR013497 DNA topoisomerase, type IA, central IPR006171 Toprim domain IPR003601 DNA topoisomerase, type IA, domain 2 IPR003602 DNA topoisomerase, type IA, DNA-binding |
Orthology group | MCL13223 |
Nucleotide sequence:
ATGAAGACAGCATTAATGGTGGCTGAAAAGCCGTCCCTGGCTCAAAATCTAGCAAATATT
CTCAGTAATGGAAAATGCAATACCAACAAGGGCTCTAATTCAGCTTGCGCAGTTCATGAG
TGGACAGGTACCTTCAAAAACGAACCTGTGAAATTTAAAATGACTTCAGTGTGTGGTCAT
GTGATGAGCTTAGATTTCACTGGCAAATATAATAATTGGGATAAAGTAGATCCCGTTGAA
CTGTTCATATGTCCTACAGAGAAGAAGGAAGCAATGCCAAGACTTAGGATTCCCGCTTTC
CTAGCACAGGAGGCTAGAGGATGTGATTATCTCATTCTTTGGTTGGATTGTGATAAAGAA
GGGGAAAATATATGTTTTGAGGTTATGTCCTGCGTTCAAAACTACATGAAAGGTGACGTA
TACTCACCAGCAGTGACATTTCGGGCGCGATTTTCAGCCATCACAGATAAAGATATTAAA
ACAGCCATGATGAATCTGGTTAGACCAAATGAAAGCGAATCTCGAAGTGTTGACGCCAGA
CAGGAACTAGATTTGCGTATCGGATGTGCCTTCACGAGATTCCAGACGAAGTATTTTCAA
GGTCGCTACGGTGATTTGGACGCGTCTCTCATATCGTACGGTCCCTGCCAGACTCCGACA
CTCGGATTCTGTGTCCAACGCCACGATGACATCCAGACCTTCAAACCGGAAACCTATTGG
GTGTTGAGAGTGACCGCCTCCACCTCCGAGGGCAGAGAGCTCCCGCTTGAATGGAAACGT
GTCAGGAGCTTCGAAAAGGACATAGCTAACATGTTTCTGGTCGGCATCAAGGAATTCAAA
GAGGCCACAGTTGTTAATATCCAAGCTAAAGAGAAGATAAAGTCCAGACCGACCGCTCTC
AACACTGTTGAGTTGATGAGGGTGGCCAGTGCTGGTCTCGGTATGGGACCACATCACGCT
ATGCAGATTGCTGAACGTCTGTACACTCAAGGTTATATATCATATCCTAGAACAGAGACG
ACTAGTTATGGAGAGAATTTTGATCTCATTGGTAGTCTTCGTCAACAACAGAATTCTAAC
AAGTGGGGTTCTGAGGTACGAGCTTTACTGGCTAATGGTATCAATAAGCCCAAGAAGGGC
CACGACGCGGGTGACCATCCACCGATCACTCCTATGAAGCCTGCCTCCGAATCCGAGCTG
GAGGGTGACATGTGGCGTATATACGACTACATCACGCGGCATTTCATAGCGACACTGTCG
CGCGACTGCCGCTACCTCAGCACGACCCTTACCTTCAGCGTGGGCTCCGAGACGTTCTAT
TACACTGGCAATACTCTGGTCGACGCTGGCTACACTGAGATCATGCATTGGCAGGCTTTC
GGTAAGGATGAGTTCGTCCCAGTACTGAAGGTGGACGAGGTGCTTCGGGCACACGACCAC
CGCCTCGTGGAGTGTCAGACCTCGCCCCCGGACTACCTCACCGAGTCTGAGGTGATAACT
CTGATGGAGAAGCACGGGATCGGCACGGACGCGTCCATACCTGTCCACATCAATAACATC
TGTCAGAGGAACTACGTGAGCGTCGGCAGCGGGCGGCGGCTCGTGCCCACCAGCCTGGGC
GTCGTGCTCGTACATGGATATCAGAAGATCGACCCGGAGCTAGTGTTACCGACGATGCGA
TCGGCCGTCGAGGAACAGCTCAACCTCATCGCAATCGGTCGAGCCGATTTCCACGCGGTG
TTGACTCACACCACGGAGATCTTCAGGCGGAAGTTCCAATACTTCGTGAGGTCCATAGAG
GCCATGGACCAACTGTTCGAGGTCAGCTTTTCGTCGCTCAAGACCAGCGGCAAGGCGCTG
TCCCGCTGCGGCAAGTGCAGGAGATACATGAGATACATACAGGCGAAGCCCGCCCGCCTG
CACTGCTCCCACTGTGACGACACCTACACGCTGCCCCAGCACGGCACGGTCCGCATTTAC
CGCGAGCTGAAGTGTCCTCTGGACGACTTCGAGCTGCTGTCCTGGTCCACCGGCAGCAAA
GGGAAGAGCTTCCCGCTCTGCCCTTACTGCTACAATCACCCACCATTCAGGGATATGAAG
AAGGGCTTCGGCTGTAACTCCTGCACTCACCCCACTTGTCCCTACGGCGTGAACTCCACC
GGCGTCTCCGGCTGTGTCGAATGTGATGGAGTTTTAGTTTTGGATCCCTCGGCGCCGAAG
TGGAAGCTGGCGTGTAACCGTTGTGACGTCATCATAAACGTGTTCGAGGACGCGAGCCGC
GTGTCCGTGTGCGAGGCGGCGTGCGCGTGCGGCGCTCAGTTAGTGTGCGTCGAGTACCGC
GCCGAGCGGACCAAGCTGCCGGCCGCGCTCACCGAGATGACCGCCTGCCTTTACTGCGAG
CCGGCTTTCAGCGCGCTTGTGGAGAAGCATCGTGCGGTGGCGCCCCGGAGCGGAGGATCG
CGAGGACGGAGCGCCAGGGGCAGAGGGAAACATCGCAACAAACAACCCAAAGACAAAATG
GCCCAATTAGCGGCGTATTTCGTATAA
Protein sequence:
MKTALMVAEKPSLAQNLANILSNGKCNTNKGSNSACAVHEWTGTFKNEPVKFKMTSVCGH
VMSLDFTGKYNNWDKVDPVELFICPTEKKEAMPRLRIPAFLAQEARGCDYLILWLDCDKE
GENICFEVMSCVQNYMKGDVYSPAVTFRARFSAITDKDIKTAMMNLVRPNESESRSVDAR
QELDLRIGCAFTRFQTKYFQGRYGDLDASLISYGPCQTPTLGFCVQRHDDIQTFKPETYW
VLRVTASTSEGRELPLEWKRVRSFEKDIANMFLVGIKEFKEATVVNIQAKEKIKSRPTAL
NTVELMRVASAGLGMGPHHAMQIAERLYTQGYISYPRTETTSYGENFDLIGSLRQQQNSN
KWGSEVRALLANGINKPKKGHDAGDHPPITPMKPASESELEGDMWRIYDYITRHFIATLS
RDCRYLSTTLTFSVGSETFYYTGNTLVDAGYTEIMHWQAFGKDEFVPVLKVDEVLRAHDH
RLVECQTSPPDYLTESEVITLMEKHGIGTDASIPVHINNICQRNYVSVGSGRRLVPTSLG
VVLVHGYQKIDPELVLPTMRSAVEEQLNLIAIGRADFHAVLTHTTEIFRRKFQYFVRSIE
AMDQLFEVSFSSLKTSGKALSRCGKCRRYMRYIQAKPARLHCSHCDDTYTLPQHGTVRIY
RELKCPLDDFELLSWSTGSKGKSFPLCPYCYNHPPFRDMKKGFGCNSCTHPTCPYGVNST
GVSGCVECDGVLVLDPSAPKWKLACNRCDVIINVFEDASRVSVCEAACACGAQLVCVEYR
AERTKLPAALTEMTACLYCEPAFSALVEKHRAVAPRSGGSRGRSARGRGKHRNKQPKDKM
AQLAAYFV