New model in OGS2.0 | DPOGS212434  |
---|---|
Genomic Position | scaffold1732:29310-32425 (- strand) |
CDS Length | 2358 bp |
Paired RNAseq reads   | 239 |
Single RNAseq reads   | 626 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002892 (0.0) |
Best Drosophila hit   | DNApol-eta (2e-113) |
Best Human hit | DNA polymerase eta (3e-89) |
Best NR hit (blastp)   | DNA polymerase eta [Aedes aegypti] (9e-153) |
Best NR hit (blastx)   | DNA polymerase eta [Aedes aegypti] (2e-144) |
GeneOntology terms    | GO:0003887 DNA-directed DNA polymerase activity; GO:0019985 translesion synthesis; GO:0003684 damaged DNA binding |
InterPro families    | IPR017963 DNA-repair protein, UmuC-like, N-terminal; IPR001126 DNA-repair protein, UmuC-like; IPR017961 DNA polymerase, Y-family, little finger domain; IPR017061 DNA polymerase eta |
Orthology group | MCL13962 |
Nucleotide sequence:
ATGGATCAGGAGAACAGGATTGTTGTACTGATAGATATGGACTGTTTTTATTGTCAAGTA
GAAGAAAAATTAAATCCACAATTGAAAGGCAAACCAATTGCTGTCGTGCAATATAATCCC
TGGAGAGGAGGAGGAATTATAGCCGTGAACTATGTTGCTCGAGCCATGGGAGTAACCAGG
CACATGAGAGGTAATGAAGCTAAGCAAAAGTGTCCAGAAATACAACTACCATCGGTGCCA
TGTTTCAGAGGGAAGGCTGATATAACCAAGTACAGGGAAGCGGGCAAAGATGTTGCTAAG
GTCCTACAAAGGTTTACACCCTTATTGGAAAGAGCTTCGATTGATGAGGCATATTTAGAT
ATCACAGACCCGGTGCGGAAGAGAATTCTAAACATTGATGTCAGGGACATAAATTCTAAC
ATGCTACCAAATAATTTTGCCCTCGGTTATGATACCTTAGATTCCTTCATATCTGATGTA
CATAGCTGTGGCCTGTCGTCTATGGAGTTTGATTATGAACACTCAAAACATCTTCTTGTC
GGTGCTCTCATAGTTAGCGAGATAAGGGCTGCGGTATACGCTGAAACTGGCTACCAATGT
TCAGCCGGGATAGCTCATAATAAAATCTTGGCAAAGCTCGTGTGTGGTATGAACAAGCCC
AACAAACAGACAGTGTTACCAAAACATTCTGTTAACATTCTATACAAGACATTGTCACTC
AAAAAAGTAAAGCACTTGGGTGGGAAGTTTGGGGATCACGTCGCTGAAACTCTTAATATT
AGTACGATGGGACAACTACAGAGATTCACGGAAAAGGATCTTCAGGCGAGATTTGATGAA
AAGAACGGTTCCTGGTTGTACAATATTGCCCGCGGCGTTGACTTGGAACCAGTCCAAGCT
AGATTTAACCCTAAAAGTATCGGTTGTTGCAAACAGCTGAGAGGCAAAGCGGCTCTGCAG
GATTTAGTCAGCCTCAGGAAGTGGCTTCGAGATCTAGGCGATGAAATCGAGAACCGATTG
GAACAGGACTCATTAGAAAATAATCGGATCCCGAAACAAATGGTTGTTAGTTTTTCTTTA
CAAGCTTCCAAAGGGAAGAGAGATATAAGTAGTTCAAGGTCTTACAATTTCAGCCCCGAA
GATGAATTATGTGGAGAAATATTTTCGAGTAAGGCCTTGGAGCTAGTGATGGACAGTGCC
GAAGGCTGCAAACCGACAGATGGCGAACTCAACAGGATGTTGAAATCACCGATAACGTTT
TTAGGCATAAGTGTTGGGAAATTTGATGATAATATTGATGCGAAGAAAACGAAAAAGATC
AAAGATTATTTCAGTGCCGGGTCGTCTAAGGATGTGTCACAAACCGATGAAAGCGTTAGG
ATAAAACTTGAGAGATGTGTTGAGAAGGATGGAACCAACGCTGGCAAAGAATACGTTTTA
GAAAAATACTTTGAATCATCTGATGACGTTAGAAAAGAAAATATTACGAGTAAAATAAGT
ACAGAAACAGAACAACGTAAAGAGACGGTATACCAGTCAAGTTTGGACAGACAGGAGTCA
TTTTTTGCAAAATATTTAAACAGTGGAAGATCAAATGTTGCGAATGACAGAACGCCCTGT
ACACGCGCGGCCTGCGGACAAACATTGCACCTTAGTAACGCTGAAGCCAGTAACGACACG
GACTATTCAGGTTCAACAATAAATGAGGAAATTAACAGGAGTATAGCTCTGTTTGAAGAT
GATCCAGATGATGTAACGCGAGTTGTTAATATGAGGCAGCTGTTGAAGACATCGGAAGCG
AAGTTAGAAGATGTGGAGGATGGAGACAGAGCTCAGACAGGAACAGCGCCGATAGAACCT
GAACGAAATAAATCTCCCGATATAAACAGCGTTGAATGCTCTGAATGTGGCAAGACAGTG
TCTTTGGACAAATTCGATGAACACTCAGATTATCATTTAGCACTTAAATTAAGAGAGGAA
ATGAGACAGGAGGTCAGAAGAGAACAGAACAGAACAAAATCTGTTCTGTCAGAAACTAAT
AAGAATTCTCCAAATAAAAAAGAAACACCAGAAAAACAATGCAACAGGAGTGACAATGTA
CCTTCAATAGTCAATTTTTTTACAAAATTCGACAGATCCATTGAAACAAAATTATGTGCG
GAATGTGGAAAGAAAGTCCCCATTAACAAACTCCCTGAACATCTAGATTTCCACGAAGCT
CAGAAATTGAGCAGAGAAATAAACAACCGGTCAAGTGTAGTGAATGTAACGAGTGCTAAA
AGAAAAAGAAAGTCGTCATCTCCAGTAAAAAAAAACAAAGTGCCTTGTAAGTCAATAGAT
CTGTTCTTTAGACAATAG
Protein sequence:
MDQENRIVVLIDMDCFYCQVEEKLNPQLKGKPIAVVQYNPWRGGGIIAVNYVARAMGVTR
HMRGNEAKQKCPEIQLPSVPCFRGKADITKYREAGKDVAKVLQRFTPLLERASIDEAYLD
ITDPVRKRILNIDVRDINSNMLPNNFALGYDTLDSFISDVHSCGLSSMEFDYEHSKHLLV
GALIVSEIRAAVYAETGYQCSAGIAHNKILAKLVCGMNKPNKQTVLPKHSVNILYKTLSL
KKVKHLGGKFGDHVAETLNISTMGQLQRFTEKDLQARFDEKNGSWLYNIARGVDLEPVQA
RFNPKSIGCCKQLRGKAALQDLVSLRKWLRDLGDEIENRLEQDSLENNRIPKQMVVSFSL
QASKGKRDISSSRSYNFSPEDELCGEIFSSKALELVMDSAEGCKPTDGELNRMLKSPITF
LGISVGKFDDNIDAKKTKKIKDYFSAGSSKDVSQTDESVRIKLERCVEKDGTNAGKEYVL
EKYFESSDDVRKENITSKISTETEQRKETVYQSSLDRQESFFAKYLNSGRSNVANDRTPC
TRAACGQTLHLSNAEASNDTDYSGSTINEEINRSIALFEDDPDDVTRVVNMRQLLKTSEA
KLEDVEDGDRAQTGTAPIEPERNKSPDINSVECSECGKTVSLDKFDEHSDYHLALKLREE
MRQEVRREQNRTKSVLSETNKNSPNKKETPEKQCNRSDNVPSIVNFFTKFDRSIETKLCA
ECGKKVPINKLPEHLDFHEAQKLSREINNRSSVVNVTSAKRKRKSSSPVKKNKVPCKSID
LFFRQ
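
The nucleotide and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, not part of the original record: the `translate` helper and the variable names are hypothetical, and it assumes the standard code (NCBI translation table 1) applies and that the CDS contains only unambiguous A/C/G/T bases. For brevity it translates only the first 30 and last 18 nucleotides of the CDS as listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; hypothetical helper."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Raises KeyError on ambiguous bases such as N (not handled here).
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 18 nt of the CDS, copied from the record above.
cds_start = "ATGGATCAGGAGAACAGGATTGTTGTACTG"
cds_end = "CTGTTCTTTAGACAATAG"

print(translate(cds_start))  # MDQENRIVVL
print(translate(cds_end))    # LFFRQ*

# A 2358 nt CDS holds 786 codons: 785 residues plus the terminal stop.
cds_length = 2358
print(cds_length // 3 - 1)   # 785
```

The translated ends match the start (`MDQENRIVVL`) and end (`LFFRQ`) of the protein sequence, and the expected length of 785 residues agrees with the 13 full lines of 60 residues plus the final 5 listed above.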