New model in OGS2.0 | DPOGS202630  |
---|---|
Genomic Position | scaffold822:- 9867-14889 |
See gene structure | |
CDS Length | 2085 |
Paired RNAseq reads   | 37 |
Single RNAseq reads   | 131 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008315 (0.0) |
Best Drosophila hit   | Rev1 (2e-113) |
Best Human hit | DNA repair protein REV1 isoform 2 (5e-97) |
Best NR hit (blastp)   | PREDICTED: similar to terminal deoxycytidyl transferase rev1 [Tribolium castaneum] (4e-133) |
Best NR hit (blastx)   | terminal deoxycytidyl transferase rev1 [Culex quinquefasciatus] (3e-116) |
GeneOntology terms    | GO:0017125 deoxycytidyl transferase activity GO:0006281 DNA repair GO:0019985 translesion synthesis GO:0005622 intracellular GO:0003684 damaged DNA binding GO:0003887 DNA-directed DNA polymerase activity GO:0000287 magnesium ion binding |
InterPro families    | IPR001126 DNA-repair protein, UmuC-like IPR017961 DNA polymerase, Y-family, little finger domain IPR017963 DNA-repair protein, UmuC-like, N-terminal |
Orthology group | MCL13622 |
Nucleotide sequence:
ATGCATATTGACATGGATTGCTTTTTTGTGTCTGTTGGTTTGAGAAATAGACCAGAATTG
AGAGGAAAACCGGTAGCTGTCACACATTCAAAAGGAGGCCAGCCTGGAAGTAAACGGTCT
GGCAATGATGCAATTACTGAATCCAACTTATACAAACAAAGGCAAGCCAAAAAAATTGGT
ATTGCTCTGGATATCAAAGACGATATTGATACAGAATCTGCTGAAAGTGGTTATGAGGAG
GAATCATCATATGGGTCTATGAGTGAAATAGCTTCTTGTTCTTATGAAGCAAGGGCTAAG
GGCATCAAAAATGGAATGTTTATGGGGAAAGCTCTGAAGCTTTGCCCCGAATTAAGGACA
ATTCCATATGATTTTGATGGTTACAAAGATGTTGCTTTTAAATTATACAACACTATAGCT
AATTACACATTAGATATTGAAGCTGTGTCATGTGATGAAATGTATGTTGATTGTACAGAA
CTCTTGAAATCAATGAATGTAAGTGTTACTGACTTTGCCACCGCTTTGAGAGAGGAGATA
AAAAGTATAACTAAATGTCCCTGCTCAACAGGGTTTGGTGGTAATAGATTGCAAGCTCGC
TTGGCTACGAAGAAGGCTAAGCCCAATGGACAATTCTTTCTTACAGCTGATATAGTTAAC
GATTTTATGTACAATATACAACTTAGTGATCTACCAGGTGTTGGGTATCAGACTTCACAT
AAATTAGAATCTTTAGGGTACCAGACTTGTGGATCCCTGTTAAGCTTGAGCTTAATAAAC
CTTCAACAACATTTAGGAAAAAAGACTGGCGCACAATTATATGAACAGATTCGTGGACAA
GATTCACACCCTTTATCTTTCCACACAGTGAGAAAATCTGTTTCCGCGGAAGTAAATTAT
GGTATTCGTTTTGAGAATAATGATCAATGCAAAGAGTTTTTAAAGCAACTTTCTGCTGAG
GTCCATTCTCGGATGCAACAATTCAAAGTAATTGGCAAATGTATTACCTTAAAATTGATG
GTCAGGGCTGAAAATGCTCCGGTACAAACTGCAAAGTTCATGGGTCATGGCTACTGTGAC
GTCATAAACAAGTCTACAACATTGCAAAATGCAACTAATGATGTTGAAATAATAACAAAG
GAAGTTATTTCAATATGTAAGAAACAAAACATAGATCCAAAAGAAATGCGTGGAATAGGA
ATTCAAGTCACTAAGCTAGAACCAATCAATATTAAGCCAATAAAAGGAGCAATTAACAAA
TTTTTGACGTCCAAACCCGTCCCTAAATCTGAAAAAAATATTTCAAATGAAAACATTGTT
GACATAGGTGTTAAAGTTAAAGTACCTACGACTCCGAAAAAAGTTACAACTTGCACAACC
CTACAAAAATCTCCTATTTTAAATATATCTAAATCTCCTAAAGGGAAGAGAAGAGGACGA
CCACCCAAACATTCTAAACCTCAGATATCTTTTAACCCACTTAGTAGATTTTTTCATTCA
AATACTGAAATAACTGTTAAAAGTGAAATAAAAACAGAAGAATTAAGCAAAATCGTCATA
AAAGAAGATATATCTAAAGAAGAAAAGCCGAAGCCCCAGGGTTTACTCGGATTACCGTGG
GATAAAATAAGAGAATTACTCCGAGCCTGGTTTGAAAGCGGACAAACTCCTAAACATTGT
GATATCCAATTAATTGCTGGTTACATGCGAGATATGGAAGACAAGGAAACTGATGCTGGA
CCATCTCAAGCACAAAAAGCAGAAAAAGAAAATTTACAATCAGACAATACAGACAATAGT
TGGAGAAATTGGACACCAGCAGCATTGAAGACCAAGGCGTCTAGTACTCTTAAACGGAAA
AATAATCCATCATCATCATCACCATCATCATGGCTTCATCGAAGAAGACAAAGAACCTAT
CTACAGGAGAAAGTTTTACAAAATAAAATTGGATTATTAGAAATATTGAAACAAAATGCT
CATAGAGAAGCTGAATTAAAAACTAAACTGCTCGAGGAACAAATTAAGCAGGAACTAATA
AGAACGAAAATTTTAACATTGGAACTACAAAAACTGCAACAGTAA
Protein sequence:
MHIDMDCFFVSVGLRNRPELRGKPVAVTHSKGGQPGSKRSGNDAITESNLYKQRQAKKIG
IALDIKDDIDTESAESGYEEESSYGSMSEIASCSYEARAKGIKNGMFMGKALKLCPELRT
IPYDFDGYKDVAFKLYNTIANYTLDIEAVSCDEMYVDCTELLKSMNVSVTDFATALREEI
KSITKCPCSTGFGGNRLQARLATKKAKPNGQFFLTADIVNDFMYNIQLSDLPGVGYQTSH
KLESLGYQTCGSLLSLSLINLQQHLGKKTGAQLYEQIRGQDSHPLSFHTVRKSVSAEVNY
GIRFENNDQCKEFLKQLSAEVHSRMQQFKVIGKCITLKLMVRAENAPVQTAKFMGHGYCD
VINKSTTLQNATNDVEIITKEVISICKKQNIDPKEMRGIGIQVTKLEPINIKPIKGAINK
FLTSKPVPKSEKNISNENIVDIGVKVKVPTTPKKVTTCTTLQKSPILNISKSPKGKRRGR
PPKHSKPQISFNPLSRFFHSNTEITVKSEIKTEELSKIVIKEDISKEEKPKPQGLLGLPW
DKIRELLRAWFESGQTPKHCDIQLIAGYMRDMEDKETDAGPSQAQKAEKENLQSDNTDNS
WRNWTPAALKTKASSTLKRKNNPSSSSPSSWLHRRRQRTYLQEKVLQNKIGLLEILKQNA
HREAELKTKLLEEQIKQELIRTKILTLELQKLQQ