DPGLEAN12009 in OGS1.0

New model in OGS2.0  DPOGS202630
Genomic Position  scaffold822:- 9867-14889
CDS Length  2085
Paired RNAseq reads  37
Single RNAseq reads  131
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008315 (0.0)
Best Drosophila hit  Rev1 (2e-113)
Best Human hit  DNA repair protein REV1 isoform 2 (5e-97)
Best NR hit (blastp)  PREDICTED: similar to terminal deoxycytidyl transferase rev1 [Tribolium castaneum] (4e-133)
Best NR hit (blastx)  terminal deoxycytidyl transferase rev1 [Culex quinquefasciatus] (3e-116)
GeneOntology terms
GO:0017125 deoxycytidyl transferase activity
GO:0006281 DNA repair
GO:0019985 translesion synthesis
GO:0005622 intracellular
GO:0003684 damaged DNA binding
GO:0003887 DNA-directed DNA polymerase activity
GO:0000287 magnesium ion binding
InterPro families
IPR001126 DNA-repair protein, UmuC-like
IPR017961 DNA polymerase, Y-family, little finger domain
IPR017963 DNA-repair protein, UmuC-like, N-terminal
Orthology group  MCL13622

Nucleotide sequence:

ATGCATATTGACATGGATTGCTTTTTTGTGTCTGTTGGTTTGAGAAATAGACCAGAATTG
AGAGGAAAACCGGTAGCTGTCACACATTCAAAAGGAGGCCAGCCTGGAAGTAAACGGTCT
GGCAATGATGCAATTACTGAATCCAACTTATACAAACAAAGGCAAGCCAAAAAAATTGGT
ATTGCTCTGGATATCAAAGACGATATTGATACAGAATCTGCTGAAAGTGGTTATGAGGAG
GAATCATCATATGGGTCTATGAGTGAAATAGCTTCTTGTTCTTATGAAGCAAGGGCTAAG
GGCATCAAAAATGGAATGTTTATGGGGAAAGCTCTGAAGCTTTGCCCCGAATTAAGGACA
ATTCCATATGATTTTGATGGTTACAAAGATGTTGCTTTTAAATTATACAACACTATAGCT
AATTACACATTAGATATTGAAGCTGTGTCATGTGATGAAATGTATGTTGATTGTACAGAA
CTCTTGAAATCAATGAATGTAAGTGTTACTGACTTTGCCACCGCTTTGAGAGAGGAGATA
AAAAGTATAACTAAATGTCCCTGCTCAACAGGGTTTGGTGGTAATAGATTGCAAGCTCGC
TTGGCTACGAAGAAGGCTAAGCCCAATGGACAATTCTTTCTTACAGCTGATATAGTTAAC
GATTTTATGTACAATATACAACTTAGTGATCTACCAGGTGTTGGGTATCAGACTTCACAT
AAATTAGAATCTTTAGGGTACCAGACTTGTGGATCCCTGTTAAGCTTGAGCTTAATAAAC
CTTCAACAACATTTAGGAAAAAAGACTGGCGCACAATTATATGAACAGATTCGTGGACAA
GATTCACACCCTTTATCTTTCCACACAGTGAGAAAATCTGTTTCCGCGGAAGTAAATTAT
GGTATTCGTTTTGAGAATAATGATCAATGCAAAGAGTTTTTAAAGCAACTTTCTGCTGAG
GTCCATTCTCGGATGCAACAATTCAAAGTAATTGGCAAATGTATTACCTTAAAATTGATG
GTCAGGGCTGAAAATGCTCCGGTACAAACTGCAAAGTTCATGGGTCATGGCTACTGTGAC
GTCATAAACAAGTCTACAACATTGCAAAATGCAACTAATGATGTTGAAATAATAACAAAG
GAAGTTATTTCAATATGTAAGAAACAAAACATAGATCCAAAAGAAATGCGTGGAATAGGA
ATTCAAGTCACTAAGCTAGAACCAATCAATATTAAGCCAATAAAAGGAGCAATTAACAAA
TTTTTGACGTCCAAACCCGTCCCTAAATCTGAAAAAAATATTTCAAATGAAAACATTGTT
GACATAGGTGTTAAAGTTAAAGTACCTACGACTCCGAAAAAAGTTACAACTTGCACAACC
CTACAAAAATCTCCTATTTTAAATATATCTAAATCTCCTAAAGGGAAGAGAAGAGGACGA
CCACCCAAACATTCTAAACCTCAGATATCTTTTAACCCACTTAGTAGATTTTTTCATTCA
AATACTGAAATAACTGTTAAAAGTGAAATAAAAACAGAAGAATTAAGCAAAATCGTCATA
AAAGAAGATATATCTAAAGAAGAAAAGCCGAAGCCCCAGGGTTTACTCGGATTACCGTGG
GATAAAATAAGAGAATTACTCCGAGCCTGGTTTGAAAGCGGACAAACTCCTAAACATTGT
GATATCCAATTAATTGCTGGTTACATGCGAGATATGGAAGACAAGGAAACTGATGCTGGA
CCATCTCAAGCACAAAAAGCAGAAAAAGAAAATTTACAATCAGACAATACAGACAATAGT
TGGAGAAATTGGACACCAGCAGCATTGAAGACCAAGGCGTCTAGTACTCTTAAACGGAAA
AATAATCCATCATCATCATCACCATCATCATGGCTTCATCGAAGAAGACAAAGAACCTAT
CTACAGGAGAAAGTTTTACAAAATAAAATTGGATTATTAGAAATATTGAAACAAAATGCT
CATAGAGAAGCTGAATTAAAAACTAAACTGCTCGAGGAACAAATTAAGCAGGAACTAATA
AGAACGAAAATTTTAACATTGGAACTACAAAAACTGCAACAGTAA

Protein sequence:

MHIDMDCFFVSVGLRNRPELRGKPVAVTHSKGGQPGSKRSGNDAITESNLYKQRQAKKIG
IALDIKDDIDTESAESGYEEESSYGSMSEIASCSYEARAKGIKNGMFMGKALKLCPELRT
IPYDFDGYKDVAFKLYNTIANYTLDIEAVSCDEMYVDCTELLKSMNVSVTDFATALREEI
KSITKCPCSTGFGGNRLQARLATKKAKPNGQFFLTADIVNDFMYNIQLSDLPGVGYQTSH
KLESLGYQTCGSLLSLSLINLQQHLGKKTGAQLYEQIRGQDSHPLSFHTVRKSVSAEVNY
GIRFENNDQCKEFLKQLSAEVHSRMQQFKVIGKCITLKLMVRAENAPVQTAKFMGHGYCD
VINKSTTLQNATNDVEIITKEVISICKKQNIDPKEMRGIGIQVTKLEPINIKPIKGAINK
FLTSKPVPKSEKNISNENIVDIGVKVKVPTTPKKVTTCTTLQKSPILNISKSPKGKRRGR
PPKHSKPQISFNPLSRFFHSNTEITVKSEIKTEELSKIVIKEDISKEEKPKPQGLLGLPW
DKIRELLRAWFESGQTPKHCDIQLIAGYMRDMEDKETDAGPSQAQKAEKENLQSDNTDNS
WRNWTPAALKTKASSTLKRKNNPSSSSPSSWLHRRRQRTYLQEKVLQNKIGLLEILKQNA
HREAELKTKLLEEQIKQELIRTKILTLELQKLQQ
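
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein listings above can be cross-checked against the stated CDS length. The sketch below assumes the standard genetic code (NCBI translation table 1). It translates only the first 60 nt and the final 45 nt of the nucleotide listing, and the function and variable names are hypothetical. A 2085 nt CDS ending in a stop codon implies 2085 / 3 - 1 = 694 residues.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing in this record.
first_60_nt = "ATGCATATTGACATGGATTGCTTTTTTGTGTCTGTTGGTTTGAGAAATAGACCAGAATTG"
last_45_nt = "AGAACGAAAATTTTAACATTGGAACTACAAAAACTGCAACAGTAA"

n_terminus = translate(first_60_nt)
c_terminus = translate(last_45_nt)

# Protein length implied by the stated CDS length (one codon is the stop).
cds_length = 2085
expected_protein_length = cds_length // 3 - 1

print(n_terminus)
print(c_terminus)
print(expected_protein_length)
```

The two translated fragments should match the start and end of the protein listing, with the trailing '*' standing for the TAA stop codon that the protein listing omits.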