DPGLEAN15553 in OGS1.0

New model in OGS2.0DPOGS207004 
Genomic Positionscaffold1:+ 645865-647449
See gene structure
CDS Length978
Paired RNAseq reads  21
Single RNAseq reads  50
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012924 (1e-107)
Best Drosophila hit  Ogg1 (4e-69)
Best Human hitN-glycosylase/DNA lyase isoform 1a (3e-61)
Best NR hit (blastp)  PREDICTED: similar to N-glycosylase/DNA lyase [Tribolium castaneum] (3e-97)
Best NR hit (blastx)  PREDICTED: similar to N-glycosylase/DNA lyase [Tribolium castaneum] (3e-97)
GeneOntology terms






  
GO:0008534 oxidized purine base lesion DNA N-glycosylase activity
GO:0003906 DNA-(apurinic or apyrimidinic site) lyase activity
GO:0005634 nucleus
GO:0006281 DNA repair
GO:0003684 damaged DNA binding
GO:0006284 base-excision repair
GO:0006289 nucleotide-excision repair
GO:0006974 response to DNA damage stimulus
InterPro families



  
IPR003265 HhH-GPD domain
IPR012904 8-oxoguanine DNA glycosylase, N-terminal
IPR011257 DNA glycosylase
IPR012294 Transcription factor TFIID, C-terminal/DNA glycosylase, N-terminal
IPR023170 Helix-turn-helix, base-excision DNA repair, C-terminal
Orthology groupMCL15060

Nucleotide sequence:

ATGGCTTGGAATAAAATAAATTGTTGTCAGCGAGAATTGCAATTGCTTGGTACACTTAAC
GGAGGTCAAAGTTTTAGGTGGAATTATAATAAAGACACAAATGAATGGAAAGGCGTTTTT
TCAAGAACCTTATGGAAGTTACGGCAACGAGACGATTTTTTGGAATATCAAGTTTTAGGA
TCTCTACTCATTAAATCAAAAGAAAATAATTCTGTTAAAGTAGATTTTGCGGATATGCTT
ACAAAATATTTTAGGTTAGATTTCAACTTAAAAGACCACTATAAAGTATGGTCAGATAAA
GATGAACTTTTTAAATCTGCCTGTACAAAGTTCTATGGAATAAGAATGCTAAATCAGGAG
CCTGTAGAAAATCTTTTTTCGTTTATCTGCAGCCAGAACAATCATATTTCCAGGATATCC
AGCCTGGTTGAAAAACTCTGCATCTATTATGGTGATGAAATTTGTCAGTTTGAAGGAGTG
ACATATTATGCTTTTCCTGATGTGGAAAAGCTTATGGACATAAAAGTGGAATCTAAATTA
AGAGAACTAGGTTTTGGTTATAGAGCCAAATTTATTCAAAAATCAGCAGCTCAGATTGTA
GAGTGGGGAGGAGACGAATGGTTTAAAAGATTAAAGGATATGAAATACAAGGACGCCCGA
CAGGAACTTATAAAATTGTGTGGAATCGGACCTAAAGTCGCTGACTGTATATGCCTGATG
TCATTGAATCATCTAGAGGCACTTCCTGTTGACACGCACGTGTATCAAATAGCTGCCACA
AACTATCTCCCACACTTGAAAGGTAAAAAAAGTGTCACAGAAAAAATTTATACTGAAATA
GGCGACCACTTTAGAAGTTTGTATGGAGATAAAGCAGGATGGGCACATACTGTGCTCTTC
TGTGCTGATTTAAAAAAATTTCAACAAGATGACTCAAATGAGGATGTCGTTAAAAGTAAA
AGAAAAAAGAAAAAATAA

Protein sequence:

MAWNKINCCQRELQLLGTLNGGQSFRWNYNKDTNEWKGVFSRTLWKLRQRDDFLEYQVLG
SLLIKSKENNSVKVDFADMLTKYFRLDFNLKDHYKVWSDKDELFKSACTKFYGIRMLNQE
PVENLFSFICSQNNHISRISSLVEKLCIYYGDEICQFEGVTYYAFPDVEKLMDIKVESKL
RELGFGYRAKFIQKSAAQIVEWGGDEWFKRLKDMKYKDARQELIKLCGIGPKVADCICLM
SLNHLEALPVDTHVYQIAATNYLPHLKGKKSVTEKIYTEIGDHFRSLYGDKAGWAHTVLF
CADLKKFQQDDSNEDVVKSKRKKKK