New model in OGS2.0 | DPOGS207004  |
---|---|
Genomic Position | scaffold1:+ 645865-647449 |
CDS Length | 978 |
Paired RNAseq reads   | 21 |
Single RNAseq reads   | 50 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012924 (1e-107) |
Best Drosophila hit   | Ogg1 (4e-69) |
Best Human hit | N-glycosylase/DNA lyase isoform 1a (3e-61) |
Best NR hit (blastp)   | PREDICTED: similar to N-glycosylase/DNA lyase [Tribolium castaneum] (3e-97) |
Best NR hit (blastx)   | PREDICTED: similar to N-glycosylase/DNA lyase [Tribolium castaneum] (3e-97) |
GeneOntology terms | GO:0008534 oxidized purine base lesion DNA N-glycosylase activity; GO:0003906 DNA-(apurinic or apyrimidinic site) lyase activity; GO:0005634 nucleus; GO:0006281 DNA repair; GO:0003684 damaged DNA binding; GO:0006284 base-excision repair; GO:0006289 nucleotide-excision repair; GO:0006974 response to DNA damage stimulus |
InterPro families | IPR003265 HhH-GPD domain; IPR012904 8-oxoguanine DNA glycosylase, N-terminal; IPR011257 DNA glycosylase; IPR012294 Transcription factor TFIID, C-terminal/DNA glycosylase, N-terminal; IPR023170 Helix-turn-helix, base-excision DNA repair, C-terminal |
Orthology group | MCL15060 |
Nucleotide sequence:
ATGGCTTGGAATAAAATAAATTGTTGTCAGCGAGAATTGCAATTGCTTGGTACACTTAAC
GGAGGTCAAAGTTTTAGGTGGAATTATAATAAAGACACAAATGAATGGAAAGGCGTTTTT
TCAAGAACCTTATGGAAGTTACGGCAACGAGACGATTTTTTGGAATATCAAGTTTTAGGA
TCTCTACTCATTAAATCAAAAGAAAATAATTCTGTTAAAGTAGATTTTGCGGATATGCTT
ACAAAATATTTTAGGTTAGATTTCAACTTAAAAGACCACTATAAAGTATGGTCAGATAAA
GATGAACTTTTTAAATCTGCCTGTACAAAGTTCTATGGAATAAGAATGCTAAATCAGGAG
CCTGTAGAAAATCTTTTTTCGTTTATCTGCAGCCAGAACAATCATATTTCCAGGATATCC
AGCCTGGTTGAAAAACTCTGCATCTATTATGGTGATGAAATTTGTCAGTTTGAAGGAGTG
ACATATTATGCTTTTCCTGATGTGGAAAAGCTTATGGACATAAAAGTGGAATCTAAATTA
AGAGAACTAGGTTTTGGTTATAGAGCCAAATTTATTCAAAAATCAGCAGCTCAGATTGTA
GAGTGGGGAGGAGACGAATGGTTTAAAAGATTAAAGGATATGAAATACAAGGACGCCCGA
CAGGAACTTATAAAATTGTGTGGAATCGGACCTAAAGTCGCTGACTGTATATGCCTGATG
TCATTGAATCATCTAGAGGCACTTCCTGTTGACACGCACGTGTATCAAATAGCTGCCACA
AACTATCTCCCACACTTGAAAGGTAAAAAAAGTGTCACAGAAAAAATTTATACTGAAATA
GGCGACCACTTTAGAAGTTTGTATGGAGATAAAGCAGGATGGGCACATACTGTGCTCTTC
TGTGCTGATTTAAAAAAATTTCAACAAGATGACTCAAATGAGGATGTCGTTAAAAGTAAA
AGAAAAAAGAAAAAATAA
Protein sequence:
MAWNKINCCQRELQLLGTLNGGQSFRWNYNKDTNEWKGVFSRTLWKLRQRDDFLEYQVLG
SLLIKSKENNSVKVDFADMLTKYFRLDFNLKDHYKVWSDKDELFKSACTKFYGIRMLNQE
PVENLFSFICSQNNHISRISSLVEKLCIYYGDEICQFEGVTYYAFPDVEKLMDIKVESKL
RELGFGYRAKFIQKSAAQIVEWGGDEWFKRLKDMKYKDARQELIKLCGIGPKVADCICLM
SLNHLEALPVDTHVYQIAATNYLPHLKGKKSVTEKIYTEIGDHFRSLYGDKAGWAHTVLF
CADLKKFQQDDSNEDVVKSKRKKKK
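
Consistency check (illustrative, not part of the original record): the listed CDS length and the two sequences can be cross-checked by translating the nucleotide sequence with the standard genetic code. The sketch below assumes the standard code (NCBI translation table 1) applies to this nuclear gene; the helper name `translate` and the two test fragments, taken from the first and last lines of the nucleotide sequence above, are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence listed above.
first_line = "ATGGCTTGGAATAAAATAAATTGTTGTCAGCGAGAATTGCAATTGCTTGGTACACTTAAC"
last_line = "AGAAAAAAGAAAAAATAA"

n_terminal = translate(first_line)   # should agree with the start of the protein sequence
c_terminal = translate(last_line)    # should end with the stop codon '*'

# A 978 nt CDS holds 326 codons: 325 residues plus one stop codon.
expected_residues = 978 // 3 - 1
```

Applying `translate` to the full 978 nt sequence and dropping the trailing `*` should reproduce the protein sequence listed above, if the assumption about the genetic code holds.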