DPGLEAN15891 in OGS1.0

New model in OGS2.0DPOGS204318 
Genomic Positionscaffold644:+ 137596-142720
See gene structure
CDS Length2169
Paired RNAseq reads  312
Single RNAseq reads  741
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007587 (0.0)
Best Drosophila hit  XPG-like endonuclease (3e-102)
Best Human hitflap endonuclease GEN homolog 1 (4e-55)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC006044 [Tribolium castaneum] (8e-122)
Best NR hit (blastx)  PREDICTED: similar to XPG-like endonuclease CG10670-PA [Tribolium castaneum] (5e-112)
GeneOntology terms










  
GO:0048256 flap endonuclease activity
GO:0003684 damaged DNA binding
GO:0005634 nucleus
GO:0006284 base-excision repair
GO:0000014 single-stranded DNA specific endodeoxyribonuclease activity
GO:0004519 endonuclease activity
GO:0000737 DNA catabolic process, endonucleolytic
GO:0000738 DNA catabolic process, exonucleolytic
GO:0004520 endodeoxyribonuclease activity
GO:0008310 single-stranded DNA specific 3'-5' exodeoxyribonuclease activity
GO:0035312 5'-3' exodeoxyribonuclease activity
GO:0008311 double-stranded DNA specific 3'-5' exodeoxyribonuclease activity
InterPro families


  
IPR020045 5'-3' exonuclease, C-terminal subdomain
IPR006084 DNA repair protein (XPGC)/yeast Rad
IPR006085 XPG N-terminal
IPR006086 XPG/RAD2 endonuclease
Orthology groupMCL15318

Nucleotide sequence:

ATGGGTATAAAGGGGCTATGGACGGTGTTAGCTCCATACTCTGAGAAGATATCATTACAC
GAAATCAGTGGTCAAACTGTTGCCATAGACTTAGCTGGATGGGTTTGTGATAGCCAAAAT
GTTACTGATTATTATATTCAGCCTAAGCTATATCTAAGAAATCTATTCTTCCGAACTCTT
TACTTAGTGCTAAGTGATGTAAATCCTATATTTGTGCTTGAGGGTGATGCTCCAGAACTT
AAGAGAGATGTTATGGCTGCTAGAAATGCATTGCAGTTTAAAGGTGCAGCCCCTAAAGCT
ACCACAGAGAAAACAAAACAAACTAACATAACAAGAACAAGGTTCAAAGGGGTATTAAAA
GAGTGTGAAAATCTCTTAAGGACAATGGGAGTGAGATGTGTAAAAGGTAGAGGAGAAGCT
GAAGCTGCTTGTGCTAGATTAAATGCTGAAGGTTTAGTAGATGCTGTTGTATCCCAAGAT
TCTGACTGCTTCGCATATGGAGCTAAGAAAGTGTATCGCAACTTCAGTGTATCAAGCGCT
GGTGGTGGAGGAGCCACACATGGCTCGGTGGACGTGTATGATGCTGTCAAGATGTTTAAT
AATAAGGGGTTTGGACGTAACAAGATGGTTGCGTTAGCATTGCTCTGTGGTTCTGACTAC
GGAGTTGGTGTCTGCGGGTCGTCTAAAACAACCGTTGTTTCTTTTCTCCACACTGTCCCA
GAAGATCAAGTTATATCGAGGTTATTATCGTGGGTGAGTGATCCACAGCACTACGAGGCG
CAGTCCCGCTGGGTGTCCGTCCCGGGTCGCTGTGACCGCTGCGGGCACGCGGGCCGCACT
CACCTCAAGAAGGGTTGCTCCACGTGCGCCACTCACCAAGGATGTAATGATACTGGACAT
AAATCAAAATTATGTGATGTAAAACGTGAACTATTGCTTCGCAATAAAGCCTTGTCGTCC
GGCATACCGTTTCCGGAGCCGAAAGTTATGAAGGAATTCCTGAACAGTACACCAGAAGAT
ATAGATCTAGACACTTTGAAGATTCCTAAACCCAGTTTAATACAATTCGTGAAAATTATG
TCGTACAAATTGGATTGGCCGCAGCGGTACTGCGTTGAAAAGTTTCTGCCATTATTAACG
AAATGGCACCTCCAGGATAATGTTGCGTCTAGGACGTTACGACCGCTTGAAATCAGAAAG
AAAAGACATCCGAAAGGCGTACCGAGTTATGAAGTTGTATGGGGGGATATTGATGGGCAT
TATGAAGGACTAATACCTGACGAGCAGTTGGAAGAGGACGAAGATGTTTCAGCACCTTGG
GTGACGATCGAGAGGCAGGATTTGATGTTTAAATATTATCCCGGTATAGTAGAGAAGTAT
GAAGAGTCTATAAAGAAGCCCCCCAAAGAAAAGAAAACAAGGGGCAGGAAAAAGAAGGAA
GAAAATGAAAATTCTGAAGAAGTTCACAAGACAAAGAGGAAATATACTAGAAAACCAAAA
CCAATTACAGACTTTATGACTACTCTCAATAGGTCTTTGAAAAATCTAAGTCTGTCTAAG
AAAGATACAGAACTGAATTCCAGTAAAGTGAGCGTTAATGTAAATAAACTGAAGAGAAAA
ATCAAAAATAACGGTAAAGTGAAGACAAAAAATACAATAGAAAGCTATTTGAAACCGTGC
AAAAAGAAAAAGTCTTCGACTATGACCAGTTTGATAAGTGGAAGTCAAAACAAGTCTAAG
TCCTTTAAACTGAATTTGGGAAATAAAGAAAATATGGAACCGAAAAAGCATTTGTTGAGC
ATGTTCAATGATAGTTCAAGTGATGTAGAGGCAAACGATTTATCGGATATTGTAGAAAAA
ATCGTATCGAGATCGGCTCCTTCAACTAAAGCAAAAGTGGACAGTAATTACGTGAAATTG
ATATTTGAGAATAAACTTGATAGGAAATCTATTCTGACGCAAAGATTTGATCAAAAAAAT
TGTTCGACTCCTATAAGCAGTCCGATAAGGAAAAGTTGTCAGCAGAGAAGGACGTCCAAA
TCCTCAATATCGGAAGCTGTAGACACCAGCTATTTCTTTGATAAATTAACAGAGGAACGA
GATGCTTTTGAATTGTCTCTGGAATTCTCAGCGAACTTGGACTATAGCCTACCCAAAGTG
GAGCTATAG

Protein sequence:

MGIKGLWTVLAPYSEKISLHEISGQTVAIDLAGWVCDSQNVTDYYIQPKLYLRNLFFRTL
YLVLSDVNPIFVLEGDAPELKRDVMAARNALQFKGAAPKATTEKTKQTNITRTRFKGVLK
ECENLLRTMGVRCVKGRGEAEAACARLNAEGLVDAVVSQDSDCFAYGAKKVYRNFSVSSA
GGGGATHGSVDVYDAVKMFNNKGFGRNKMVALALLCGSDYGVGVCGSSKTTVVSFLHTVP
EDQVISRLLSWVSDPQHYEAQSRWVSVPGRCDRCGHAGRTHLKKGCSTCATHQGCNDTGH
KSKLCDVKRELLLRNKALSSGIPFPEPKVMKEFLNSTPEDIDLDTLKIPKPSLIQFVKIM
SYKLDWPQRYCVEKFLPLLTKWHLQDNVASRTLRPLEIRKKRHPKGVPSYEVVWGDIDGH
YEGLIPDEQLEEDEDVSAPWVTIERQDLMFKYYPGIVEKYEESIKKPPKEKKTRGRKKKE
ENENSEEVHKTKRKYTRKPKPITDFMTTLNRSLKNLSLSKKDTELNSSKVSVNVNKLKRK
IKNNGKVKTKNTIESYLKPCKKKKSSTMTSLISGSQNKSKSFKLNLGNKENMEPKKHLLS
MFNDSSSDVEANDLSDIVEKIVSRSAPSTKAKVDSNYVKLIFENKLDRKSILTQRFDQKN
CSTPISSPIRKSCQQRRTSKSSISEAVDTSYFFDKLTEERDAFELSLEFSANLDYSLPKV
EL