New model in OGS2.0 | DPOGS204318  |
---|---|
Genomic Position | scaffold644:+ 137596-142720 |
See gene structure | |
CDS Length | 2169 |
Paired RNAseq reads   | 312 |
Single RNAseq reads   | 741 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007587 (0.0) |
Best Drosophila hit   | XPG-like endonuclease (3e-102) |
Best Human hit | flap endonuclease GEN homolog 1 (4e-55) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC006044 [Tribolium castaneum] (8e-122) |
Best NR hit (blastx)   | PREDICTED: similar to XPG-like endonuclease CG10670-PA [Tribolium castaneum] (5e-112) |
GeneOntology terms    | GO:0048256 flap endonuclease activity GO:0003684 damaged DNA binding GO:0005634 nucleus GO:0006284 base-excision repair GO:0000014 single-stranded DNA specific endodeoxyribonuclease activity GO:0004519 endonuclease activity GO:0000737 DNA catabolic process, endonucleolytic GO:0000738 DNA catabolic process, exonucleolytic GO:0004520 endodeoxyribonuclease activity GO:0008310 single-stranded DNA specific 3'-5' exodeoxyribonuclease activity GO:0035312 5'-3' exodeoxyribonuclease activity GO:0008311 double-stranded DNA specific 3'-5' exodeoxyribonuclease activity |
InterPro families    | IPR020045 5'-3' exonuclease, C-terminal subdomain IPR006084 DNA repair protein (XPGC)/yeast Rad IPR006085 XPG N-terminal IPR006086 XPG/RAD2 endonuclease |
Orthology group | MCL15318 |
Nucleotide sequence:
ATGGGTATAAAGGGGCTATGGACGGTGTTAGCTCCATACTCTGAGAAGATATCATTACAC
GAAATCAGTGGTCAAACTGTTGCCATAGACTTAGCTGGATGGGTTTGTGATAGCCAAAAT
GTTACTGATTATTATATTCAGCCTAAGCTATATCTAAGAAATCTATTCTTCCGAACTCTT
TACTTAGTGCTAAGTGATGTAAATCCTATATTTGTGCTTGAGGGTGATGCTCCAGAACTT
AAGAGAGATGTTATGGCTGCTAGAAATGCATTGCAGTTTAAAGGTGCAGCCCCTAAAGCT
ACCACAGAGAAAACAAAACAAACTAACATAACAAGAACAAGGTTCAAAGGGGTATTAAAA
GAGTGTGAAAATCTCTTAAGGACAATGGGAGTGAGATGTGTAAAAGGTAGAGGAGAAGCT
GAAGCTGCTTGTGCTAGATTAAATGCTGAAGGTTTAGTAGATGCTGTTGTATCCCAAGAT
TCTGACTGCTTCGCATATGGAGCTAAGAAAGTGTATCGCAACTTCAGTGTATCAAGCGCT
GGTGGTGGAGGAGCCACACATGGCTCGGTGGACGTGTATGATGCTGTCAAGATGTTTAAT
AATAAGGGGTTTGGACGTAACAAGATGGTTGCGTTAGCATTGCTCTGTGGTTCTGACTAC
GGAGTTGGTGTCTGCGGGTCGTCTAAAACAACCGTTGTTTCTTTTCTCCACACTGTCCCA
GAAGATCAAGTTATATCGAGGTTATTATCGTGGGTGAGTGATCCACAGCACTACGAGGCG
CAGTCCCGCTGGGTGTCCGTCCCGGGTCGCTGTGACCGCTGCGGGCACGCGGGCCGCACT
CACCTCAAGAAGGGTTGCTCCACGTGCGCCACTCACCAAGGATGTAATGATACTGGACAT
AAATCAAAATTATGTGATGTAAAACGTGAACTATTGCTTCGCAATAAAGCCTTGTCGTCC
GGCATACCGTTTCCGGAGCCGAAAGTTATGAAGGAATTCCTGAACAGTACACCAGAAGAT
ATAGATCTAGACACTTTGAAGATTCCTAAACCCAGTTTAATACAATTCGTGAAAATTATG
TCGTACAAATTGGATTGGCCGCAGCGGTACTGCGTTGAAAAGTTTCTGCCATTATTAACG
AAATGGCACCTCCAGGATAATGTTGCGTCTAGGACGTTACGACCGCTTGAAATCAGAAAG
AAAAGACATCCGAAAGGCGTACCGAGTTATGAAGTTGTATGGGGGGATATTGATGGGCAT
TATGAAGGACTAATACCTGACGAGCAGTTGGAAGAGGACGAAGATGTTTCAGCACCTTGG
GTGACGATCGAGAGGCAGGATTTGATGTTTAAATATTATCCCGGTATAGTAGAGAAGTAT
GAAGAGTCTATAAAGAAGCCCCCCAAAGAAAAGAAAACAAGGGGCAGGAAAAAGAAGGAA
GAAAATGAAAATTCTGAAGAAGTTCACAAGACAAAGAGGAAATATACTAGAAAACCAAAA
CCAATTACAGACTTTATGACTACTCTCAATAGGTCTTTGAAAAATCTAAGTCTGTCTAAG
AAAGATACAGAACTGAATTCCAGTAAAGTGAGCGTTAATGTAAATAAACTGAAGAGAAAA
ATCAAAAATAACGGTAAAGTGAAGACAAAAAATACAATAGAAAGCTATTTGAAACCGTGC
AAAAAGAAAAAGTCTTCGACTATGACCAGTTTGATAAGTGGAAGTCAAAACAAGTCTAAG
TCCTTTAAACTGAATTTGGGAAATAAAGAAAATATGGAACCGAAAAAGCATTTGTTGAGC
ATGTTCAATGATAGTTCAAGTGATGTAGAGGCAAACGATTTATCGGATATTGTAGAAAAA
ATCGTATCGAGATCGGCTCCTTCAACTAAAGCAAAAGTGGACAGTAATTACGTGAAATTG
ATATTTGAGAATAAACTTGATAGGAAATCTATTCTGACGCAAAGATTTGATCAAAAAAAT
TGTTCGACTCCTATAAGCAGTCCGATAAGGAAAAGTTGTCAGCAGAGAAGGACGTCCAAA
TCCTCAATATCGGAAGCTGTAGACACCAGCTATTTCTTTGATAAATTAACAGAGGAACGA
GATGCTTTTGAATTGTCTCTGGAATTCTCAGCGAACTTGGACTATAGCCTACCCAAAGTG
GAGCTATAG
Protein sequence:
MGIKGLWTVLAPYSEKISLHEISGQTVAIDLAGWVCDSQNVTDYYIQPKLYLRNLFFRTL
YLVLSDVNPIFVLEGDAPELKRDVMAARNALQFKGAAPKATTEKTKQTNITRTRFKGVLK
ECENLLRTMGVRCVKGRGEAEAACARLNAEGLVDAVVSQDSDCFAYGAKKVYRNFSVSSA
GGGGATHGSVDVYDAVKMFNNKGFGRNKMVALALLCGSDYGVGVCGSSKTTVVSFLHTVP
EDQVISRLLSWVSDPQHYEAQSRWVSVPGRCDRCGHAGRTHLKKGCSTCATHQGCNDTGH
KSKLCDVKRELLLRNKALSSGIPFPEPKVMKEFLNSTPEDIDLDTLKIPKPSLIQFVKIM
SYKLDWPQRYCVEKFLPLLTKWHLQDNVASRTLRPLEIRKKRHPKGVPSYEVVWGDIDGH
YEGLIPDEQLEEDEDVSAPWVTIERQDLMFKYYPGIVEKYEESIKKPPKEKKTRGRKKKE
ENENSEEVHKTKRKYTRKPKPITDFMTTLNRSLKNLSLSKKDTELNSSKVSVNVNKLKRK
IKNNGKVKTKNTIESYLKPCKKKKSSTMTSLISGSQNKSKSFKLNLGNKENMEPKKHLLS
MFNDSSSDVEANDLSDIVEKIVSRSAPSTKAKVDSNYVKLIFENKLDRKSILTQRFDQKN
CSTPISSPIRKSCQQRRTSKSSISEAVDTSYFFDKLTEERDAFELSLEFSANLDYSLPKV
EL