DPGLEAN16727 in OGS1.0

New model in OGS2.0  DPOGS210296
Genomic Position  scaffold1729:+ 1776-7173
CDS Length  1275
Paired RNAseq reads  416
Single RNAseq reads  1199
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004227 (5e-148)
Best Drosophila hit  flap endonuclease 1 (4e-124)
Best Human hit  flap endonuclease 1 (2e-106)
Best NR hit (blastp)  flap endonuclease-1 [Aedes aegypti] (1e-161)
Best NR hit (blastx)  flap endonuclease-1 [Aedes aegypti] (3e-137)
GeneOntology terms
GO:0005634 nucleus
GO:0048256 flap endonuclease activity
GO:0006281 DNA repair
GO:0004519 endonuclease activity
GO:0003677 DNA binding
InterPro families
IPR020045 5'-3' exonuclease, C-terminal subdomain
IPR006084 DNA repair protein (XPGC)/yeast Rad
IPR006085 XPG N-terminal
IPR006086 XPG/RAD2 endonuclease
IPR008918 Helix-hairpin-helix motif, class 2
IPR019974 XPG conserved site
Orthology group  MCL13716

Nucleotide sequence:

ATGGGTATTTTAGGATTATCAAAGTTGATTGCAGATATTGCTCCAATGGCTGTAAAAGAA
ACAGAGATAAAAATTATTTCGGTTGGTAGGAAAGTTGCCATCGATGCATCTATGAGCTTG
TATCAATTCTTAATTGCTGTAAGAAGTCAAGGCGCTCAGCTGACGTCCGTTGATGGTGAA
ACAACATCACACCTAATGGGTACATTCTACAGAACGATTCGTCTCATAGAAGATGGTATC
AAGCCTGTGTATGTCTTTGATGGTAAACCGCCTGATATGAAGTCACATCAATTGAACAAG
AGGGCCGAGAGACGAGAGGAAGCTGAGAAAGAACTCCAGAAGGCTACCGAGGCTGGTGAT
ACGGCATCTATTGACAAGTTCAACCGTCGGTTGGTGAAGGTGACTCAGCAACACGGTGCC
GAAGCTCGGCAGTTGTTGAAGCTTATGGGGATACCCGTGGTGGAGGCTCCGTGTGAAGCT
GAGGCACAATGCGCTGAATTAACTTCTGAAGGTAACCTCGTAGACGGTTTGACGAATCCC
TTACTTCGAAGAGGTCCGATCCCTGCAGCGGCTCGAGCTAGACTCCCTGTAACACACACC
GAGGTGATCTCGGGCCCCCCAGTTGGAGGGGTCCCAGTCAAAGGTGGTAAGGTGTATGCT
GTAGCCACTGAGGATATGGATGCTTTGACCTTCGGAGCGAACGTGCTGTTGAGGCACCTC
ACCTTCTCCGAGGCGAGGAAGATGCCAGTACAGGAGTTCCACCTGGACCAGGTGCTGAGA
GGATTGGAATTGGAACAGACAGAGTTCATTGACCTCTGCATTCTGTTGGGTTGTGATTAC
TGCGGCTCCATCAAAGGGATCGGACCGAAACGGGCCATCGAACTCATCAAGCAACACCGC
AGTATAGAACAGGTCCTTCACAATATCGACACAAAGAAGTACAGTCCGCCGGAGAATTGG
GAATATGAAAACGCTCGGAGACTGTTCCAGCAACCAGAAGTTACCGAGGCGAAGGATGTC
GAGTTAAAATGGTCGGATCCTGACGAGGAAGGTCTGGTGAAGTTCCTCTGTGGAGACAAA
CAGTTCAACGAGGAGCGCGTCAGGAACGGGGCCAAGAAACTCATGAAGGCGCGCACCGGA
ACCACGCAGGGCAGGCTGGATGGATTCTTCAAGGTGTTATCAACAACACCAAACCCAAAA
AGGAAAGCGGAGGAAGATAAAAAGAGTGCCAACAAGAAAGTTAAAACAGCTGGAAGGGGG
CGGAAACCGAAATAA

Protein sequence:

MGILGLSKLIADIAPMAVKETEIKIISVGRKVAIDASMSLYQFLIAVRSQGAQLTSVDGE
TTSHLMGTFYRTIRLIEDGIKPVYVFDGKPPDMKSHQLNKRAERREEAEKELQKATEAGD
TASIDKFNRRLVKVTQQHGAEARQLLKLMGIPVVEAPCEAEAQCAELTSEGNLVDGLTNP
LLRRGPIPAAARARLPVTHTEVISGPPVGGVPVKGGKVYAVATEDMDALTFGANVLLRHL
TFSEARKMPVQEFHLDQVLRGLELEQTEFIDLCILLGCDYCGSIKGIGPKRAIELIKQHR
SIEQVLHNIDTKKYSPPENWEYENARRLFQQPEVTEAKDVELKWSDPDEEGLVKFLCGDK
QFNEERVRNGAKKLMKARTGTTQGRLDGFFKVLSTTPNPKRKAEEDKKSANKKVKTAGRG
RKPK
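
The reported CDS length and the two sequences above can be checked against each other: 1275 nt is 425 codons, which should correspond to 424 residues plus a stop codon. The following is a minimal sketch of that check, assuming the standard genetic code applies; the helper name `translate` and the constant names are illustrative and not part of the database record. Only the first and last stretches of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard (nuclear) code applies to this gene model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 24 nt of the nucleotide sequence in the record.
first = translate("ATGGGTATTTTAGGATTATCAAAG")
# Final 15 nt of the nucleotide sequence, including the TAA stop.
last = translate("CGGAAACCGAAATAA")
# Codons in a 1275 nt CDS, minus the stop codon.
expected_protein_length = 1275 // 3 - 1

print(first, last, expected_protein_length)
```

Passing the full nucleotide sequence to `translate` in the same way gives a direct comparison with the protein sequence listed in the record.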