DPGLEAN08000 in OGS1.0

New model in OGS2.0  DPOGS200726
Genomic Position  scaffold402:- 5407-9440
CDS Length  1893
Paired RNAseq reads  3337
Single RNAseq reads  8281
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001124 (2e-167)
Best Drosophila hit  recombination repair protein 1, isoform A (3e-96)
Best Human hit  DNA-(apurinic or apyrimidinic site) lyase (2e-85)
Best NR hit (blastp)  ap endonuclease [Aedes aegypti] (2e-112)
Best NR hit (blastx)  ap endonuclease [Aedes aegypti] (1e-108)
GeneOntology terms
GO:0006281 DNA repair
GO:0003906 DNA-(apurinic or apyrimidinic site) lyase activity
GO:0005634 nucleus
GO:0008311 double-stranded DNA specific 3'-5' exodeoxyribonuclease activity
GO:0004519 endonuclease activity
GO:0003677 DNA binding
GO:0005622 intracellular
GO:0008408 3'-5' exonuclease activity
GO:0006974 response to DNA damage stimulus
InterPro families
IPR004808 Exodeoxyribonuclease III xth
IPR000097 AP endonuclease, family 1
IPR020847 AP endonuclease, family 1, binding site
IPR020848 AP endonuclease, family 1, conserved site
IPR005135 Endonuclease/exonuclease/phosphatase
Orthology group  MCL11144

Nucleotide sequence:

ATGGCGCCACGTACAGCTAAGGCTAAGAAAAATGCTGATGTTAAAGTTTCTGAGGGTGAA
GTTCCAAAAAAAGGTAGAGGGAAAGCTAAAAATGTGACAGAACAAGAACAAGTTGTCCCT
GAAGTTGTTATTGATAAAAATCCCTCAGTAGAAAAAAAAACAAAAAGGGGTAAAAATGCA
GTTGTTGAAAACAATGATGAGGCACAAATTATAGTCAAAGATGCACCACCAGCTAAAAAG
GGAAGGAAGAAGGCTGTTGAGGAACCCATTCCTGAATCAAACTCTCTAAATGGAGATAAC
TCGGATGAAAACGAGCCAGCAGTTCAATCGGATGATAACGAACAGAATGAAGAACATTCT
GAATCAAATGATAATGCTGAAGACACTCAAGCCAGTGCAGGCAAAGGTAGGAAAAAAGTT
AACAAGAAAGAACCTATAGAAAAAGAGGTTAAACCTAAAGAAACTGGCAGAGGAAAAAAA
AATGTGAAGCAGGAAAATATTACAGCAAAGAAAGATGCAGAAGAAAAACCTAAAGCTAGA
GGGAGAGGACGTAAAGTACAGCCCAAGGCAGAAGACGTTCAAGATAATGATGAAGTAGAA
GTTAAACCTAAAGGTAGAAAGAAGGCTCAACCAAAAGTAGTTGAAAAAGTACAGAAAGCT
GATGATGAAGATGATGAACAAGAAGAAATACCAGATGAAGAAGCTGAAGAAGAGAAGCCA
GTTGAAGAAGTAAAGAAGAAAGGTCGGAAGAATGCTGACAAAAAGACAACACAGAAAGAA
GACTCCGAACAAAAAGATGATGAAGTTAAAGAACAAATGCCAGTCAGCAAAAGTCGGAAG
GGTGCTAAGAAAGATGAAAAAGCAAAAGGAGACACAAAAGACGATGATAAAGATGATGTA
GCAGAATCCAAACCGGTTAAAGGGAAACGTGGTCAAAAGAAAGCTGAAGCCAGCGAGCTA
CAAGATACGGGGGAACCGATAAACAAACGTCGCCGTAAAGATGACAAGGCCACCGAGGAC
AATAAAAAGAAAACTAAAGCCGCAACGGACTATGAATCTATTGATTTCTCTAACAAATCA
CAGACGTCTCAGGGTAAAGAGTGGAATTTTAAAATAGCCAATTGGAACGTGGACGGCATT
AGGGCTTGGATGGGAAAAGGCGGATTGGACTACCTTAAATACGAAAAACCGGATATATTG
TGTCTACAGGAAACGAAATGCGCTCTAGATAAATTGCCGTACGAAGTGAAAAATATACCC
GGATATCACGCGTACTGGCTGTCTAGTGATAAAGATGGCTACGCCGGCGTAGGAATTTAC
ACTACAAAGTTAGCTATGAATGTACAATACGGTTTACAAAACGAGGAATTGGATTCCGAA
GGTCGGATAATAACGGCTGAGTACGAACAATTCTACTTAATATGCACGTACGTACCTAAC
GCGGGACGAAAATTAGTTTCACTGCCCAAGAGATTAAAGTGGAACGACGAGTTCAGGGAA
CACGTTAAGGCGCTGGACGAAAAGAAACCTGTCATTATATGCGGTGACATGAACGTGGCT
CACAACGAAATAGATCTAACGAATCCAAAAACGAATAAGAAGAACGCCGGCTTCACGGAG
GAGGAACGAGCTGGTATGACGGAGCTGCTCGGGGACGGATTCGTAGACACGTTCAGACAT
TTTCATCCTGAGAAAGTCGCTTATACGTTCTGGAGTTACATGGCCAATAGTAGAGCTAAG
AACGTCGGATGGCGTTTGGACTACTTCATCGTGTCAGAGAGACTTTTACCGTCTATATGC
GACAGTTCGATCCGCGGCGAGGTGTATGGGAGTGACCACTGTCCTATAGCACTCTACCTA
CACTTAACGAGCGCCGACAAACCCAAGGAATAG

Protein sequence:

MAPRTAKAKKNADVKVSEGEVPKKGRGKAKNVTEQEQVVPEVVIDKNPSVEKKTKRGKNA
VVENNDEAQIIVKDAPPAKKGRKKAVEEPIPESNSLNGDNSDENEPAVQSDDNEQNEEHS
ESNDNAEDTQASAGKGRKKVNKKEPIEKEVKPKETGRGKKNVKQENITAKKDAEEKPKAR
GRGRKVQPKAEDVQDNDEVEVKPKGRKKAQPKVVEKVQKADDEDDEQEEIPDEEAEEEKP
VEEVKKKGRKNADKKTTQKEDSEQKDDEVKEQMPVSKSRKGAKKDEKAKGDTKDDDKDDV
AESKPVKGKRGQKKAEASELQDTGEPINKRRRKDDKATEDNKKKTKAATDYESIDFSNKS
QTSQGKEWNFKIANWNVDGIRAWMGKGGLDYLKYEKPDILCLQETKCALDKLPYEVKNIP
GYHAYWLSSDKDGYAGVGIYTTKLAMNVQYGLQNEELDSEGRIITAEYEQFYLICTYVPN
AGRKLVSLPKRLKWNDEFREHVKALDEKKPVIICGDMNVAHNEIDLTNPKTNKKNAGFTE
EERAGMTELLGDGFVDTFRHFHPEKVAYTFWSYMANSRAKNVGWRLDYFIVSERLLPSIC
DSSIRGEVYGSDHCPIALYLHLTSADKPKE
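
Consistency check (illustrative sketch, not part of the original record): the CDS Length of 1893 corresponds to 631 codons, the last of which is the stop codon TAG, so the protein sequence should contain 630 residues. The Python snippet below is a minimal sketch that assumes the standard genetic code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration. It translates the first 30 nucleotides and the final line of the nucleotide sequence above so the result can be compared with the start and end of the protein sequence.

```python
from itertools import product

# Standard genetic code (assumed here), written in the conventional
# T, C, A, G codon order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame nucleotide string, keeping '*' for stop codons."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides and the complete last line of the nucleotide sequence.
cds_start = "ATGGCGCCACGTACAGCTAAGGCTAAGAAA"
cds_end = "CACTTAACGAGCGCCGACAAACCCAAGGAATAG"

print(translate(cds_start))  # MAPRTAKAKK
print(translate(cds_end))    # HLTSADKPKE*
print(1893 // 3 - 1)         # 630 residues expected before the stop codon
```

Both translated fragments agree with the start (MAPRTAKAKK) and end (HLTSADKPKE) of the protein sequence listed above. Passing the full nucleotide sequence to `translate` would allow the same comparison over all 630 residues.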