New model in OGS2.0 | DPOGS200726  |
---|---|
Genomic Position | scaffold402:- 5407-9440 |
See gene structure | |
CDS Length | 1893 |
Paired RNAseq reads   | 3337 |
Single RNAseq reads   | 8281 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001124 (2e-167) |
Best Drosophila hit   | recombination repair protein 1, isoform A (3e-96) |
Best Human hit | DNA-(apurinic or apyrimidinic site) lyase (2e-85) |
Best NR hit (blastp)   | ap endonuclease [Aedes aegypti] (2e-112) |
Best NR hit (blastx)   | ap endonuclease [Aedes aegypti] (1e-108) |
GeneOntology terms    | GO:0006281 DNA repair GO:0003906 DNA-(apurinic or apyrimidinic site) lyase activity GO:0005634 nucleus GO:0008311 double-stranded DNA specific 3'-5' exodeoxyribonuclease activity GO:0004519 endonuclease activity GO:0003677 DNA binding GO:0005622 intracellular GO:0008408 3'-5' exonuclease activity GO:0006974 response to DNA damage stimulus |
InterPro families    | IPR004808 Exodeoxyribonuclease III xth IPR000097 AP endonuclease, family 1 IPR020847 AP endonuclease, family 1, binding site IPR020848 AP endonuclease, family 1, conserved site IPR005135 Endonuclease/exonuclease/phosphatase |
Orthology group | MCL11144 |
Nucleotide sequence:
ATGGCGCCACGTACAGCTAAGGCTAAGAAAAATGCTGATGTTAAAGTTTCTGAGGGTGAA
GTTCCAAAAAAAGGTAGAGGGAAAGCTAAAAATGTGACAGAACAAGAACAAGTTGTCCCT
GAAGTTGTTATTGATAAAAATCCCTCAGTAGAAAAAAAAACAAAAAGGGGTAAAAATGCA
GTTGTTGAAAACAATGATGAGGCACAAATTATAGTCAAAGATGCACCACCAGCTAAAAAG
GGAAGGAAGAAGGCTGTTGAGGAACCCATTCCTGAATCAAACTCTCTAAATGGAGATAAC
TCGGATGAAAACGAGCCAGCAGTTCAATCGGATGATAACGAACAGAATGAAGAACATTCT
GAATCAAATGATAATGCTGAAGACACTCAAGCCAGTGCAGGCAAAGGTAGGAAAAAAGTT
AACAAGAAAGAACCTATAGAAAAAGAGGTTAAACCTAAAGAAACTGGCAGAGGAAAAAAA
AATGTGAAGCAGGAAAATATTACAGCAAAGAAAGATGCAGAAGAAAAACCTAAAGCTAGA
GGGAGAGGACGTAAAGTACAGCCCAAGGCAGAAGACGTTCAAGATAATGATGAAGTAGAA
GTTAAACCTAAAGGTAGAAAGAAGGCTCAACCAAAAGTAGTTGAAAAAGTACAGAAAGCT
GATGATGAAGATGATGAACAAGAAGAAATACCAGATGAAGAAGCTGAAGAAGAGAAGCCA
GTTGAAGAAGTAAAGAAGAAAGGTCGGAAGAATGCTGACAAAAAGACAACACAGAAAGAA
GACTCCGAACAAAAAGATGATGAAGTTAAAGAACAAATGCCAGTCAGCAAAAGTCGGAAG
GGTGCTAAGAAAGATGAAAAAGCAAAAGGAGACACAAAAGACGATGATAAAGATGATGTA
GCAGAATCCAAACCGGTTAAAGGGAAACGTGGTCAAAAGAAAGCTGAAGCCAGCGAGCTA
CAAGATACGGGGGAACCGATAAACAAACGTCGCCGTAAAGATGACAAGGCCACCGAGGAC
AATAAAAAGAAAACTAAAGCCGCAACGGACTATGAATCTATTGATTTCTCTAACAAATCA
CAGACGTCTCAGGGTAAAGAGTGGAATTTTAAAATAGCCAATTGGAACGTGGACGGCATT
AGGGCTTGGATGGGAAAAGGCGGATTGGACTACCTTAAATACGAAAAACCGGATATATTG
TGTCTACAGGAAACGAAATGCGCTCTAGATAAATTGCCGTACGAAGTGAAAAATATACCC
GGATATCACGCGTACTGGCTGTCTAGTGATAAAGATGGCTACGCCGGCGTAGGAATTTAC
ACTACAAAGTTAGCTATGAATGTACAATACGGTTTACAAAACGAGGAATTGGATTCCGAA
GGTCGGATAATAACGGCTGAGTACGAACAATTCTACTTAATATGCACGTACGTACCTAAC
GCGGGACGAAAATTAGTTTCACTGCCCAAGAGATTAAAGTGGAACGACGAGTTCAGGGAA
CACGTTAAGGCGCTGGACGAAAAGAAACCTGTCATTATATGCGGTGACATGAACGTGGCT
CACAACGAAATAGATCTAACGAATCCAAAAACGAATAAGAAGAACGCCGGCTTCACGGAG
GAGGAACGAGCTGGTATGACGGAGCTGCTCGGGGACGGATTCGTAGACACGTTCAGACAT
TTTCATCCTGAGAAAGTCGCTTATACGTTCTGGAGTTACATGGCCAATAGTAGAGCTAAG
AACGTCGGATGGCGTTTGGACTACTTCATCGTGTCAGAGAGACTTTTACCGTCTATATGC
GACAGTTCGATCCGCGGCGAGGTGTATGGGAGTGACCACTGTCCTATAGCACTCTACCTA
CACTTAACGAGCGCCGACAAACCCAAGGAATAG
Protein sequence:
MAPRTAKAKKNADVKVSEGEVPKKGRGKAKNVTEQEQVVPEVVIDKNPSVEKKTKRGKNA
VVENNDEAQIIVKDAPPAKKGRKKAVEEPIPESNSLNGDNSDENEPAVQSDDNEQNEEHS
ESNDNAEDTQASAGKGRKKVNKKEPIEKEVKPKETGRGKKNVKQENITAKKDAEEKPKAR
GRGRKVQPKAEDVQDNDEVEVKPKGRKKAQPKVVEKVQKADDEDDEQEEIPDEEAEEEKP
VEEVKKKGRKNADKKTTQKEDSEQKDDEVKEQMPVSKSRKGAKKDEKAKGDTKDDDKDDV
AESKPVKGKRGQKKAEASELQDTGEPINKRRRKDDKATEDNKKKTKAATDYESIDFSNKS
QTSQGKEWNFKIANWNVDGIRAWMGKGGLDYLKYEKPDILCLQETKCALDKLPYEVKNIP
GYHAYWLSSDKDGYAGVGIYTTKLAMNVQYGLQNEELDSEGRIITAEYEQFYLICTYVPN
AGRKLVSLPKRLKWNDEFREHVKALDEKKPVIICGDMNVAHNEIDLTNPKTNKKNAGFTE
EERAGMTELLGDGFVDTFRHFHPEKVAYTFWSYMANSRAKNVGWRLDYFIVSERLLPSIC
DSSIRGEVYGSDHCPIALYLHLTSADKPKE