DPGLEAN09668 in OGS1.0

New model in OGS2.0: DPOGS213985
Genomic Position: scaffold295:+ 15887-20105
CDS Length: 2058
Paired RNAseq reads: 683
Single RNAseq reads: 3012
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA008208 (2e-29)
Best Drosophila hit: XRCC1 (2e-15)
Best Human hit: DNA repair protein XRCC1 (1e-21)
Best NR hit (blastp): hypothetical protein TcasGA2_TC007398 [Tribolium castaneum] (2e-52)
Best NR hit (blastx): hypothetical protein TcasGA2_TC007398 [Tribolium castaneum] (3e-56)
GeneOntology terms:
GO:0003684 damaged DNA binding
GO:0005622 intracellular
GO:0000012 single strand break repair
GO:0005634 nucleus
InterPro families:
IPR002706 DNA-repair protein Xrcc1, N-terminal
IPR001357 BRCT
IPR008979 Galactose-binding domain-like
Orthology group: MCL14199

Nucleotide sequence:

ATGCCTCGAGTTAAAATTGATTACGTTGTGAGCATGAGCAGTGAGGACCCTGAAAATCCG
GCAAACAATTTATTATCGTGGGAAATAAATAAAAAGAAATGGCTTTGTAAGACGGGGGAG
ACCTCTTGTTCAGTAGTTCTACAGCTGACTAAGGCTGTCCAGATAGAATCGATCACGATT
GGAACATACCACACGTCTATGTTAGAGGTGTTAGTAGGATCATCAGAGAAGCCCAATGAA
ACCTTTGAGGTGTTAGTCCCGAGTTGTGTGCTGTGTTCTCCACGAGAGGCTCGCGGAGCA
CCAGTTGAGAGAGTGAAGAGCTTTACACGAGATGAACTGACATCTGTCCGACAGAGACGC
TGGGACCGATTGAGACTAGTCTGCTCACAACCTTACAACAGACACTGCAAGTATGGAATC
TCATTTGTTCATATCTTTGAACCGGAAAGTCCAACTCTGTCCGGTCACACAGCCTTGTCC
ATCTCTCGCACGTTCCGCCTCGAGGAGCTTGGTTCAGAGGATGAAGAGTTCCGTCCTGGG
GAACTGTTCCATAAACACAAACAAGACCAGAAAACACATAATAGTACTGACGCACAAATC
AGACAAGCTACGTCGCGGGCACTGAACAACATAGGCGACTCCTCCACCAGATTAACAAAG
ACGCCAATATCGAAAACTAGCAACAGACCGTCTGATCAAAGCTCGAATTATTCCACTCGA
GAAAAGAGGAGTCTCATGTATACAGAGGATGACGAGCAACCACACCAGAAAATAGATAGA
GTTATAGAAAGACATGGGAGAGAGAAACAGAGAGAAGATGGAAAAAAGAAAACTGACCAG
GAGGCCAAGAAGAAGAAGACCGGCAGTAAGAGAGAAGAAAGTAAGGAAGATGAGAAGAAC
AAGGAGACAAAACATACAGACAATCGGACTCAGGACCAGACACATACTACATTAATGAAT
TCCACTAAAAGGAAACACTCCCAGGAAGCCCCATCCCGGGCTCCGGCCCGTCCCCTGTCT
TCTCTTCTGTCGGATGTGGTGTTCTCTATTTCGGGATACGTGAACCCGCGTCGAGCGTCG
GTCCGCGCGGCCGCTCTCCGGATGGGTGCGCACTACACGCCCGACGTCACCGCCGACTGC
ACACATCTCATCTGTGCCTTCCCCAACACTCCAAAACTCCGCCTGGTGCGGGGAAGTGTG
GCCGTCGTCAAGGCCGAGTGGGTCGAAGACTGTCTGCGCTCGGGGACCAGGCTGAAGGAG
ACAACATACGACACGAGGGGAGGGGCGGGGGGGCGCCACCAGGACAGTGAGAGGACGGGA
GACGGGGGAGGAGGAGGGAGGGGGCGATGTAGTAACGGTGACTCCGCAGAGACGGAGCAT
GACACGGACGACGAAATAGAACAAGTCATGCGACGACAAAAGAGAAAACGACTCAGTGAA
GAGGAAGAAGAGGGAGGGGAGGAAGACCGGGATGTGATGTGCGACACGGACGAGGAGGAC
GGAGAACAGAGGCGGGAGGAGATAGACGCCCGTAAGGTGAGGGGTCGAGTGACAGTCAGG
TCTCAATATAGTGTAACAAGAGTCCCGGAGACGGGAGACGACCTCAGCAGATCGCAGGAC
AGAGGCGTGTGTGTGCAGTCGCTGCCGACGTTCCTGGCGGGAGTGACGTTCTCCCTGTGC
CCGGAGCTACCGGTGTGTGAGCGCGCGCTCCTGGAGCGGTACATCACAGCCTACGGCGGG
GTGGTGCTGCAGGTCGGTCTGGTCTGCTGTGCAACGTCAAATATCACTCAGGGGAAGAGG
ACGAAGGAGGCAAGGCGTGAGATTCACGAAGGCAAACTGCGGATGATGCGGATGGTAACA
GTATGGTGTCGGCGGGAGGAGGATCGCGGAAAAAACTCTGGAACGGAGCTTGGTCAAGTA
GGTCGACGAGCGCAGCACAACATTTTGACTGAACACTCTCTTCCGACTCCGTACGCAAGA
AAGAAAATTGACGATAAGTTTATTACTGCCTGGAAACTTTTATTGGATGATAATATTCTG
CGTCGTATAGAGAAGTAA

Protein sequence:

MPRVKIDYVVSMSSEDPENPANNLLSWEINKKKWLCKTGETSCSVVLQLTKAVQIESITI
GTYHTSMLEVLVGSSEKPNETFEVLVPSCVLCSPREARGAPVERVKSFTRDELTSVRQRR
WDRLRLVCSQPYNRHCKYGISFVHIFEPESPTLSGHTALSISRTFRLEELGSEDEEFRPG
ELFHKHKQDQKTHNSTDAQIRQATSRALNNIGDSSTRLTKTPISKTSNRPSDQSSNYSTR
EKRSLMYTEDDEQPHQKIDRVIERHGREKQREDGKKKTDQEAKKKKTGSKREESKEDEKN
KETKHTDNRTQDQTHTTLMNSTKRKHSQEAPSRAPARPLSSLLSDVVFSISGYVNPRRAS
VRAAALRMGAHYTPDVTADCTHLICAFPNTPKLRLVRGSVAVVKAEWVEDCLRSGTRLKE
TTYDTRGGAGGRHQDSERTGDGGGGGRGRCSNGDSAETEHDTDDEIEQVMRRQKRKRLSE
EEEEGGEEDRDVMCDTDEEDGEQRREEIDARKVRGRVTVRSQYSVTRVPETGDDLSRSQD
RGVCVQSLPTFLAGVTFSLCPELPVCERALLERYITAYGGVVLQVGLVCCATSNITQGKR
TKEARREIHEGKLRMMRMVTVWCRREEDRGKNSGTELGQVGRRAQHNILTEHSLPTPYAR
KKIDDKFITAWKLLLDDNILRRIEK
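
The reported CDS length can be cross-checked against the length of the protein sequence. The sketch below is a minimal illustration in Python, assuming the CDS length includes the terminal stop codon (the nucleotide sequence above ends in TAA). The function name `expected_protein_length` is hypothetical and not part of any database tooling.

```python
def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # The stop codon is counted in the CDS but encodes no amino acid.
    return codons - 1 if includes_stop else codons


# CDS Length taken from the record above (assumed to include the stop codon).
print(expected_protein_length(2058))  # → 685
```

The printed value can be compared with the residue count of the protein sequence listed in the record; a mismatch would suggest a truncated sequence or a CDS length that excludes the stop codon.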