New model in OGS2.0 | DPOGS213985  |
---|---|
Genomic Position | scaffold295:+ 15887-20105 |
CDS Length | 2058 bp |
Paired RNAseq reads   | 683 |
Single RNAseq reads   | 3012 |
Best Bombyx hit | BGIBMGA008208 (2e-29) |
Best Drosophila hit   | XRCC1 (2e-15) |
Best Human hit | DNA repair protein XRCC1 (1e-21) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC007398 [Tribolium castaneum] (2e-52) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC007398 [Tribolium castaneum] (3e-56) |
GeneOntology terms    | GO:0003684 damaged DNA binding; GO:0005622 intracellular; GO:0000012 single strand break repair; GO:0005634 nucleus |
InterPro families    | IPR002706 DNA-repair protein Xrcc1, N-terminal; IPR001357 BRCT; IPR008979 Galactose-binding domain-like |
Orthology group | MCL14199 |
Nucleotide sequence:
ATGCCTCGAGTTAAAATTGATTACGTTGTGAGCATGAGCAGTGAGGACCCTGAAAATCCG
GCAAACAATTTATTATCGTGGGAAATAAATAAAAAGAAATGGCTTTGTAAGACGGGGGAG
ACCTCTTGTTCAGTAGTTCTACAGCTGACTAAGGCTGTCCAGATAGAATCGATCACGATT
GGAACATACCACACGTCTATGTTAGAGGTGTTAGTAGGATCATCAGAGAAGCCCAATGAA
ACCTTTGAGGTGTTAGTCCCGAGTTGTGTGCTGTGTTCTCCACGAGAGGCTCGCGGAGCA
CCAGTTGAGAGAGTGAAGAGCTTTACACGAGATGAACTGACATCTGTCCGACAGAGACGC
TGGGACCGATTGAGACTAGTCTGCTCACAACCTTACAACAGACACTGCAAGTATGGAATC
TCATTTGTTCATATCTTTGAACCGGAAAGTCCAACTCTGTCCGGTCACACAGCCTTGTCC
ATCTCTCGCACGTTCCGCCTCGAGGAGCTTGGTTCAGAGGATGAAGAGTTCCGTCCTGGG
GAACTGTTCCATAAACACAAACAAGACCAGAAAACACATAATAGTACTGACGCACAAATC
AGACAAGCTACGTCGCGGGCACTGAACAACATAGGCGACTCCTCCACCAGATTAACAAAG
ACGCCAATATCGAAAACTAGCAACAGACCGTCTGATCAAAGCTCGAATTATTCCACTCGA
GAAAAGAGGAGTCTCATGTATACAGAGGATGACGAGCAACCACACCAGAAAATAGATAGA
GTTATAGAAAGACATGGGAGAGAGAAACAGAGAGAAGATGGAAAAAAGAAAACTGACCAG
GAGGCCAAGAAGAAGAAGACCGGCAGTAAGAGAGAAGAAAGTAAGGAAGATGAGAAGAAC
AAGGAGACAAAACATACAGACAATCGGACTCAGGACCAGACACATACTACATTAATGAAT
TCCACTAAAAGGAAACACTCCCAGGAAGCCCCATCCCGGGCTCCGGCCCGTCCCCTGTCT
TCTCTTCTGTCGGATGTGGTGTTCTCTATTTCGGGATACGTGAACCCGCGTCGAGCGTCG
GTCCGCGCGGCCGCTCTCCGGATGGGTGCGCACTACACGCCCGACGTCACCGCCGACTGC
ACACATCTCATCTGTGCCTTCCCCAACACTCCAAAACTCCGCCTGGTGCGGGGAAGTGTG
GCCGTCGTCAAGGCCGAGTGGGTCGAAGACTGTCTGCGCTCGGGGACCAGGCTGAAGGAG
ACAACATACGACACGAGGGGAGGGGCGGGGGGGCGCCACCAGGACAGTGAGAGGACGGGA
GACGGGGGAGGAGGAGGGAGGGGGCGATGTAGTAACGGTGACTCCGCAGAGACGGAGCAT
GACACGGACGACGAAATAGAACAAGTCATGCGACGACAAAAGAGAAAACGACTCAGTGAA
GAGGAAGAAGAGGGAGGGGAGGAAGACCGGGATGTGATGTGCGACACGGACGAGGAGGAC
GGAGAACAGAGGCGGGAGGAGATAGACGCCCGTAAGGTGAGGGGTCGAGTGACAGTCAGG
TCTCAATATAGTGTAACAAGAGTCCCGGAGACGGGAGACGACCTCAGCAGATCGCAGGAC
AGAGGCGTGTGTGTGCAGTCGCTGCCGACGTTCCTGGCGGGAGTGACGTTCTCCCTGTGC
CCGGAGCTACCGGTGTGTGAGCGCGCGCTCCTGGAGCGGTACATCACAGCCTACGGCGGG
GTGGTGCTGCAGGTCGGTCTGGTCTGCTGTGCAACGTCAAATATCACTCAGGGGAAGAGG
ACGAAGGAGGCAAGGCGTGAGATTCACGAAGGCAAACTGCGGATGATGCGGATGGTAACA
GTATGGTGTCGGCGGGAGGAGGATCGCGGAAAAAACTCTGGAACGGAGCTTGGTCAAGTA
GGTCGACGAGCGCAGCACAACATTTTGACTGAACACTCTCTTCCGACTCCGTACGCAAGA
AAGAAAATTGACGATAAGTTTATTACTGCCTGGAAACTTTTATTGGATGATAATATTCTG
CGTCGTATAGAGAAGTAA
Protein sequence:
MPRVKIDYVVSMSSEDPENPANNLLSWEINKKKWLCKTGETSCSVVLQLTKAVQIESITI
GTYHTSMLEVLVGSSEKPNETFEVLVPSCVLCSPREARGAPVERVKSFTRDELTSVRQRR
WDRLRLVCSQPYNRHCKYGISFVHIFEPESPTLSGHTALSISRTFRLEELGSEDEEFRPG
ELFHKHKQDQKTHNSTDAQIRQATSRALNNIGDSSTRLTKTPISKTSNRPSDQSSNYSTR
EKRSLMYTEDDEQPHQKIDRVIERHGREKQREDGKKKTDQEAKKKKTGSKREESKEDEKN
KETKHTDNRTQDQTHTTLMNSTKRKHSQEAPSRAPARPLSSLLSDVVFSISGYVNPRRAS
VRAAALRMGAHYTPDVTADCTHLICAFPNTPKLRLVRGSVAVVKAEWVEDCLRSGTRLKE
TTYDTRGGAGGRHQDSERTGDGGGGGRGRCSNGDSAETEHDTDDEIEQVMRRQKRKRLSE
EEEEGGEEDRDVMCDTDEEDGEQRREEIDARKVRGRVTVRSQYSVTRVPETGDDLSRSQD
RGVCVQSLPTFLAGVTFSLCPELPVCERALLERYITAYGGVVLQVGLVCCATSNITQGKR
TKEARREIHEGKLRMMRMVTVWCRREEDRGKNSGTELGQVGRRAQHNILTEHSLPTPYAR
KKIDDKFITAWKLLLDDNILRRIEK
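
Consistency check (illustrative sketch, not part of the original record): the snippet below translates the first and last lines of the nucleotide listing with the standard genetic code and checks that the stated CDS length agrees with the protein listing, which has 685 residues when counted. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced here, and the use of the standard code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing in this record.
first_line = "ATGCCTCGAGTTAAAATTGATTACGTTGTGAGCATGAGCAGTGAGGACCCTGAAAATCCG"
last_line = "CGTCGTATAGAGAAGTAA"

print(translate(first_line))  # should match the start of the protein listing
print(translate(last_line))   # should match its end, followed by a stop

# Length check: 685 residues plus one stop codon should account for the CDS.
cds_length = 2058
protein_length = 685
print(cds_length == 3 * (protein_length + 1))
```

To check the whole record, the full nucleotide listing can be passed to `translate` and the result, minus the trailing `*`, compared with the protein listing.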