Monarch geneset OGS2.0

DPOGS213985
Transcript: DPOGS213985-TA (1932 bp)
Protein: DPOGS213985-PA (643 aa)
Genomic position: DPSCF300417 + 15887-20105
RNAseq coverage: 401x (Rank: top 30%)
Annotation
Heliconius: HMEL011697  4e-116  40.38%
Bombyx: BGIBMGA008208-TA  4e-41  31.52%
Drosophila: XRCC1-PA  2e-39  28.90%
EBI UniRef50: UniRef50_D1ZZU6  1e-53  31.13%  Putative uncharacterized protein GLEAN_07398 n=3 Tax=Pancrustacea RepID=D1ZZU6_TRICA
NCBI RefSeq: XP_975029.1  4e-54  31.19%  PREDICTED: similar to DNA-repair protein XRCC1 (X-ray repair cross-complementing protein 1) [Tribolium castaneum]
NCBI nr blastp: gi|270005349  4e-53  31.13%  hypothetical protein TcasGA2_TC007398 [Tribolium castaneum]
NCBI nr blastx: gi|270005349  6e-62  31.72%  hypothetical protein TcasGA2_TC007398 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005634  3.1e-29  nucleus
  GO:0003684  3.1e-29  damaged DNA binding
  GO:0000012  3.1e-29  single strand break repair
  GO:0005622  3.9e-20  intracellular
KEGG pathway: tca:663907  1e-53
  K10803 (XRCC1) maps-> Base excision repair
InterPro domain:
  [1-147]  IPR008979  2.6e-35  Galactose-binding domain-like
  [1-145]  IPR002706  3.1e-29  DNA-repair protein Xrcc1, N-terminal
  [338-423]  IPR001357  3.9e-20  BRCT
Orthology group: MCL16977  Patchy

Nucleotide sequence:

>DPOGS213985-TA
ATGCCTCGAGTTAAAATTGATTACGTTGTGAGCATGAGCAGTGAGGACCCTGAAAATCCGGCAAACAATTTATTATCGTGGGAAATAAATAAAAAGAAATGGCTTTGTAAGACGGGGGAGACCTCTTGTTCAGTAGTTCTACAGCTGACTAAGGCTGTCCAGATAGAATCGATCACGATTGGAACATACCACACGTCTATGTTAGAGGTGTTAGTAGGATCATCAGAGAAGCCCAATGAAACCTTTGAGGTGTTAGTCCCGAGTTGTGTGCTGTGTTCTCCACGAGAGGCTCGCGGAGCACCAGTTGAGAGAGTGAAGAGCTTTACACGAGATGAACTGACATCTGTCCGACAGAGACGCTGGGACCGATTGAGACTAGTCTGCTCACAACCTTACAACAGACACTGCAAGTATGGAATCTCATTTGTTCATATCTTTGAACCGGAAAGTCCAACTCTGTCCGGTCACACAGCCTTGTCCATCTCTCGCACGTTCCGCCTCGAGGAGCTTGGTTCAGAGGATGAAGAGTTCCGTCCTGGGGAACTGTTCCATAAACACAAACAAGACCAGAAAACACATAATAGTACTGACGCACAAATCAGACAAGCTACGTCGCGGGCACTGAACAACATAGGCGACTCCTCCACCAGATTAACAAAGACGCCAATATCGAAAACTAGCAACAGACCGTCTGATCAAAGCTCGAATTATTCCACTCGAGAAAAGAGGAGTCTCATGTATACAGAGGATGACGAGCAACCACACCAGAAAATAGATAGAGTTATAGAAAGACATGGGAGAGAGAAACAGAGAGAAGATGGAAAAAAGAAAACTGACCAGGAGGCCAAGAAGAAGAAGACCGGCAGTAAGAGAGAAGAAAGTAAGGAAGATGAGAAGAACAAGGAGACAAAACATACAGACAATCGGACTCAGGACCAGACACATACTACATTAATGAATTCCACTAAAAGGAAACACTCCCAGGAAGCCCCATCCCGGGCTCCGGCCCGTCCCCTGTCTTCTCTTCTGTCGGATGTGGTGTTCTCTATTTCGGGATACGTGAACCCGCGTCGAGCGTCGGTCCGCGCGGCCGCTCTCCGGATGGGTGCGCACTACACGCCCGACGTCACCGCCGACTGCACACATCTCATCTGTGCCTTCCCCAACACTCCAAAACTCCGCCTGGTGCGGGGAAGTGTGGCCGTCGTCAAGGCCGAGTGGGTCGAAGACTGTCTGCGCTCGGGGACCAGGCTGAAGGAGACAACATACGACACGAGGGGAGGGGCGGGGGGGCGCCACCAGGACAGTGAGAGGACGGGAGACGGGGGAGGAGGAGGGAGGGGGCGATGTAGTAACGGTGACTCCGCAGAGACGGAGCATGACACGGACGACGAAATAGAACAAGTCATGCGACGACAAAAGAGAAAACGACTCAGTGAAGAGGAAGAAGAGGGAGGGGAGGAAGACCGGGATGTGATGTGCGACACGGACGAGGAGGACGGAGAACAGAGGCGGGAGGAGATAGACGCCCGTAAGGGCGTGTGTGTGCAGTCGCTGCCGACGTTCCTGGCGGGAGTGACGTTCTCCCTGTGCCCGGAGCTACCGGTGTGTGAGCGCGCGCTCCTGGAGCGGTACATCACAGCCTACGGCGGGGTGGTGCTGCAGGGGAAGAGGACGAAGGAGGCAAGGCGTGAGATTCACGAAGGCAAACTGCGGATGATGCGGATGGTAACAGTATGGTGTCGGCGGGAGGAGGATCGCGGAAAAAACTCTGGAACGGAGCTTGGTCAAGTAGGTCGACGAGCGCAGCACAACATTTTGACTGAACACTCTCTTCCGACTCCGTACGCAAGAAAGAAAATTGACGATAAGTTTATTACTGCCTGGAAACTTTTATTGGATGATAATATTCTGCGTCGTATAGAGAAGTAA

Protein sequence:

>DPOGS213985-PA
MPRVKIDYVVSMSSEDPENPANNLLSWEINKKKWLCKTGETSCSVVLQLTKAVQIESITIGTYHTSMLEVLVGSSEKPNETFEVLVPSCVLCSPREARGAPVERVKSFTRDELTSVRQRRWDRLRLVCSQPYNRHCKYGISFVHIFEPESPTLSGHTALSISRTFRLEELGSEDEEFRPGELFHKHKQDQKTHNSTDAQIRQATSRALNNIGDSSTRLTKTPISKTSNRPSDQSSNYSTREKRSLMYTEDDEQPHQKIDRVIERHGREKQREDGKKKTDQEAKKKKTGSKREESKEDEKNKETKHTDNRTQDQTHTTLMNSTKRKHSQEAPSRAPARPLSSLLSDVVFSISGYVNPRRASVRAAALRMGAHYTPDVTADCTHLICAFPNTPKLRLVRGSVAVVKAEWVEDCLRSGTRLKETTYDTRGGAGGRHQDSERTGDGGGGGRGRCSNGDSAETEHDTDDEIEQVMRRQKRKRLSEEEEEGGEEDRDVMCDTDEEDGEQRREEIDARKGVCVQSLPTFLAGVTFSLCPELPVCERALLERYITAYGGVVLQGKRTKEARREIHEGKLRMMRMVTVWCRREEDRGKNSGTELGQVGRRAQHNILTEHSLPTPYARKKIDDKFITAWKLLLDDNILRRIEK-