Monarch geneset OGS2.0

DPOGS211792
Transcript: DPOGS211792-TA, 816 bp
Protein: DPOGS211792-PA, 271 aa
Genomic position: DPSCF300107 + 403198-404013
RNAseq coverage: 207x (Rank: top 46%)
Annotation
Heliconius: HMEL007939; 2e-82; 57.14%
Bombyx: BGIBMGA004019-TA; 5e-43; 53.15%
Drosophila: Irbp-PA; 3e-19; 29.96%
EBI UniRef50: UniRef50_P23475; 9e-28; 32.97%; X-ray repair cross-complementing protein 6 n=74 Tax=Euteleostomi RepID=XRCC6_MOUSE
NCBI RefSeq: XP_002117640.1; 7e-22; 32.01%; hypothetical protein TRIADDRAFT_61626 [Trichoplax adhaerens]
NCBI nr blastp: gi|198637; 3e-28; 33.33%; p70 Ku lupus autoantigen, partial [Mus musculus]
NCBI nr blastx: gi|1469515; 7e-29; 33.33%; DNA repair enzyme [Mus musculus]
Group
Gene Ontology:
GO:0005488; 1.7e-35; binding
GO:0003677; 8.1e-18; DNA binding
GO:0006303; 8.1e-18; double-strand break repair via nonhomologous end joining
GO:0004003; 8.1e-18; ATP-dependent DNA helicase activity
KEGG pathway: mmu:14375; 2e-27
K10884 (XRCC6, KU70, G22P1) maps-> Non-homologous end-joining
InterPro domain:
[2-194] IPR016194; 1.7e-35; Spen Paralogue and Orthologue SPOC, C-terminal-like
[2-101] IPR006164; 8.1e-18; DNA helicase, ATP-dependent, Ku type
[97-180] IPR005160; 3.1e-14; Ku70/Ku80 C-terminal arm
Orthology group: MCL17283; Patchy

Nucleotide sequence:

>DPOGS211792-TA
ATGATGAAACTCCTTGGTTTCAAGCCAGCCAGTATCATTTGTAAGGAAAAGTGGTATTTTAAAATTGGACAATTTTTATATCCAAATGAAAGTATTATAGAAGGTTCCACAGTCGCTTTCAAAGCTTTACATGAAGCTTGCACTGTTATGAAAATGGTAGCACTTTGTATTTTGTGTACTAGAGTCAATTCTAGACCCGTAATAGTTGCGCTGAGTCCTTGTGTGAAACCTCTCAATCTTAACATTGATATTGGTTTTGACATTGTTAATATACCATTTGTTGAACATGTAAGGGAACTTAATGTCGAAGAGGATGTTATCGAAGATGAGAGTCTAGTTGTAGAAAGTGCACACAAAGAGCTGATGAAGGGTATAATAAATAATACTATAATAGATTACCGACCCGATATGTTTGAGGATCCCAAATTGCAATCTAAATACAGAGCGATTGAGGCTCTAGCATTGGACGAAGATGAGACTGAACCTTTTGTAGATACAACCAAACCTAGCATTGAAAGATTTCAAAACTTACCAGACGATCTATTTGAGGAACTATTTGGACCCTTTGCATCTATGACTTTGAAGAGATCATGTCCTAAAGTGCCATCTCAACAGAACAAGAAGCCAAAAATTGAAAATTTTGATGAAGAACTTTTTAATACTAAATTGAAAGAAAAAAAGATTGAGTCATATACTGTGCCACAGTTAAAAAACATATTAAAATATAAAAATATTCAGAATCTTCCAGCGTTAAATGGCTTAAAAAAGGCTGAGCTTGTTAATTTAGTTTACACACATTGTGATGAAGAGAAATAA

Protein sequence:

>DPOGS211792-PA
MMKLLGFKPASIICKEKWYFKIGQFLYPNESIIEGSTVAFKALHEACTVMKMVALCILCTRVNSRPVIVALSPCVKPLNLNIDIGFDIVNIPFVEHVRELNVEEDVIEDESLVVESAHKELMKGIINNTIIDYRPDMFEDPKLQSKYRAIEALALDEDETEPFVDTTKPSIERFQNLPDDLFEELFGPFASMTLKRSCPKVPSQQNKKPKIENFDEELFNTKLKEKKIESYTVPQLKNILKYKNIQNLPALNGLKKAELVNLVYTHCDEEK-
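
Consistency check:

The lengths reported in this record (genomic span 403198-404013, an 816 bp transcript, a 271 aa protein) appear to agree with one another. The Python sketch below reproduces that arithmetic. It assumes 1-based inclusive coordinates, a single-exon gene model, and a coding sequence that includes the terminal stop codon (the nucleotide sequence above ends in TAA). None of these assumptions is stated in the record, and the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS that includes its stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_bp // 3 - 1


# Values taken from the record for DPOGS211792.
transcript_bp = span_length(403198, 404013)
protein_aa = expected_protein_length(transcript_bp)
print(transcript_bp, protein_aa)
```

Under these assumptions the genomic span equals the transcript length, which is what a single-exon model would give, and the trailing "-" in the protein sequence marks the stop codon rather than a residue.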