Monarch geneset OGS2.0

DPOGS204170
TranscriptDPOGS204170-TA1017 bp
ProteinDPOGS204170-PA338 aa
Genomic positionDPSCF300034 - 230338-232711
RNAseq coverage145x (Rank: top 54%)
Annotation
HeliconiusHMEL0099611e-16286.39% 
BombyxBGIBMGA005107-TA0.097.25% 
Drosophilaspn-A-PA7e-13169.06% 
EBI UniRef50UniRef50_Q066094e-15778.40%DNA repair protein RAD51 homolog 1 n=204 Tax=root RepID=RAD51_HUMAN
NCBI RefSeqNP_001037484.10.097.55%Rad51 homolog [Bombyx mori]
NCBI nr blastpgi|1129845360.097.55%Rad51 homolog [Bombyx mori]
NCBI nr blastxgi|1129845360.096.45%Rad51 homolog [Bombyx mori]
Group
Gene OntologyGO:00062817e-179DNA repair
GO:00055247e-179ATP binding
GO:00036847e-179damaged DNA binding
GO:00080947e-179DNA-dependent ATPase activity
GO:00001661.6e-16nucleotide binding
GO:00171114.8e-11nucleoside-triphosphatase activity
KEGG pathwaycqu:CpipJ_CPIJ0036793e-160 
 K04482 (RAD51)maps-> Pancreatic cancer
    Pathways in cancer
    Homologous recombination
InterPro domain[1-338] IPR0164671.4e-190DNA recombination and repair protein, RecA-like
[24-338] IPR0119417e-179DNA recombination/repair protein Rad51
[77-337] IPR0136321e-144DNA recombination and repair protein Rad51, C-terminal
[15-84] IPR0109951.6e-16DNA repair Rad51/transcription factor NusA, alpha-helical
[118-305] IPR0035934.8e-11ATPase, AAA+ type, core
Orthology groupMCL12269 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204170-TA
ATGGCTACAACAGCATCTGCTGCCACGGCTACAATAGAGGAAGATTTGGACGAATGTGGACCACAGTTGATTACAAAATTAGAGGGTAATGGAATTACTTCAGGGGATATAAAAAAGTTAGAAGAGGCTGGGTATCATACTGTTGAGTCTGTCGCATATGCTCCAAAGAAATGGTTAATAACAATAAAGGGCATCTCCGAAGCCAAAGCTGATAAAATACTGTCAGAAGCTTCAAAATTGGTACCAATGGGATTTACAACAGCTACAGAGTTCCACCAGAAGAGGGCAGAAATTATACAGTTGACAACGGGATCAAAAGAACTTGATAGACTACTTGGTGGTGGCATAGAAACAGGCTCAATCACTGAGATCTTTGGAGAGTTTCGGACAGGAAAAACACAGTTGTGTCATACTTTAGCTGTGACTTGTCAGCTTCCGATTGAGCAATCTGGTGGAGAAGGGAAGTGTATGTACATAGACACTGAAGGAACGTTCAGACCAGAACGATTACTGGCTGTGGCTCAGAGATATGGAATGGAAAGTGCCGCAGTTCTCGATAATGTGGCGTATGCAAGGGCTTATAACACAGATCATCAAACCCAGCTCTTAGTACAAGCGTGTGCTATGATGGCTGAATCTAGATACTCATTGCTGATAGTTGATAGTGCGACAGCGCTTTACAGAACTGACTATTCTGGTAGAGGGGAATTGAATTCAAGACAATTACATCTCGGAAGATTTATGAGAATGTTACTCAGATTGGCCGATGAGTTCGGTGTAGCTGTTATCATAACAAATCAAGTGGTGGCACAGGTCGATTCCGTTGGTGTGTTCAATGCTGATACAAAGAAACCTATCGGGGGACATATTATAGCTCACGCATCGACCACAAGACTTTATCTCCGGAAGGGCAGAGGAGATAATAGAGTATGCAAGATATATGACAGCCCCTGTCTGCCGGAAACGGAAGCCATGTTTGCTATCAGCACCGAAGGCATCACAGATGCTAAGGAATGA

Protein sequence:

>DPOGS204170-PA
MATTASAATATIEEDLDECGPQLITKLEGNGITSGDIKKLEEAGYHTVESVAYAPKKWLITIKGISEAKADKILSEASKLVPMGFTTATEFHQKRAEIIQLTTGSKELDRLLGGGIETGSITEIFGEFRTGKTQLCHTLAVTCQLPIEQSGGEGKCMYIDTEGTFRPERLLAVAQRYGMESAAVLDNVAYARAYNTDHQTQLLVQACAMMAESRYSLLIVDSATALYRTDYSGRGELNSRQLHLGRFMRMLLRLADEFGVAVIITNQVVAQVDSVGVFNADTKKPIGGHIIAHASTTRLYLRKGRGDNRVCKIYDSPCLPETEAMFAISTEGITDAKE-