Monarch geneset OGS2.0

DPOGS202916
TranscriptDPOGS202916-TA807 bp
ProteinDPOGS202916-PA268 aa
Genomic positionDPSCF300126 + 403293-404771
RNAseq coverage339x (Rank: top 34%)
Annotation
HeliconiusHMEL0145871e-3639.30% 
BombyxBGIBMGA004198-TA1e-4366.10% 
Drosophila% 
EBI UniRef50UniRef50_P433525e-1135.64%DNA repair protein RAD52 homolog n=16 Tax=Amniota RepID=RAD52_MOUSE
NCBI RefSeqXP_002732295.15e-1138.78%PREDICTED: brain-specific angiogenesis inhibitor 3-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|3517107861e-1035.64%DNA repair protein RAD52-like protein [Heterocephalus glaber]
NCBI nr blastxgi|3517107868e-0936.00%DNA repair protein RAD52-like protein [Heterocephalus glaber]
Group
Gene OntologyGO:00062812e-07DNA repair
GO:00063102e-07DNA recombination
KEGG pathwayrno:2975612e-11 
 K10873 (RAD52)maps-> Homologous recombination
InterPro domain[45-153] IPR0072322e-07Rad52/22 double-strand break repair protein
Orthology groupMCL24998 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202916-TA
ATGCAGAGAAAACCAGGATACCCTGCTACCGCTAATAATATGAAAATGGAGCTTGGTTTGGGAATGGAATACGTTGCGCAATGTGATTTTGCATCTGAAAGCCAATGTGATGACGATGAGACGCAACAGCGCCGACAGCAGCTAATAAACTTTGGACATTCTCAGTGGGGGTTCAATAATTGGAGTTGGTTGGTTACTTCACAAGCCTTAGATTTTGTCGAAAATCATAATGGCATATACACAGCTGGTGTTGCATGTTTTGTGTCCGTTAAAGTAAAAAGTCTGGATATACAACGAGTTAATGTGGGCTATGCAACATCAGTAGCAGCCTATAAAGGACTCTCTATTCATAGAGCAAGGATGTGCTCTGTGACAAATGGTCTTCTCGAAACTCTTTTGAGTTTTGGTGGAAATCTTGCTTCAGAACTAATGGAACTCCTCGAAAGTAACAAAAATGAGGCAGCAAACCACATCATGGTGCCTGAAGCTGTTCCAGAGAGCAGCAGCAATCCGCTTATTAAAACAGAACCAAAGAACCTCTCAAAGCCAATTAATAGAAAAGATACTGAAGCCAATGTAAACCTACCTCAGGTTTTTCCACAGAAGGGGCCTAAGAATACACTAAACCCTCCCGTGGCTAAGGCCCATCCGATGCCCGCCAACCTGGCAATCAACATGTCAACGAATCTCCCAAATCCTCCGACGACCCCGGCCGCTAACTTGCCGCCCGCCGCCAACGCTCCCCGGCCACCCTACGCCGCGCCGCCCGCCCCCGCCCCCGCCCACGTTCGTCCCAAAATCACGTGA

Protein sequence:

>DPOGS202916-PA
MQRKPGYPATANNMKMELGLGMEYVAQCDFASESQCDDDETQQRRQQLINFGHSQWGFNNWSWLVTSQALDFVENHNGIYTAGVACFVSVKVKSLDIQRVNVGYATSVAAYKGLSIHRARMCSVTNGLLETLLSFGGNLASELMELLESNKNEAANHIMVPEAVPESSSNPLIKTEPKNLSKPINRKDTEANVNLPQVFPQKGPKNTLNPPVAKAHPMPANLAINMSTNLPNPPTTPAANLPPAANAPRPPYAAPPAPAPAHVRPKIT-