Monarch geneset OGS2.0

DPOGS202586
Transcript: DPOGS202586-TA; 1683 bp
Protein: DPOGS202586-PA; 560 aa
Genomic position: DPSCF300363 - 116044-117844
RNAseq coverage: 3x (Rank: top 90%)
Annotation
Heliconius: HMEL014202; 0.0; 62.89%
Bombyx: BGIBMGA011431-TA; 6e-169; 55.71%
Drosophila: Cat-PA; 4e-102; 39.80%
EBI UniRef50: UniRef50_A5XB38; 9e-102; 41.21%; Catalase n=5 Tax=cellular organisms RepID=A5XB38_HALDI
NCBI RefSeq: XP_314995.4; 1e-105; 42.54%; AGAP004904-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|114325949; 6e-111; 43.93%; catalase [Chlamys farreri]
NCBI nr blastx: gi|114325949; 6e-111; 43.93%; catalase [Chlamys farreri]
Group
Gene Ontology:
  GO:0006979; 1.4e-135; response to oxidative stress
  GO:0004096; 1.4e-135; catalase activity
  GO:0055114; 1.4e-135; oxidation-reduction process
  GO:0020037; 4.6e-09; heme binding
  GO:0005506; 4.6e-09; iron ion binding
KEGG pathway: aga:AgaP_AGAP004904; 3e-105
  K03781 (katE, CAT) maps to:
    Peroxisome
    Amyotrophic lateral sclerosis (ALS)
    Tryptophan metabolism
    Methane metabolism
InterPro domain:
  [46-536] IPR020835; 1.4e-135; Catalase-like domain, haem-dependent
  [83-435] IPR018028; 3.9e-123; Catalase-related subgroup
  [77-404] IPR011614; 9.8e-66; Catalase, N-terminal
  [460-515] IPR010582; 4.6e-09; Catalase-related immune responsive
Orthology group: MCL10272; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202586-TA
ATGCGCATCCGGTTGTTTTGTCTGCTTCTTATTTACCAAATATCTTATTTACGCAATGTGTTATGTTATTATTGTGACAATGAGTTATGTGACTACCTAAATCGGACAGATCCAGCCACGAGACAGTTGTATGAATTCAAACTACAGCATCCGAAACCAATAGGAATACTAACGGTCAGTTCGGGAGAGTTTGTAGAGATAAGAAAAACCAATTCTTTTAACTCAGATCAGTTTTTGAACCAGTATCATACAGATTTAATTAGTCATACTAATGATGAGAGAATACCTGAAAGATTTGTTCACGCTAAAGGCGGAGGTGCGTTTGGCTATTTTGAGGTCACTCATGACGTCACAAAATACGTAAGCGCGGAATTGTTTGACACAATTGGCAAGAAGACACCATTAGTTGTGCGTTTTTCTACTGTTGTGCAGAATTTGGGAGGGAATGACCTTGCTAGGGAGGTAAAAGGAATGGCTATCAAACTTTATACCCAAGAAGGAAACCTTGATTTCTTATGTCTCAATTATCCAGTATTTTTTTATAGAGATCCTTTATTTTTTACTTCTTTCAGTCATAGTTTTAAAAGAAACCCCAAGACATTCCTATTGGATTTCACAATGTCTTGGGATTTCGTAACTAAAAGACCTGATGCATTACACAGTTACTTGTGGTTGCTATCAGATTACGGTATTCCTAATGGTTATAGGAAAATGGATGCTTTTCCGATTCATACATACAGGGTATACAATAAACATGGAGATACGTACTTCGTCAGATTCAATTTCAGGACCGAACAAGGTGTTGAAAATCTTCCCTCGGATGTTGCTGCTGAAATTTCTTCCAGGGATTTAGATTATTTCAATAGAGATTTGTTTAATGCTATAGAGAATAAAACATATCCATCTTGGAGATTAGACATGGATATCATGACATTCGAGGATGTTAAAAATGTTGATTATAATCCATTCGACGTCGGTAGATTGTGGAAGGAAGGTACTTACTTCACAGAGACAGTGGGACGACTTGTTCTAGACAGAAATCCTGATAATTACTTCAGAGCAATAGAACAGAGCGCTTTTAATCCAGCCAATTTAGTACCGGGTATACCTGGTCCAATGGACACCATGTTTCGTAGTAGACGGCAATCTTACCGGGATGCTCAAATTTATCGTTTGGGAGTGAATCATAACAGAATTAAAGTCAACCAGCCCCTGTATTACAAGGGATACAATCGTGACGGAGTATCTCCTTTAAAAGATAACATGAAAGATGCACCGACTTATTATCCAAACAGATTTAGCGGCCCCGTACCATATGTAGATCCAAATATGCCTAAAGAGAAATTTAAAATTTACGAAACCACTGCTGTTGATTTGGAACCAGCTGCAAATTTTTATAATAATATCTTGAAAACGGACGATCAACGGGAGAGACTGGCCAATAACAGTGTTGCAAGGCTAATAACCGTATCCCCGGAATTGCAAAGGAGGGTTATACGGTTGTTTAGTTTAGCAGAACCTGATTTAGGTAGAAGAGTGGAACGTATATTGGTGGAAACTTTGGAACAGCCGCCGCCTCCTATACAACCTAGTCGCGTGCTAAAAGTACCTCCGACAATGTATGCTATGTATAGTAGTAAAATGAATGATGAAAACCAAAACAATCCCTTCATTCATAATTGA

Protein sequence:

>DPOGS202586-PA
MRIRLFCLLLIYQISYLRNVLCYYCDNELCDYLNRTDPATRQLYEFKLQHPKPIGILTVSSGEFVEIRKTNSFNSDQFLNQYHTDLISHTNDERIPERFVHAKGGGAFGYFEVTHDVTKYVSAELFDTIGKKTPLVVRFSTVVQNLGGNDLAREVKGMAIKLYTQEGNLDFLCLNYPVFFYRDPLFFTSFSHSFKRNPKTFLLDFTMSWDFVTKRPDALHSYLWLLSDYGIPNGYRKMDAFPIHTYRVYNKHGDTYFVRFNFRTEQGVENLPSDVAAEISSRDLDYFNRDLFNAIENKTYPSWRLDMDIMTFEDVKNVDYNPFDVGRLWKEGTYFTETVGRLVLDRNPDNYFRAIEQSAFNPANLVPGIPGPMDTMFRSRRQSYRDAQIYRLGVNHNRIKVNQPLYYKGYNRDGVSPLKDNMKDAPTYYPNRFSGPVPYVDPNMPKEKFKIYETTAVDLEPAANFYNNILKTDDQRERLANNSVARLITVSPELQRRVIRLFSLAEPDLGRRVERILVETLEQPPPPIQPSRVLKVPPTMYAMYSSKMNDENQNNPFIHN-