Monarch geneset OGS2.0

DPOGS202587
TranscriptDPOGS202587-TA1683 bp
ProteinDPOGS202587-PA560 aa
Genomic positionDPSCF300363 - 105071-109185
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0142022e-17958.23% 
BombyxBGIBMGA011431-TA8e-16655.51% 
DrosophilaCat-PA1e-9839.24% 
EBI UniRef50UniRef50_P040405e-10039.31%Catalase n=690 Tax=root RepID=CATA_HUMAN
NCBI RefSeqXP_001943641.13e-10241.34%PREDICTED: similar to catalase [Acyrthosiphon pisum]
NCBI nr blastpgi|1143259499e-10741.00%catalase [Chlamys farreri]
NCBI nr blastxgi|1143259497e-10441.38%catalase [Chlamys farreri]
Group
Gene OntologyGO:00069796.6e-129response to oxidative stress
GO:00040966.6e-129catalase activity
GO:00551146.6e-129oxidation-reduction process
GO:00200372.1e-09heme binding
GO:00055062.1e-09iron ion binding
KEGG pathwaymdo:1000314951e-101 
 K03781 (katE, CAT)maps-> Peroxisome
    Amyotrophic lateral sclerosis (ALS)
    Tryptophan metabolism
    Methane metabolism
InterPro domain[48-534] IPR0208356.6e-129Catalase-like domain, haem-dependent
[77-433] IPR0180282.6e-117Catalase-related subgroup
[78-402] IPR0116144.4e-60Catalase, N-terminal
[456-513] IPR0105822.1e-09Catalase-related immune responsive
Orthology groupMCL10272 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202587-TA
ATGAAGACTGCCGGTGCGTGTGGCTGTTTGCTTTTTCTAATCATACAAGCACTGGGCGTATTCAGCAGTGATATCGATTATATTGAATATTTAAACAGAACCGATCCAACGAAAATTCAACTGTACGAATTCAGAAGAGAACACCCGAATCCTATTGGTATTCTAACTACAAGCGCAGGGAAAATTGTGGAAATCAGAGAGACGAAAACGCTCAATTCAGAACCATACGATAATAGTTATTTTGTAGATCTTTTAACTCATTGGCACGCCGAGAGAGTACCTGAAAGAATTATTTACGCTAAAGCCGCCGGTGCCTTTGGTTACTTTGAGGTAACTAACGATGTATCTAAGTACACTAAAGCTGAAGTTTTCAATGGCGTGGGCAAGAAGACCCCTGTTATGGTTCGAGTATCTACAATGCTGCAAAACAGAGGAGGAACTGATCTAGCTAGAGAATCAAAAGGCTTTTCAGTTAAGTTTTACACAAAGGAAGGAAATTTAGACTTACTATGCCTCAATATGCCAGTTTTTTTCCTTAATGATCCTATTGATTTCCCAAGTTTAATTCATGCACTGAAAAGGAATCCAAAACCTCATCTCTATGATTTTAATATGGTTTATGATTTATTAACAAAGAGACCGTTCTCACTCTACGGTTATTTATGGACTATGTCGGATTTTGGTATTCCAAATACATATAGACGTATGGACGCGTTCGCAATTCACACTTACGAGATTAATAACAAATTTGGCGATCGTTACTTTGTCAAATTCAATTTTAGAACTGAGCAAGGTATAGAAAATTTACCTTCGGATGTCGCAGAAAAAATAACTGTACGTGATCCAGATTATTACAAAAGAGACTTGTATAATGTTATAGAGAAAAAGCAATTCCCCTCATGGATTTTGGAGATGGATGTAATGAGCCTTGATGATATAAAAAATATTGATTATAATCCGTTCGACGTTGGTAGAATATGGAAAAATGGTACTTATTGTACAGTACCCATCGGGAGACTTGTTTTGAATAGAAACCCAGAAAACGCCTATAGAGTTGCTGAAAGAGTCGCTTTTAATCCTGCAAATTTGGTACCAGGAATCCCTGGACCACAAGATTTATTGTTCAAAGCTAGACGACAATCTTATCGGGAAGCTCAAATATATCGTTTAGGTGTAAATTACAATAAAATTATGGTTAATGCTCCTCTGTATTCTAAAGTATATAACCGGGATGGCGTGGCGCCAGTGAGAGACAATATGAAAGATGCTCCTATCTATTATCCCAATTCGTTTAGTGGACCAGTACCTTATGTAGATCCCGGGCAATCAAATGAGAAATTAATCATTTACGAGTCTAATGCAGTTGATTTGGAACAACCTGCTCTCTTCTATAATAAAATTCTTCGAACTGATGAAGAGAGGACAAGACTTGCAAAGAATATCGCACCAACGTTAGTTGGAGTTTATCCTGAAATTCAGAAACGTTTGATGCGCTTGCTCACTCTAATAGATCACAGGTTAGGAAAAGATGTTGAAGTCTTGCTTGCAAAGGAATTGAAAAAGCCACACCCTAAACCCCCAAAAGTTTTAAGCTACTCTCGAAATTTAAAGAGCCAGCAAGAATTTTGTCCAATGAAACCTGATGGACATAATAATGAAGATAAAGCCATAGATAGAGAGTAG

Protein sequence:

>DPOGS202587-PA
MKTAGACGCLLFLIIQALGVFSSDIDYIEYLNRTDPTKIQLYEFRREHPNPIGILTTSAGKIVEIRETKTLNSEPYDNSYFVDLLTHWHAERVPERIIYAKAAGAFGYFEVTNDVSKYTKAEVFNGVGKKTPVMVRVSTMLQNRGGTDLARESKGFSVKFYTKEGNLDLLCLNMPVFFLNDPIDFPSLIHALKRNPKPHLYDFNMVYDLLTKRPFSLYGYLWTMSDFGIPNTYRRMDAFAIHTYEINNKFGDRYFVKFNFRTEQGIENLPSDVAEKITVRDPDYYKRDLYNVIEKKQFPSWILEMDVMSLDDIKNIDYNPFDVGRIWKNGTYCTVPIGRLVLNRNPENAYRVAERVAFNPANLVPGIPGPQDLLFKARRQSYREAQIYRLGVNYNKIMVNAPLYSKVYNRDGVAPVRDNMKDAPIYYPNSFSGPVPYVDPGQSNEKLIIYESNAVDLEQPALFYNKILRTDEERTRLAKNIAPTLVGVYPEIQKRLMRLLTLIDHRLGKDVEVLLAKELKKPHPKPPKVLSYSRNLKSQQEFCPMKPDGHNNEDKAIDRE-