Monarch geneset OGS2.0

DPOGS210052
Transcript        DPOGS210052-TA   1371 bp
Protein           DPOGS210052-PA   456 aa
Genomic position  DPSCF300017 - 1083164-1084607
RNAseq coverage   1x (Rank: top 94%)
Annotation
Heliconius        HMEL010419        0.0     80.80%
Bombyx            BGIBMGA012691-TA  0.0     75.83%
Drosophila        Cat-PA            2e-125  51.60%
EBI UniRef50      UniRef50_P04040   5e-126  49.33%  Catalase n=690 Tax=root RepID=CATA_HUMAN
NCBI RefSeq       XP_002427710.1    1e-128  49.32%  Catalase, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|382934505      3e-131  55.04%  catalase [Spodoptera litura]
NCBI nr blastx    gi|382934505      3e-129  55.04%  catalase [Spodoptera litura]
Group
Gene Ontology     GO:0006979  8.5e-149  response to oxidative stress
                  GO:0004096  8.5e-149  catalase activity
                  GO:0055114  8.5e-149  oxidation-reduction process
KEGG pathway      phu:Phum_PHUM336860  4e-128
                  K03781 (katE, CAT) maps-> Peroxisome
                                            Amyotrophic lateral sclerosis (ALS)
                                            Tryptophan metabolism
                                            Methane metabolism
InterPro domain   [34-447]  IPR020835  8.5e-149  Catalase-like domain, haem-dependent
                  [37-417]  IPR018028  1.9e-143  Catalase-related subgroup
                  [43-384]  IPR011614  7.4e-75   Catalase, N-terminal
Orthology group   MCL18527  Lepidoptera specific

Nucleotide sequence:

>DPOGS210052-TA
ATGTTAATATTATTATTGATAAGAGCCGCGCTCGCCACTCGAAGAGACCCCGCTGCGGATCAAATAGTCATGTTTAAAGAAAATACACCAGGACCCATCGGAATAATGACTACGAGCGCCGGCGCGCCCGTGGAGTACGAAGAAGCGACGAATACATTGAACAGCAGGTTAATCTTCAATGAGTTCTTTATGGACTCCATAACACATCTCGTCCGCGAAAGAATACCGGAACGAATAGTGCATGCGAAGGCCGGCGGAGCGTTCGGTTATTTTGAAGTTACACATGATGTGACACATATTTGTAAAGCAAAATTGTTCAGTAAGGTTGGCAAGAGAACACCTGTGGCTGCAAGATTTTCACCTGTGGTGGTTGAGAGGGGTGGAAGTGATACATCTAGGGATGCCCGCGGCTTTGCTGTTAAATTTTACACTGAAGACGGGAACTTTGATATTGTTGGTTTTAATACACCAATGTATGTTTATAATGATCCACGACTTTTCCCTACTTTCGTAAGAGCACAAAAGAAAAATCCAGCAAATAATCTGTTTGATCCTAACACGCTTTGGGATTTTTTAACACTACAACCGGAGAGCTTTCATATGTTTTTATTGGTATTTGGGGATCGTGGCATTCCAGATGGTTATCGGCACATGCCTGGATTTGGTATTCATACGTTCCAAGTCGTTAATGAACACGGAGATATTCATTTTGTAAGATTTCATTTCGTGCCTGACGCTGGTATTAAAAACTTGAGATCGGAAGAAGCTAGAAAAATTGGAGCAGAAGATGCGGATTACAACACGAGAGAGTTATATAGGGCCATAGGAAACGGTGAATTTCCAAGCTGGACTGTTAGCATACAGGTGCTGACTCTGGATGAAGTGAAAACAGCTGGATTTAACGTATTTGATGTAACTCTTGTTCTCCCTTTGGACGACTATCCACTCAAAAAATTAGGGAAACTCGTATTAAACAAAAATGCAGTAAATTACTTTGCTGAAATAGAGCAGCTTGCTTTCTCTCCTGCCAATTTAGTTCCGGGGATACTCGGAGCACCTGACAAGTTGTTTGAAGCCCGCCGTTTGTCGTACAGGGATGCACAGTATTACCGCCTAGGAGCAAATTTCAATAAGATCCCAGTAAATTGTCCGTTTAGAACCGAAGTTTTTGCTTACAACAGAGATGGAAGGCCGCCTGTGAAGGATAATGGCAAAGATACCCCAAACTATTTCCCCAACTCGTTCCATGGTCCAGTACCATATATTGACAAAAGTAAAGGCGACCTCATCGAAATTGTGGAAGAGAAGGCAAACAATTTCATACAATCGAGAGAATTGTATGTTTTTAATGCCAATAAATTAAAGGTATAA

Protein sequence:

>DPOGS210052-PA
MLILLLIRAALATRRDPAADQIVMFKENTPGPIGIMTTSAGAPVEYEEATNTLNSRLIFNEFFMDSITHLVRERIPERIVHAKAGGAFGYFEVTHDVTHICKAKLFSKVGKRTPVAARFSPVVVERGGSDTSRDARGFAVKFYTEDGNFDIVGFNTPMYVYNDPRLFPTFVRAQKKNPANNLFDPNTLWDFLTLQPESFHMFLLVFGDRGIPDGYRHMPGFGIHTFQVVNEHGDIHFVRFHFVPDAGIKNLRSEEARKIGAEDADYNTRELYRAIGNGEFPSWTVSIQVLTLDEVKTAGFNVFDVTLVLPLDDYPLKKLGKLVLNKNAVNYFAEIEQLAFSPANLVPGILGAPDKLFEARRLSYRDAQYYRLGANFNKIPVNCPFRTEVFAYNRDGRPPVKDNGKDTPNYFPNSFHGPVPYIDKSKGDLIEIVEEKANNFIQSRELYVFNANKLKV-
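
The lengths and sequence ends listed in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative only and is not part of the original record. It assumes the standard genetic code, that the 1371 bp transcript is coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names and the small codon table are hypothetical helpers introduced for this example, and the two nucleotide fragments are the first 24 and last 9 bases of DPOGS210052-TA as printed above.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1371
PROTEIN_AA = 456
GENOMIC_START, GENOMIC_END = 1083164, 1084607

# First 24 and last 9 nucleotides of DPOGS210052-TA, copied from the record.
CDS_START = "ATGTTAATATTATTATTGATAAGA"
CDS_END = "AAGGTATAA"

# Partial codon table (standard genetic code), covering only the codons
# that occur in the two fragments above. "-" marks the stop codon, matching
# the trailing "-" in the protein sequence of the record.
CODONS = {
    "ATG": "M", "TTA": "L", "TTG": "L", "ATA": "I",
    "AGA": "R", "AAG": "K", "GTA": "V", "TAA": "-",
}


def cds_length(protein_aa: int) -> int:
    """Length in bp of a CDS encoding protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def translate(nt: str) -> str:
    """Translate a fragment whose length is a multiple of 3, codon by codon."""
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, len(nt), 3))


if __name__ == "__main__":
    print("CDS length matches transcript:", cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    print("Genomic span minus transcript (bp):",
          genomic_span(GENOMIC_START, GENOMIC_END) - TRANSCRIPT_BP)
    print("N-terminus:", translate(CDS_START))
    print("C-terminus:", translate(CDS_END))
```

Under these assumptions, 456 residues plus a stop codon account for exactly the 1371 bp of the transcript, and the translated fragments agree with the start (MLILLLIR) and end (KV-) of DPOGS210052-PA. The genomic interval is longer than the transcript, which would be consistent with non-coding sequence such as an intron inside the gene model, although the record itself does not list the exon structure.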