Monarch geneset OGS2.0

DPOGS210053
Transcript: DPOGS210053-TA (1542 bp)
Protein: DPOGS210053-PA (513 aa)
Genomic position: DPSCF300017, 1078075-1079689
RNAseq coverage: 0x (Rank: top 98%)
Annotation
  Heliconius        HMEL010419        0.0     79.48%
  Bombyx            BGIBMGA012691-TA  0.0     74.11%
  Drosophila        Cat-PA            2e-135  46.52%
  EBI UniRef50      UniRef50_P04040   2e-138  48.79%  Catalase n=690 Tax=root RepID=CATA_HUMAN
  NCBI RefSeq       XP_001958202.1    3e-138  47.91%  GF23641 [Drosophila ananassae]
  NCBI nr blastp    gi|382934505      7e-143  50.20%  catalase [Spodoptera litura]
  NCBI nr blastx    gi|382934505      5e-140  50.20%  catalase [Spodoptera litura]
Group
Gene Ontology
  GO:0006979  6e-162   response to oxidative stress
  GO:0004096  6e-162   catalase activity
  GO:0055114  6e-162   oxidation-reduction process
  GO:0020037  1.3e-13  heme binding
  GO:0005506  1.3e-13  iron ion binding
KEGG pathway
  mcc:717406  6e-139
  K03781 (katE, CAT) maps to:
    Peroxisome
    Amyotrophic lateral sclerosis (ALS)
    Tryptophan metabolism
    Methane metabolism
InterPro domain
  [29-508]   IPR020835  6e-162    Catalase-like domain, haem-dependent
  [37-417]   IPR018028  4.8e-144  Catalase-related subgroup
  [43-384]   IPR011614  3.2e-76   Catalase, N-terminal
  [434-497]  IPR010582  1.3e-13   Catalase-related immune responsive
Orthology group: MCL18527 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210053-TA
ATGTTAATATTATTATTGATAAGAGCCGCGCTCGCCACTCGAAGAGACCCCGCTGCGGATCAAATAGTCATGTTTAAAGAAAATACACCAGGACCCATCGGAATAATGACTACGAGCGCCGGCGCGCCCGTGGAGTACGAAGAAGCGACGAATACATTGAACAGCAGGTTAATCTTCAATGAGTTCTTTATGGACTCCATAACACATCTCGTCCGCGAAAGAATACCGGAACGAATAGTGCATGCGAAGGCCGGCGGAGCGTTCGGTTATTTTGAAGTTACACATGATGTGACACATATTTGTAAAGCAAAATTGTTCAGTAAGGTTGGCAAGAGAACACCTGTGGCTGCAAGATTTTCACCTGTGGTGGTTGAGAGGGGTGGAAGTGATACATCTAGGGATGCCCGCGGCTTTGCTGTTAAATTTTACACTGAAGACGGGAACTTTGATATTGTTGGTTTTAATACACCAATGTATGTTTATAATGATCCACGACTTTTCCCTACTTTCGTAAGAGCACAAAAGAAAAATCCAGCAAATAATCTCTTTGATCCTAACACGCTTTGGGACTTTTTAACACTACAACCGGAGAGCTTGCATATGTTTTTATTGGTATTTGGGGATCGTGGCATTCCAGATGGTTATCAGCACATGCCTGGATTTGGTATTCATACGTTCCAAGTCGTTAATGAACACGGAGATAGTCATTTTGTAAGATTTCATTTTGTGCCTGACGCTGGTATCAAAAACTTGAGATCGGAAGAAGCTAGAAAAATTGGAGCAGAAGATGCGGATTACAACACGAGAGAGTTATATAGGGCCATAGGAAACGGTGAATTTCCAAGCTGGACAGTCAGCATACAGGTGCTGACTCTGGATGAAGTGAAAACAGCTGGATTTAACGTATTTGATGTAACTCTTGTTCTCCCTTTGGACGACTATCCACTTCAAAAATTAGGGAGACTCGTATTAAATAAAAATGCAGTAAATTACTTTGCTGAAATAGAGCAGCTTGCTTTCTCTCCTGCCAATTTAGTTCCGGGGATACTCGGAGCACCTGACAAGTTGTTTGAAGCCCGCCGTTTGTCGTACAGGGATGCACAGTATTACCGCTTAGGAGCAAATTTCAATAAGATCCCAGTAAATTGTCCGTTTAGAACCGAAGTTTTTGCTTACAACAGAGATGGAAGGGCGCCTGTGAAGGATAATGACAAAGATACCCCAAACTATTTCCCCAATTCGTTCCATGGTCCAGTACCATATATTGACAAAAGTAAAGGCGACCTCATCGAAATTGTGGAAGAGAAGGCAAACAATTTCATACAATCGAGAGAATTGTATGTGAATGAAATGACTAATGAGGAAAGGAATAGATTGGTGGAAAATATTTTATACAGCTTAGGACCAGCGACCCAGTTCATTAAAGACAGAGCGGTCAAAGTGTTCATGCTCATTCATCCAGATCTTGGAACGCGCATAGAACACGGGCTGTCCGCAAACGTGACGAACAAACTAACCAGTTACGAGCCTTATTGGAAATAA

Protein sequence:

>DPOGS210053-PA
MLILLLIRAALATRRDPAADQIVMFKENTPGPIGIMTTSAGAPVEYEEATNTLNSRLIFNEFFMDSITHLVRERIPERIVHAKAGGAFGYFEVTHDVTHICKAKLFSKVGKRTPVAARFSPVVVERGGSDTSRDARGFAVKFYTEDGNFDIVGFNTPMYVYNDPRLFPTFVRAQKKNPANNLFDPNTLWDFLTLQPESLHMFLLVFGDRGIPDGYQHMPGFGIHTFQVVNEHGDSHFVRFHFVPDAGIKNLRSEEARKIGAEDADYNTRELYRAIGNGEFPSWTVSIQVLTLDEVKTAGFNVFDVTLVLPLDDYPLQKLGRLVLNKNAVNYFAEIEQLAFSPANLVPGILGAPDKLFEARRLSYRDAQYYRLGANFNKIPVNCPFRTEVFAYNRDGRAPVKDNDKDTPNYFPNSFHGPVPYIDKSKGDLIEIVEEKANNFIQSRELYVNEMTNEERNRLVENILYSLGPATQFIKDRAVKVFMLIHPDLGTRIEHGLSANVTNKLTSYEPYWK-
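
Length check (illustrative):

The record lists a 1542 bp transcript and a 513 aa protein, and the protein sequence ends in "-", which appears to mark the stop codon (the nucleotide sequence ends in TAA). The sketch below checks that these lengths agree and computes the length of the listed genomic interval. The function names are hypothetical, and the interval length assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Coding-sequence length in bp for a protein of `protein_aa` residues.

    Each residue is encoded by one 3-nt codon; the stop codon adds 3 nt.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length_bp(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1542
protein_aa = 513

# 513 codons plus one stop codon account for the whole transcript.
print(cds_length_bp(protein_aa) == transcript_bp)

# Length of the listed genomic interval on scaffold DPSCF300017.
print(span_length_bp(1078075, 1079689))
```

If the coordinate assumption holds, the genomic interval is somewhat longer than the transcript; the record itself does not say how the difference arises.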