Monarch geneset OGS2.0

DPOGS209670
Transcript        DPOGS209670-TA   1524 bp
Protein           DPOGS209670-PA   507 aa
Genomic position  DPSCF300134 - 262293-264341
RNAseq coverage   8603x (Rank: top 2%)
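The lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive, and that the 1524 bp transcript is the coding sequence including its stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
transcript_bp = 1524
protein_aa = 507
genomic_start, genomic_end = 262293, 264341

# A coding sequence should be a whole number of codons.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3

# One codon is the stop, so the remaining codons should equal the protein length.
coding_codons = codons - 1

# Assumption: coordinates are 1-based and inclusive, hence the +1.
genomic_span = genomic_end - genomic_start + 1

# Bases within the genomic span that are not part of the transcript.
extra_bp = genomic_span - transcript_bp

print(codons, coding_codons, genomic_span, extra_bp)
```

Under these assumptions the transcript holds 508 codons (507 residues plus a stop), and the genomic span is 2049 bp, 525 bp longer than the transcript. That difference would be consistent with intronic sequence, although the record itself does not list the exon structure.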
Annotation
Heliconius        HMEL008463         0.0   91.72%
Bombyx            BGIBMGA000701-TA   0.0   86.59%
Drosophila        Cat-PA             0.0   69.68%
EBI UniRef50      UniRef50_A8CWF1    0.0   70.16%   Catalase n=8 Tax=cellular organisms RepID=A8CWF1_LUTLO
NCBI RefSeq       NP_001036912.1     0.0   86.59%   catalase [Bombyx mori]
NCBI nr blastp    gi|112982683       0.0   86.59%   catalase [Bombyx mori]
NCBI nr blastx    gi|112982683       0.0   86.59%   catalase [Bombyx mori]
Group
Gene Ontology     GO:0006979   6.6e-209   response to oxidative stress
                  GO:0004096   6.6e-209   catalase activity
                  GO:0055114   6.6e-209   oxidation-reduction process
                  GO:0020037   3.6e-19    heme binding
                  GO:0005506   3.6e-19    iron ion binding
KEGG pathway      dpo:Dpse_GA19920   0.0
                  K03781 (katE, CAT) maps->
                      Peroxisome
                      Amyotrophic lateral sclerosis (ALS)
                      Tryptophan metabolism
                      Methane metabolism
InterPro domain   [18-497]    IPR020835   6.6e-209   Catalase-like domain, haem-dependent
                  [26-409]    IPR018028   2.6e-184   Catalase-related subgroup
                  [21-376]    IPR011614   1.3e-101   Catalase, N-terminal
                  [432-497]   IPR010582   3.6e-19    Catalase-related immune responsive
Orthology group   MCL10272   Multiple-copy universal gene
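The InterPro rows above give each domain as a residue range on the 507 aa protein. As a rough illustration of how those ranges relate to one another, the sketch below computes each domain's length and the number of residues covered by at least one domain. It assumes the ranges are 1-based and inclusive; the function and variable names are hypothetical.

```python
# (accession, start, end) copied from the InterPro rows of the record.
DOMAINS = [
    ("IPR020835", 18, 497),
    ("IPR018028", 26, 409),
    ("IPR011614", 21, 376),
    ("IPR010582", 432, 497),
]
PROTEIN_LENGTH = 507  # aa, from the Protein row


def domain_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def covered_residues(domains):
    """Count residues that fall inside at least one domain."""
    covered = set()
    for _, start, end in domains:
        covered.update(range(start, end + 1))
    return len(covered)


lengths = {acc: domain_length(s, e) for acc, s, e in DOMAINS}
covered = covered_residues(DOMAINS)
fraction = covered / PROTEIN_LENGTH

for acc, n in lengths.items():
    print(acc, n)
print(covered, round(fraction, 3))
```

Because the three shorter ranges all sit inside the [18-497] range of IPR020835, the union is that range alone: 480 of 507 residues under the inclusive-coordinate assumption.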
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209670-TA
ATGGCTTCACGAGACGCTGCTTCCGACCAACTTGTAAACTTCAAGAAATCCGTTAAGGATTCCCCTGGCTATATGACTACAAAATATGGTACACCAATTGGAGTCAAGACTGCTATTCAAACTGTTGGAAAAAATGGACCTGCTTTACTTCAAGATGCTCAGTTCCTAGATGAAATATCATCATTTGATAGAGAGCGCATTCCTGAAAGAGTAGTTCATGCCAAGGGAGCTGGAGCCTTTGGTTATTTTGAAGTTACCCATGATATTTCAAAATATTCTGCTGCCAAGGTTTTTGAGAACATTGGCAAGAAAACACCTATAGCAGTCAGATTCTCCACTGTAGGAGGTGAGAGTGGATCAGCTGACACTGTCAGAGATCCCCGTGGATTTGCTGTCAAATTCTACACTGATGATGGTATCTGGGACCTTGTTGGTAACAACACTCCAATTTTCTTCATTAGAGATCCAACTCTGTTCCCTAGCTTTATTCACACACAGAAGAGAAATCCTGCTACCCACTTAAAAGATGCCGATATGTTCTGGGACTTTATGACTTTAAGACCTGAAACCATGCACCAATTGGTGTATTTATTTGGTGACAGAGGTATTCCTGATGGTTATAGATTCATGAATGGTTATGGATCTCATACCTTTAAACTTGTAAATGCTCAAGGAGTTGCTCACTGGGTTAAATTCCACTACAAGTCTAACCAAGGAATCAAGAACTTGCCTGTGGATAAAGCTGCTGAATTAGCTTCCTCTGATCCTGACTACTCTATTAGAGATTTATACAATGCAATTGGTAAAGGAGAATTTCCAACATGGACTCTATACATTCAAGTCATGACAATGGCTCAAGGAGAAAACTGCAAATTCAATCCATTCGACCTAACTAAAGTCTGGCCCCATTCAGAATACCCACTCATTCCTGTGGGAAAACTGGTGCTTGATAGAAACCCTAAAAACTATTTTGCTGAAGTTGAACAAATTGCCTTCAGCCCATCCAATCTGGTACCAGGCATTGAACCATCTCCTGATAAAATGTTGCAGGGACGCTTGTTTGCCTACAGTGACACTCATCGTCACCGTCTTGGAGCCAACTTCCTCCAGATTCCTGTTAACTGCCCATACAGGGTCACTGTTGCAAACTATCAGCGAGATGGACCTCAGAACATGTGCAACCAGGATGGTGCTCCTAATTACTTCCCAAATTCATTCTCTGGACCCCAAGAATGCCCACGTTCTCAACGTCTACAACCCAGATACAATGTGAGTGGTGATGTTGACCGTTATGACAGTGGTCAGACCGAAGATAACTTCTCCCAGGCTACCTCTCTATACAAACAAGTCTTTAGTGATGATGAAAAAGCTCGTTGTGTTGCCAATATTGTTGGTAATTTGAAGGATGCTGCAGGATTTATTCAAGAACGTGCCATCAAATTGTTTGTTCAAATAAGCCCTGATTTGGGTAATAAAGTAGCTGCTGGTCTTGCTCCTTACAAAAAATACCATGCCAACTTGTAA

Protein sequence:

>DPOGS209670-PA
MASRDAASDQLVNFKKSVKDSPGYMTTKYGTPIGVKTAIQTVGKNGPALLQDAQFLDEISSFDRERIPERVVHAKGAGAFGYFEVTHDISKYSAAKVFENIGKKTPIAVRFSTVGGESGSADTVRDPRGFAVKFYTDDGIWDLVGNNTPIFFIRDPTLFPSFIHTQKRNPATHLKDADMFWDFMTLRPETMHQLVYLFGDRGIPDGYRFMNGYGSHTFKLVNAQGVAHWVKFHYKSNQGIKNLPVDKAAELASSDPDYSIRDLYNAIGKGEFPTWTLYIQVMTMAQGENCKFNPFDLTKVWPHSEYPLIPVGKLVLDRNPKNYFAEVEQIAFSPSNLVPGIEPSPDKMLQGRLFAYSDTHRHRLGANFLQIPVNCPYRVTVANYQRDGPQNMCNQDGAPNYFPNSFSGPQECPRSQRLQPRYNVSGDVDRYDSGQTEDNFSQATSLYKQVFSDDEKARCVANIVGNLKDAAGFIQERAIKLFVQISPDLGNKVAAGLAPYKKYHANL-
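The protein sequence can be checked against the nucleotide sequence by translating the transcript codon by codon. The following is a minimal sketch that assumes the standard genetic code and writes the stop codon as "-", matching the trailing "-" in the protein record. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, rendering stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last five codons of DPOGS209670-TA.
start_peptide = translate("ATGGCTTCACGAGACGCTGCT")
end_peptide = translate("CATGCCAACTTGTAA")
print(start_peptide, end_peptide)
```

The two fragments translate to MASRDAA and HANL-, which agree with the start and end of DPOGS209670-PA. Passing the full transcript string to `translate` would allow the same comparison over the whole sequence.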