Monarch geneset OGS2.0

DPOGS212404
TranscriptDPOGS212404-TA723 bp
ProteinDPOGS212404-PA240 aa
Genomic positionDPSCF300258 - 267242-273896
RNAseq coverage443x (Rank: top 28%)
Annotation
HeliconiusHMEL0109532e-10175.10% 
BombyxBGIBMGA002798-TA1e-10976.45% 
DrosophilaCG9027-PD1e-2539.35% 
EBI UniRef50UniRef50_D6X0S31e-4545.74%Superoxide dismutase [Cu-Zn] n=1 Tax=Tribolium castaneum RepID=D6X0S3_TRICA
NCBI RefSeqXP_972244.13e-4645.74%PREDICTED: similar to copper-zinc superoxide dismutase [Tribolium castaneum]
NCBI nr blastpgi|910911945e-4545.74%PREDICTED: similar to copper-zinc superoxide dismutase [Tribolium castaneum]
NCBI nr blastxgi|910911945e-4746.15%PREDICTED: similar to copper-zinc superoxide dismutase [Tribolium castaneum]
Group
Gene OntologyGO:00047846.7e-46superoxide dismutase activity
GO:00551146.7e-46oxidation-reduction process
GO:00068013e-43superoxide metabolic process
GO:00468723e-43metal ion binding
KEGG pathwaytca:6609577e-46 
 K04565 (E1.15.1.1C, sodC, SOD1)maps-> Huntington's disease
    Peroxisome
    Amyotrophic lateral sclerosis (ALS)
    Prion diseases
InterPro domain[88-237] IPR0241346.7e-46Superoxide dismutase (Cu/Zn) / chaperones
[88-237] IPR0241366.7e-46Superoxide dismutase, Cu/Zn
[93-238] IPR0014243e-43Superoxide dismutase, copper/zinc binding domain
Orthology groupMCL19018 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212404-TA
ATGAACTTGAGGCTGCACATATTTCTCAGCCAGTTCTTGTGGCTTGTGTCCGTGCTGAAAGGGAAAAGTCTACAAGGCATACCGGGTTACGGCAGGAATTTATTGATCAAAACGATACCCGCTGTAGAAGACTACCAGAGCAATGTTTACGAAGTATTCATGGAGCCGTACTTATATGAACTTGGTACAACCTACCCACTAGACGGAGACCGCTCACAGAAACAGATCCAAATACCCATTGGACCACAGCCAATTCCCGGAATCCAAGCCATAGTTCACCTCCAAGACGATGAAGAATCCGGCGTGGAGGGAGATTTAGTATTCACGCAGTTAGTTCCAAACGGACCAGTGTCCATCGAGGGAAACATTACAGGACTGTCCCCGGGACTTCACGGACTACACGTACACCAGACTGGCGATGTTGATGACAATTGCAAAAAGATTGGTCCCCATTTTATTGCTTATTACGGGCGTCATGGAGGACCGCGGGACGCCGTCCGCCATGTTGGCGATCTTGGTAACATAAAAGCCGAAGAAGGCACTTTAGATGTCAAAATAGTCGACCACCTCATATCACTTACCGGCCCGAGGTCTATCGTTGGCCGTTCATTAGCTATAAGCAAAAGTGAAGACGACTATGGAAGGTCCAGTACTGAGGATAGCGCCCTAACCGGCACCTCGGGTCCTGCTATAGCCTGCGGCATTATCGGCTATCTTAATTAA

Protein sequence:

>DPOGS212404-PA
MNLRLHIFLSQFLWLVSVLKGKSLQGIPGYGRNLLIKTIPAVEDYQSNVYEVFMEPYLYELGTTYPLDGDRSQKQIQIPIGPQPIPGIQAIVHLQDDEESGVEGDLVFTQLVPNGPVSIEGNITGLSPGLHGLHVHQTGDVDDNCKKIGPHFIAYYGRHGGPRDAVRHVGDLGNIKAEEGTLDVKIVDHLISLTGPRSIVGRSLAISKSEDDYGRSSTEDSALTGTSGPAIACGIIGYLN-