Monarch geneset OGS2.0

DPOGS204910
TranscriptDPOGS204910-TA678 bp
ProteinDPOGS204910-PA225 aa
Genomic positionDPSCF300340 - 61071-64880
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0108263e-2660.00% 
BombyxBGIBMGA001697-TA1e-1147.83% 
DrosophilaCCS-PB2e-1038.71% 
EBI UniRef50UniRef50_E2BK277e-1853.19%Copper chaperone for superoxide dismutase n=4 Tax=Coelomata RepID=E2BK27_HARSA
NCBI RefSeqXP_625006.23e-2040.15%PREDICTED: similar to Copper chaperone for superoxide dismutase (Superoxide dismutase copper chaperone), partial [Apis mellifera]
NCBI nr blastpgi|3838571623e-2146.21%PREDICTED: copper chaperone for superoxide dismutase-like [Megachile rotundata]
NCBI nr blastxgi|3838571622e-2046.21%PREDICTED: copper chaperone for superoxide dismutase-like [Megachile rotundata]
Group
Gene OntologyGO:00047845.3e-21superoxide dismutase activity
GO:00551145.3e-21oxidation-reduction process
GO:00068014.2e-11superoxide metabolic process
GO:00468724.2e-11metal ion binding
KEGG pathwayame:5526299e-20 
 K04569 (CCS)maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain[21-132] IPR0241425.3e-21Superoxide dismutase copper chaperone
[21-132] IPR0241345.3e-21Superoxide dismutase (Cu/Zn) / chaperones
[80-145] IPR0014244.2e-11Superoxide dismutase, copper/zinc binding domain
Orthology groupMCL22616 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204910-TA
ATGATACCATCTAAGCTGGAAGTGCTCGTTGACTTTGGGCCGACCCCAGACAAAGTGACCGTTGAAAAGACTCTCAACTACCTGAACGCCCAAGACGATGTTCAGCAGGCAGTATTTAAGAATGGAGCAGTCATGGTAGAGACGGTGTTACCGAGTTCCGTCGTCTTGGATATGGTGATTAAGACATCGGGGAAGAGGGCTGTTCTTCAAGGATATGGGGACAGTACGTCAGCCGTAGCTATGGTCTCGAGCAAGTGCACCACGGAGCAAGTGCTCGGCGTCATACGCTTCACTCAGACGGACAGCGTCTTGATAGCTGACGGTAGTGTGGACGGACTGACTCCAGGCTTACACGGACTACACGTGCACGAGAGTGGGGATTTGAGCATGGAAATGGGTTTTACGAAGACAAATCAACAATCACGTCAGAGCGACAAAGCAACGCCATCTTTTGGTAGACTCCAGGAACAATCTAGAATACTGTCTCGAAGGAAATGGTACCATACAGACGAGGTTTACGATGAGGACATGATAAACGAGGCGCTGCTCGGACAAGCGGAAGTCATTAAGGGGAAAGCTATTGGTGTAAATTTCATGAAATTTCAAAAACCGCCTCCACCTTTGGATCACTTGAAGCATTCAGAAGTCTTCAAAGCTATCCACGAGCAAGAACATTAA

Protein sequence:

>DPOGS204910-PA
MIPSKLEVLVDFGPTPDKVTVEKTLNYLNAQDDVQQAVFKNGAVMVETVLPSSVVLDMVIKTSGKRAVLQGYGDSTSAVAMVSSKCTTEQVLGVIRFTQTDSVLIADGSVDGLTPGLHGLHVHESGDLSMEMGFTKTNQQSRQSDKATPSFGRLQEQSRILSRRKWYHTDEVYDEDMINEALLGQAEVIKGKAIGVNFMKFQKPPPPLDHLKHSEVFKAIHEQEH-