Monarch geneset OGS2.0

DPOGS203971
TranscriptDPOGS203971-TA618 bp
ProteinDPOGS203971-PA205 aa
Genomic positionDPSCF300005 + 732236-733025
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0103749e-9883.33% 
BombyxBGIBMGA000732-TA5e-8169.52% 
DrosophilaRFeSP-PA1e-2140.41% 
EBI UniRef50UniRef50_Q1G0W21e-6153.50%Ubiquinol-cytochrome c reductase iron-sulfur subunit n=21 Tax=Pancrustacea RepID=Q1G0W2_BOMMO
NCBI RefSeqNP_001106738.12e-6253.50%ubiquinol-cytochrome c reductase [Bombyx mori]
NCBI nr blastpgi|1644486524e-6153.50%ubiquinol-cytochrome c reductase [Bombyx mori]
NCBI nr blastxgi|3152584722e-6054.08%ubiquinol-cytochrome c reductase [Spodoptera litura]
Group
Gene OntologyGO:00166798.7e-72oxidoreductase activity, acting on diphenols and related substances as donors
GO:00551148.7e-72oxidation-reduction process
GO:00081212e-48ubiquinol-cytochrome-c reductase activity
GO:00515371.4e-372 iron, 2 sulfur cluster binding
GO:00164911.4e-37oxidoreductase activity
GO:00160204.5e-15membrane
KEGG pathwayapi:1001617451e-61 
 K00411 (RIP1, UQCRFS1, petA)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Cardiac muscle contraction
    Parkinson's disease
InterPro domain[66-205] IPR0143498.7e-72Rieske iron-sulphur protein
[60-205] IPR0063172e-48Ubiquinol-cytochrome c reductase, iron-sulphur subunit
[80-204] IPR0179411.4e-37Rieske [2Fe-2S] iron-sulphur domain
[142-153] IPR0058054.5e-15Rieske iron-sulphur protein, C-terminal
[9-77] IPR0041922.6e-13Ubiquinol cytochrome reductase, transmembrane domain
Orthology groupMCL26561 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203971-TA
ATGTGGGCAAAAGTGGTGGAGAGGCTTCATCGAGATATAAAACATCCAAACTTTGATGCATACAGAAAAGAGAAGTTCAAGGATCCGAGCAAAACCACTTGGCATTCTCAAGATGAAAAATACGGTTACACTTATGTGGTTGGCTTTTTTGGATTGCTAGGTGGAATGTATTGTACGAAAACAGAACTTATTCATTTCTTACTAAGCATGAGTGCTCCGGCTGATGTTCTGGCATTGGCTTCGATCGAAATTGATATAGGAAATATAGCACCAGGTGCGTGTTTATCATATAAATGGCGAGGAAAGCCACTTTTTGTGAAACATAGAACTGTGGGTGAAATTTCGGCTGAAGCGAAAACTCCTCTTAGTTTACTGATAGATCCAGAAACACCTGAGCAACGTACCCAGAAGCCTGAATGGCTTATAGTTATAGGCATATGTACTCATTTAGGTTGTGTTCCTGTTCCGAATTCTGGAGATTGGGCTGGTGGATTCTATTGTCCTTGCCATGGCAGTCATTATGATAACGTTGGTAGGGCTCGAAAGGGTCCAGCTCCTCTAAACTTGGAAGTGCCACCTTACACCTTCCTTTCTGATACATTAGTTCTAGTAGGATAG

Protein sequence:

>DPOGS203971-PA
MWAKVVERLHRDIKHPNFDAYRKEKFKDPSKTTWHSQDEKYGYTYVVGFFGLLGGMYCTKTELIHFLLSMSAPADVLALASIEIDIGNIAPGACLSYKWRGKPLFVKHRTVGEISAEAKTPLSLLIDPETPEQRTQKPEWLIVIGICTHLGCVPVPNSGDWAGGFYCPCHGSHYDNVGRARKGPAPLNLEVPPYTFLSDTLVLVG-