Monarch geneset OGS2.0

DPOGS210917
Transcript        DPOGS210917-TA  816 bp
Protein           DPOGS210917-PA  271 aa
Genomic position  DPSCF300045 + 69623-71134
RNAseq coverage   8951x (Rank: top 2%)
Annotation
Heliconius      HMEL015813        1e-121  79.34%
Bombyx          BGIBMGA003066-TA  3e-134  81.18%
Drosophila      RFeSP-PA          2e-38   65.08%
EBI UniRef50    UniRef50_Q1G0W2   8e-130  80.07%  Ubiquinol-cytochrome c reductase iron-sulfur subunit n=21 Tax=Pancrustacea RepID=Q1G0W2_BOMMO
NCBI RefSeq     NP_001106738.1    1e-130  80.07%  ubiquinol-cytochrome c reductase [Bombyx mori]
NCBI nr blastp  gi|164448652      3e-129  80.07%  ubiquinol-cytochrome c reductase [Bombyx mori]
NCBI nr blastx  gi|315258472      6e-129  82.29%  ubiquinol-cytochrome c reductase [Spodoptera litura]
Group
Gene Ontology  GO:0016679  1.1e-128  oxidoreductase activity, acting on diphenols and related substances as donors
               GO:0055114  1.1e-128  oxidation-reduction process
               GO:0008121  3.8e-54   ubiquinol-cytochrome-c reductase activity
               GO:0051537  2.4e-44   2 iron, 2 sulfur cluster binding
               GO:0016491  2.4e-44   oxidoreductase activity
               GO:0016020  3.1e-17   membrane
KEGG pathway  tca:661125  7e-91
              K00411 (RIP1, UQCRFS1, petA) maps to:
                  Huntington's disease
                  Oxidative phosphorylation
                  Alzheimer's disease
                  Cardiac muscle contraction
                  Parkinson's disease
InterPro domain  [3-271]    IPR014349  1.1e-128  Rieske iron-sulphur protein
                 [112-271]  IPR006317  3.8e-54   Ubiquinol-cytochrome c reductase, iron-sulphur subunit
                 [133-271]  IPR017941  2.4e-44   Rieske [2Fe-2S] iron-sulphur domain
                 [76-132]   IPR004192  1.3e-25   Ubiquinol cytochrome reductase, transmembrane domain
                 [209-220]  IPR005805  3.1e-17   Rieske iron-sulphur protein, C-terminal
Orthology group  MCL14361  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210917-TA
ATGACTTCTGTGGCTCGAGCGGGGCACTTAGCTCCATTTTTTAAGGCCTCTTGCTCTATCACTAATAATGGCCTGAAGCCCGTTGTAGCAATCGCAGCCCCAGCAGAAAAACTGGTCGTCCAACCTCTGCCAAAAACTTCGACTGTTCAGTCACTTCATGGGGCCCTGCCCGTACAGAGTTTGAAAGTAAGACACGGAAATTTCGCCCCAACTCAAGTACGTTTTGCGCATACTGATATCGCTTATCCCGACTTTTCCGCCTATCGTCGCAAGGAGAGCCAGGACCCAAAGGTTAAGGCCGATGACAGTGTCGATCGCCGCCAAGCCTTCACCTACCTTATTGCAGGAGCGGGATCTGTTGCGAGTGCCTATGCTGCCAAGTCTGTGGTCACACAGTTTGTGTCATCGATGGCGGCTGCCGCCGACGTCTTGGCTTTAGCTAAGATTGAAATTAAACTTGGTGATATTCCCGAAGGAAAGTCTGTTACCTTCAAATGGCGTGGAAAGCCACTGTTCATTCGTCACAGGACAGCAAACGAAATCTCAACCGAGAAGGCAGTCCCTGTAGACCAGCTTCGAGACCCCGAACATGATGATCAACGCACCCAAGACCCTAAATGGCTTGTGGTGATCGGTGTGTGCACCCACTTAGGTTGTGTGCCTGTTGCAAACGCTGGAGATTTTGGCGGTTACTACTGCCCTTGCCACGGTTCCCATTATGATGCTTCTGGCCGCATTCGCAAAGGCCCAGCACCTCTCAACCTTGAAGTACCCCCGCACACGTTTGTAGATGAAAGCCTCCTAGTTGTAGGATAA

Protein sequence:

>DPOGS210917-PA
MTSVARAGHLAPFFKASCSITNNGLKPVVAIAAPAEKLVVQPLPKTSTVQSLHGALPVQSLKVRHGNFAPTQVRFAHTDIAYPDFSAYRRKESQDPKVKADDSVDRRQAFTYLIAGAGSVASAYAAKSVVTQFVSSMAAAADVLALAKIEIKLGDIPEGKSVTFKWRGKPLFIRHRTANEISTEKAVPVDQLRDPEHDDQRTQDPKWLVVIGVCTHLGCVPVANAGDFGGYYCPCHGSHYDASGRIRKGPAPLNLEVPPHTFVDESLLVVG-