Monarch geneset OGS2.0

DPOGS210834
TranscriptDPOGS210834-TA660 bp
ProteinDPOGS210834-PA219 aa
Genomic positionDPSCF300027 - 37441-39937
RNAseq coverage1902x (Rank: top 7%)
Annotation
HeliconiusHMEL0079772e-8781.82% 
BombyxBGIBMGA003924-TA2e-12091.86% 
DrosophilaND23-PA9e-9286.93% 
EBI UniRef50UniRef50_O002178e-8784.00%NADH dehydrogenase [ubiquinone] iron-sulfur protein 8, mitochondrial n=157 Tax=cellular organisms RepID=NDUS8_HUMAN
NCBI RefSeqNP_001040316.11e-11891.40%NADH dehydrogenase ubiquinone Fe-S 8 [Bombyx mori]
NCBI nr blastpgi|1140513722e-11791.40%NADH dehydrogenase ubiquinone Fe-S 8 [Bombyx mori]
NCBI nr blastxgi|1140513727e-11591.40%NADH dehydrogenase ubiquinone Fe-S 8 [Bombyx mori]
Group
Gene OntologyGO:00160202.4e-50membrane
GO:00166512.4e-50oxidoreductase activity, acting on NADH or NADPH
GO:00515392.4e-504 iron, 4 sulfur cluster binding
GO:00551142.4e-50oxidation-reduction process
GO:00515361.4e-23iron-sulfur cluster binding
GO:00164911.4e-23oxidoreductase activity
KEGG pathwaycqu:CpipJ_CPIJ0185903e-91 
 K03941 (NDUFS8)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[76-197] IPR0102262.4e-50NADH-quinone oxidoreductase, chain I
[78-196] IPR0122851.4e-23Fumarate reductase, C-terminal
Orthology groupMCL14963 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210834-TA
ATGTCTTTAGCAAAAATATTTTCAGTGTCTTCTCGAGTGAGGGCTAGTGGAGCTAGCAATGTTTTAGTACGTTATGCCAGTTCAGGTCAAGAAGGCAAAGTGGAAAAGGTGTATCCACCAAATGTACCTGGATATAAATATGTAAATGCTGAAGATCAGGATATGAGTTTCCGGGAAATGTCCAACAGAGCTGCGCAAACTCTGTTTTGGACTGAATTGGCAAGAGGTTTTGCTGTAACACTTGCACATGTATTTAAGGAACCGGCGACAATCAACTATCCTTTTGAAAAGGGTCCTTTGTCTCCAAGGTTCAGAGGTGAGCATGCATTGAGAAGATATCCATCTGGTGAAGAAAGATGCATTGCTTGTAAGCTGTGTGAAGCTATATGCCCAGCTCAGGCAATCACAATTGAAGCTGAAGAACGCAAAGATGGCTCACGTAGAACAACTAGATACGATATTGATATGACTAAATGCATCTACTGTGGATTCTGTCAGGAGGCTTGTCCAGTTGATGCGATCGTAGAAGGACCGAACTTTGAATTCTCGACCGAAACCCATGAAGAACTCCTTTACAATAAAGAGAAATTACTCTCAAACGGAGACAAATGGGAGAGTGAGATTGCAACTAACATCAGAGCCGATCACCTCTACCGTTAG

Protein sequence:

>DPOGS210834-PA
MSLAKIFSVSSRVRASGASNVLVRYASSGQEGKVEKVYPPNVPGYKYVNAEDQDMSFREMSNRAAQTLFWTELARGFAVTLAHVFKEPATINYPFEKGPLSPRFRGEHALRRYPSGEERCIACKLCEAICPAQAITIEAEERKDGSRRTTRYDIDMTKCIYCGFCQEACPVDAIVEGPNFEFSTETHEELLYNKEKLLSNGDKWESEIATNIRADHLYR-