Monarch geneset OGS2.0

DPOGS214731
TranscriptDPOGS214731-TA708 bp
ProteinDPOGS214731-PA235 aa
Genomic positionDPSCF300022 + 114026-117194
RNAseq coverage68x (Rank: top 67%)
Annotation
HeliconiusHMEL0085942e-10378.07% 
BombyxBGIBMGA005137-TA6e-9483.96% 
DrosophilaCG9172-PB4e-7960.71% 
EBI UniRef50UniRef50_G7YF611e-7072.39%NADH dehydrogenase (Ubiquinone) Fe-S protein 7 (Fragment) n=1 Tax=Clonorchis sinensis RepID=G7YF61_CLOSI
NCBI RefSeqNP_001040530.11e-9183.42%NADH-ubiquinone oxidoreductase 20 kDa subunit precursor [Bombyx mori]
NCBI nr blastpgi|1140531792e-9083.42%NADH-ubiquinone oxidoreductase 20 kDa subunit [Bombyx mori]
NCBI nr blastxgi|1140531791e-9083.42%NADH-ubiquinone oxidoreductase 20 kDa subunit [Bombyx mori]
Group
Gene OntologyGO:00166512.8e-106oxidoreductase activity, acting on NADH or NADPH
GO:00515392.8e-1064 iron, 4 sulfur cluster binding
GO:00551142.8e-106oxidation-reduction process
GO:00480381.9e-70quinone binding
GO:00081371.9e-70NADH dehydrogenase (ubiquinone) activity
KEGG pathwaydme:Dmel_CG91723e-77 
 K03940 (NDUFS7)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[83-234] IPR0144062.8e-106[NiFe]-hydrogenase-3-type complex, small subunit/NADH:quinone oxidoreductase, subunit NuoB
[85-225] IPR0061381.9e-70NADH-ubiquinone oxidoreductase, 20 Kd subunit
[97-226] IPR0061376.2e-43NADH:ubiquinone oxidoreductase-like, 20kDa subunit
Orthology groupMCL11080 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214731-TA
ATGATCAGAAAAGGATTTAATACGAGATGTTTACTATTCAGGAATTTTACTTCAAAAGCCAATGAGCCTAAGGAGCTGCAAAAGACAGAAGAGCCTCAGACGAAGAAAGAAGCAGATCCTTGTGCGGAAGTAAAGAAACCAAAAGCCAAGAAGTATCCACCCGTCAACCTGCATGGCCCGCAGAGCTTCAAGCTCAGTGAGAAGAGACCTTACTCGCCATTTCATTTCAAAGGACAGTCCACAGTGGAGTGGGTGGTCGCCCGAGCTGATGACATACTCAACTGGGGGAGGAAGAACTCCTTGTGGCCTCTGACCTTCGGTCTCGCCTGTTGCGCGCTAGAAATGATGCACTATGCCGGTCCCAGATACGACATGGATCGCTTCGGCATGGTGTTCCGTGGGACTCCACGTCAAACGGATGTGATCATCGTGGCGGGAACTGTCACCAACAAGATGGCTCCCATCTTGCGCAAGACCTACGACCTCATGCCGGATCCCAAATTCGTTGTGTCAATGGGTAGCTGTGCCAACGGCGGCGGCTACTACCACTACACGTACTCCACCGTGCGAGGAGCTGATAGAATCATACCAGTTGATATCTACGTCCCAGGCTGTCCTCCGTCGGCCGAGGCTCTGTTGTATGCGATGCTGCAGCTTCAGAAGAAAGTCAAACGTATGCGCATGGTTCAGACCTGGTACAGGCATTAA

Protein sequence:

>DPOGS214731-PA
MIRKGFNTRCLLFRNFTSKANEPKELQKTEEPQTKKEADPCAEVKKPKAKKYPPVNLHGPQSFKLSEKRPYSPFHFKGQSTVEWVVARADDILNWGRKNSLWPLTFGLACCALEMMHYAGPRYDMDRFGMVFRGTPRQTDVIIVAGTVTNKMAPILRKTYDLMPDPKFVVSMGSCANGGGYYHYTYSTVRGADRIIPVDIYVPGCPPSAEALLYAMLQLQKKVKRMRMVQTWYRH-