Monarch geneset OGS2.0

DPOGS206432
TranscriptDPOGS206432-TA633 bp
ProteinDPOGS206432-PA210 aa
Genomic positionDPSCF300070 - 889752-891051
RNAseq coverage4078x (Rank: top 3%)
Annotation
HeliconiusHMEL0127023e-8392.81% 
BombyxBGIBMGA005439-TA2e-11390.00% 
DrosophilaCG9172-PB7e-9076.85% 
EBI UniRef50UniRef50_O752512e-8085.35%NADH dehydrogenase [ubiquinone] iron-sulfur protein 7, mitochondrial n=652 Tax=root RepID=NDUS7_HUMAN
NCBI RefSeqNP_001040456.17e-11290.00%NADH-ubiquinone oxidoreductase Fe-S protein 7 [Bombyx mori]
NCBI nr blastpgi|1140521441e-11090.00%NADH-ubiquinone oxidoreductase Fe-S protein 7 [Bombyx mori]
NCBI nr blastxgi|1140521443e-11090.43%NADH-ubiquinone oxidoreductase Fe-S protein 7 [Bombyx mori]
Group
Gene OntologyGO:00166518.8e-129oxidoreductase activity, acting on NADH or NADPH
GO:00515398.8e-1294 iron, 4 sulfur cluster binding
GO:00551148.8e-129oxidation-reduction process
GO:00480381.7e-76quinone binding
GO:00081371.7e-76NADH dehydrogenase (ubiquinone) activity
KEGG pathwayame:4089091e-91 
 K03940 (NDUFS7)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[53-210] IPR0144068.8e-129[NiFe]-hydrogenase-3-type complex, small subunit/NADH:quinone oxidoreductase, subunit NuoB
[61-200] IPR0061381.7e-76NADH-ubiquinone oxidoreductase, 20 Kd subunit
[72-201] IPR0061379.8e-45NADH:ubiquinone oxidoreductase-like, 20kDa subunit
Orthology groupMCL11080 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206432-TA
ATGCAAGCTATCAAGGCTTTGGGGGCTGCAGCCCCGGCTACAGTTATGTCTGTAGTAAAAAAAGGTGCATTCACGCCTGTAGTGGAACAAGTACGTCGTACTCATTTGCCGGCACCAAACCCTAAGGAAAATAGACCATATTCTCCTTTCCAAGATACCAGTAATGCGGGAGAATATGCTGTGGCAAGATTAGATGATTTATTGAATTGGGGAAGGAAGGGTTCTATGTGGCCCATGACATTTGGATTAGCTTGTTGCGCTGTTGAGATGATGCACATTGCAGCCCCCCGATATGATATGGACAGATATGGTGTTGTGTTCCGAGCATCCCCTCGTCAGTCTGATGTCATGATAGTTGCTGGTACTTTAACCAACAAGATGGCTCCAGCTTTGAGGAAAGTGTATGATCAAATGCCCGAGCCGAGATGGGTTATATCTATGGGAAGTTGTGCCAATGGTGGTGGATATTATCACTACTCTTATTCTGTTGTAAGAGGTTGTGATCGTATTGTGCCAGTTGACATTTATGTTCCTGGTTGTCCACCGACTGCAGAAGCTTTGTTGTATGGTGTTTTACAATTGCAAAAGAAAGTAAAGAGAATGAAAACTGTTCAAGTATGGTACAGAAAGTAA

Protein sequence:

>DPOGS206432-PA
MQAIKALGAAAPATVMSVVKKGAFTPVVEQVRRTHLPAPNPKENRPYSPFQDTSNAGEYAVARLDDLLNWGRKGSMWPMTFGLACCAVEMMHIAAPRYDMDRYGVVFRASPRQSDVMIVAGTLTNKMAPALRKVYDQMPEPRWVISMGSCANGGGYYHYSYSVVRGCDRIVPVDIYVPGCPPTAEALLYGVLQLQKKVKRMKTVQVWYRK-