Monarch geneset OGS2.0

DPOGS211615
TranscriptDPOGS211615-TA1926 bp
ProteinDPOGS211615-PA641 aa
Genomic positionDPSCF300232 + 271941-273866
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0140120.070.94% 
BombyxBGIBMGA005568-TA0.068.03% 
DrosophilaND75-PB7e-17048.37% 
EBI UniRef50UniRef50_Q945111e-16748.37%NADH-ubiquinone oxidoreductase 75 kDa subunit, mitochondrial n=30 Tax=cellular organisms RepID=NDUS1_DROME
NCBI RefSeqXP_973797.10.050.93%PREDICTED: similar to NADH-ubiquinone oxidoreductase 75 kDa subunit, mitochondrial precursor (Complex I-75kD) (CI-75kD) [Tribolium castaneum]
NCBI nr blastpgi|910882350.050.93%PREDICTED: similar to NADH-ubiquinone oxidoreductase 75 kDa subunit, mitochondrial precursor (Complex I-75kD) (CI-75kD) [Tribolium castaneum]
NCBI nr blastxgi|910882351e-17451.08%PREDICTED: similar to NADH-ubiquinone oxidoreductase 75 kDa subunit, mitochondrial precursor (Complex I-75kD) (CI-75kD) [Tribolium castaneum]
Group
Gene OntologyGO:00166511.7e-166oxidoreductase activity, acting on NADH or NADPH
GO:00551141.7e-166oxidation-reduction process
GO:00515361.7e-166iron-sulfur cluster binding
GO:00164911.4e-24oxidoreductase activity
KEGG pathwaytca:6626180.0 
 K03934 (NDUFS1)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[1-542] IPR0102281.7e-166NADH:ubiquinone oxidoreductase, subunit G
[229-377] IPR0066561.4e-24Molybdopterin oxidoreductase
[37-77] IPR0195743.5e-18NADH:ubiquinone oxidoreductase, subunit G, iron-sulphur binding
Orthology groupMCL10865 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211615-TA
ATGTGCCTTGTGGAGATTGAAGGCATGTGGAAGCCACAAATCGCCTGTGCCATGCCAGTTGCTAAGAATATGAAAATAAGAACGAACTCTGAAGTGGCTTACAAGGCACAAGAGAGTGTTTTAGAATTTCTTTTAACTGATCATCCTCTGGATTGTCCTATTTGTGATCAAGGAGGCGAATGCGACCTTCAAGATCTGTCTATGAAGTTCGGAAGTGACAGGACTCGGTTCACTGATATACATTTTGAAGGGAAACGGGCTGTAGAAAACAAAAACCTAGGGCCATTGATACGAACAGAGATGACTAGATGTATTCATTGCACGAGATGTATCAGATTTGCGTCTCAGGTTTGTGGATTGGATGTTTTGGGCACAGCGGGTCGTGGTGGCGAAATGCTCGTCGGAACATATGTTGACAAAATGTTTTTGTCGGAGCTATCTGGTAACATTATCGATTTGTGTCCCGTCGGCGCTCTGACTAACAAACCGTACATGTTCAAAGCTCGTCCGTGGGAAATTAATAGAGCTAATTCCATAGATGTTACAGATGCTACCGGAACAAATATAAGCGTGAATTATAGATTCAATAGAGTTTTAAGAGTTTTGCCCAGAGAAAATGAAGAAGTGAATCAAGAATGGCTGTCTGACAAGGGGCGTTGGTCTATAGACTCTTTAGATATACAAAGGCTTGTAACCCCTATGTACAAATGTAACGACTGCTTGGTGGCCACAGAATGGGATATTGTATTGAAAATGGTGTCAAAACAATTAAAATGTACTAAGCCTTTTGATATTATGGCCATAGCTGGTCCGTATTGTAATGCAGAAACATTAGTAGCTACAAAAGATTTGCTTAATGTACTTGGTTCGGAGCACACCTACATAGAAAGAAATGTTGATTATTCTGAAGCAGTGGTTGATATAAGAGCTTCATATTCTTTAAATATATGCATGAAAAATATTGCATTATCTGATAAAATTTTGTTAGTCGGCACAAATCCGCGATTTGAAGCCCCTGTATTAAATGCTCGAATAAGGCAAGCGTATATGTTTAATGAATGTGACGTTTACGTTATTGGACCTAAATGCGAATACAATTATCACGTTGAATACGTTGGACAAAGTGTGAAAGACTTATCAAATGCTAAAAAATACTTACAAAATGCAAAGTCTCCTCTGATTTTTGTTGGTATTGACCAGTTGCAAACCCCAAATGCTGTTACTTTGATGAGAGAGCTTATAAGAATATCTAATTCTCTAAAGAAGTCAAGCGATTGGAAAGTCTTGAATATATTACCAAAGGAAGCTAGCTTTGCTGGTGCTCTGGAGGCAGGATGGAAACCTGGAGGTTTAAAGGCAATACAATCTTTAAAACCGCGCGTCATTTTATCTTTAGGTGCTGATGATGTTTTTAGACAATGGTCTCCTCCGAAGGATTGTACTGTTATTTACATTGGCTTTCAAGGAGATAGCGGTGCCGCATGTGCATCAATTATTCTACCAGGAAGTGCTTACACCGAAGCTGGTGGAACTTTTCTGAACATGGAATGTCGTTCTCAATACGCCCAGCCGGCTGTGAGTCCGCCTGGAAAAGCCCGTTATGATTGGAAAATTATTCGCGCAATAGCTGAATACTGCCAAATATGCCTATTTTATACCGATCCAGATTCAATATGTAGGCGATTAGCACAAATAAGTCCAAACTTCGTCAGTTGCGGCACGTGTCAAAAAAGACTTTTTGAAGATCTAATTCCTCAATTGCTGGCTAATGATAGCTCTGACTGCGTCGGACAACCTTTTGATGTGTGTATGAAAAAATTGAAAGAATATTATTGTTCCGACATTTATACTAGCAACTCTCCAACAATGGTAAAAGCTCAAAAGGCTGTCATGAAAATGGAAAAGTCTCCTTACTGTCATGTTTAA

Protein sequence:

>DPOGS211615-PA
MCLVEIEGMWKPQIACAMPVAKNMKIRTNSEVAYKAQESVLEFLLTDHPLDCPICDQGGECDLQDLSMKFGSDRTRFTDIHFEGKRAVENKNLGPLIRTEMTRCIHCTRCIRFASQVCGLDVLGTAGRGGEMLVGTYVDKMFLSELSGNIIDLCPVGALTNKPYMFKARPWEINRANSIDVTDATGTNISVNYRFNRVLRVLPRENEEVNQEWLSDKGRWSIDSLDIQRLVTPMYKCNDCLVATEWDIVLKMVSKQLKCTKPFDIMAIAGPYCNAETLVATKDLLNVLGSEHTYIERNVDYSEAVVDIRASYSLNICMKNIALSDKILLVGTNPRFEAPVLNARIRQAYMFNECDVYVIGPKCEYNYHVEYVGQSVKDLSNAKKYLQNAKSPLIFVGIDQLQTPNAVTLMRELIRISNSLKKSSDWKVLNILPKEASFAGALEAGWKPGGLKAIQSLKPRVILSLGADDVFRQWSPPKDCTVIYIGFQGDSGAACASIILPGSAYTEAGGTFLNMECRSQYAQPAVSPPGKARYDWKIIRAIAEYCQICLFYTDPDSICRRLAQISPNFVSCGTCQKRLFEDLIPQLLANDSSDCVGQPFDVCMKKLKEYYCSDIYTSNSPTMVKAQKAVMKMEKSPYCHV-