Monarch geneset OGS2.0

DPOGS209900
TranscriptDPOGS209900-TA801 bp
ProteinDPOGS209900-PA266 aa
Genomic positionDPSCF300049 + 395600-396400
RNAseq coverage1774x (Rank: top 7%)
Annotation
HeliconiusHMEL0067511e-14089.47% 
BombyxBGIBMGA014483-TA1e-13484.53% 
DrosophilaCG12079-PA4e-10376.65% 
EBI UniRef50UniRef50_O754894e-8765.82%NADH dehydrogenase [ubiquinone] iron-sulfur protein 3, mitochondrial n=131 Tax=cellular organisms RepID=NDUS3_HUMAN
NCBI RefSeqXP_316497.29e-10772.48%AGAP006456-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583887282e-10572.48%AGAP006456-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|583887284e-10378.60%AGAP006456-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00166517.8e-42oxidoreductase activity, acting on NADH or NADPH
GO:00551147.8e-42oxidation-reduction process
GO:00081377.5e-32NADH dehydrogenase (ubiquinone) activity
KEGG pathwayaga:AgaP_AGAP0064563e-106 
 K03936 (NDUFS3)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[89-206] IPR0102187.8e-42NADH dehydrogenase, subunit C
[97-201] IPR0012687.5e-32NADH:ubiquinone oxidoreductase, 30kDa subunit
Orthology groupMCL12867 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209900-TA
ATGTCTTTCTTTCTAAAACGCACCATTGGTGCAGGACGTAAACTAAGTCGGGCAATTTTGAATAATAACCAATACCCTGGTCTTCAACTTGTAGCAACAAAAACTGACCAAGTTCAACCTCAAGTCGAAACGCGACCGACTGTTGCCAAATTTGATCCATTGCAAAAAGCTCACCTAGTAGATTTTGGCAAATATGTAGCTGAATGCCTACCTAAATTCGTGCAGAAAGTTCAAATTACTGCAGGTAACGAACTTGAAGTTCTGGTGCCGACAGATGGTGTCATCCCTGTGCTTCAATTCCTTAAGGATCATCACAATGCACAGTTCGCAAATCTCGTGGATATTGGTGGCATGGATGTGCCTAGCCGACCCTACAGGTTCGAAATTATCTACAACCTACTGTCACTGCGCTACAATGCTCGAATCCGTGTGAAAACCTACACTGATGAACTGACACCAATCGATTCAGCTTGCGAAGTGTTCAAAGCTGCCAACTGGTATGAAAGAGAGATTTGGGACATGTACGGTGTCTTCTTCGCTAACCACCCAGACTTGAGAAGAATTTTAACTGACTACGGTTTTGAGGGTCACCCGTTCAGAAAGGACTTCCCCCTCAGTGGATATGTAGAATTGCGTTATGATGATGAACAGAAAAGGGTTGTGGTTGAACCATTGGAACTGGCCCAGGAGTTTAGGCGCTTCGAGTTAAGTGCACCCTGGGAGCAGTTCCCAAACTTCAGAGGAAATCCTGTGTCTGAGGATGTCGTAGATAAAACTGATGACCAACCCAAGAAAGAATAG

Protein sequence:

>DPOGS209900-PA
MSFFLKRTIGAGRKLSRAILNNNQYPGLQLVATKTDQVQPQVETRPTVAKFDPLQKAHLVDFGKYVAECLPKFVQKVQITAGNELEVLVPTDGVIPVLQFLKDHHNAQFANLVDIGGMDVPSRPYRFEIIYNLLSLRYNARIRVKTYTDELTPIDSACEVFKAANWYEREIWDMYGVFFANHPDLRRILTDYGFEGHPFRKDFPLSGYVELRYDDEQKRVVVEPLELAQEFRRFELSAPWEQFPNFRGNPVSEDVVDKTDDQPKKE-