Monarch geneset OGS2.0

DPOGS204928
TranscriptDPOGS204928-TA1401 bp
ProteinDPOGS204928-PA466 aa
Genomic positionDPSCF300160 - 589472-595411
RNAseq coverage4270x (Rank: top 3%)
Annotation
HeliconiusHMEL0037410.092.49% 
BombyxBGIBMGA011131-TA0.091.99% 
DrosophilaCG1970-PA0.086.04% 
EBI UniRef50UniRef50_O753060.076.39%NADH dehydrogenase [ubiquinone] iron-sulfur protein 2, mitochondrial n=824 Tax=cellular organisms RepID=NDUS2_HUMAN
NCBI RefSeqNP_001040366.10.088.41%NADH dehydrogenase-ubiquinone Fe-S protein 2 precursor [Bombyx mori]
NCBI nr blastpgi|1140514470.088.41%NADH dehydrogenase-ubiquinone Fe-S protein 2 [Bombyx mori]
NCBI nr blastxgi|1140514470.088.41%NADH dehydrogenase-ubiquinone Fe-S protein 2 [Bombyx mori]
Group
Gene OntologyGO:00166511.9e-185oxidoreductase activity, acting on NADH or NADPH
GO:00551141.9e-185oxidation-reduction process
GO:00512872.2e-129NAD binding
GO:00480382.2e-129quinone binding
KEGG pathwaydgr:Dgri_GH239240.0 
 K03935 (NDUFS2)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[81-466] IPR0102191.9e-185NADH dehydrogenase I, subunit D
[196-466] IPR0011352.2e-129NADH-quinone oxidoreductase, subunit D
Orthology groupMCL11061 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204928-TA
ATGGTTTCGCTGGTAAATGTTTTGCTGCGGAAAACCGCAAAATTGGGTGCCAACAGAGCCCTACTGGCAAATTTACCAGGAGTTTACAATGCCAACTCTCAAAGAAATGCCCACCGTTGGATGCCAGACGAAGACTTCATTAAGCAGTTTGAGGGCGCCGTGTTGTACTCCGATGGCAGACCTGAACTCATGAAGCACCCACCTTACAATAGTATTGTGGCCCCAGCTGAAAAGCAGGTCAAGAACATGATCCTCAACTTTGGTCCGCAGCACCCGGCTGCTCACGGTGTGTTGAGACTCGTACTCGAACTGGATGGTGAGACTGTCCGCGGAGCCGATCCTCACATTGGTCTCCTTCACCGTGGTACGGAGAAGCTGATTGAGTACAAGACCTACACCCAGGCGTTGCCTTACTTCGACAGGCTGGACTATGTGTCTATGATGTGCAACGAACAATGTTACAGTCTGGCCGTTGAGAAACTGCTGAATATTGAAGCTCCCATCAGAGCCAAGTACATAAGAACTCTGTTTGCTGAGATAACCAGAATCCTGAACCACATAATGGCGGTGGGGACGCACGCCCTGGATGTGGGTGCTCTCACTCCGTTCTTCTGGCTGTTCGAGGAGAGAGAGAAGATGATGGAGTTCTACGAGAGAGTCAGCGGCGCCAGGATGCACGCGGCCTATATACGACCTGGAGGAGTGTCCTTGGATATGCCGTTAGGTCTGATGGATGACATATACGAGTTCGCGAGCAAGTTCGGCGAACGGCTCGACGAGGTGGAGGACGTGCTCACCACCAACAGGATCTGGGTTCAGCGGACCAAGGATGTGGGGGTGGTGACCGCACAAGACGCTCTCAACTACGGCTTCAGCGGTGTGATGCTGAGAGGGTCCGGCATCAAGTGGGATCTGAGAAAAACTCAGCCTTATGACGCTTACGATAAGGTCGAGTTTGATGTACCCATTGGAACCAACGGCGACTGTTATGACAGGTATCTGATCCGCGTGGAGGAGATGCGTCAGTCGCTCCGCATCATAGACCAGTGCCTGAACCAGATGCCGCCCGGAGAGGTGAAGACAGACGACGCCAAGCTCACCCCGCCCTCCAGGGAGGAGATGAAGACATCCATGGAGGCGTTGATCCACCACTTCAAGTTGTTCACCCAGGGGTACGCCGTGCCCCCCGGCGCTACGTACACCGCCGTAGAAGCACCCAAGGGAGAGTTTGGTGTCTACCTTGTATCTGACGGAGGATCGAAACCCTATAGATGCAAGATCAAAGCCCCCGGCTTCGCTCATCTGGCTGCATTGGAGAAGATCGGGAAGAATTCCATGTTGGCTGACATAGTGGCCATTATCGGAACGTTGGATGTAGTGTTCGGAGAAATAGATCGATAG

Protein sequence:

>DPOGS204928-PA
MVSLVNVLLRKTAKLGANRALLANLPGVYNANSQRNAHRWMPDEDFIKQFEGAVLYSDGRPELMKHPPYNSIVAPAEKQVKNMILNFGPQHPAAHGVLRLVLELDGETVRGADPHIGLLHRGTEKLIEYKTYTQALPYFDRLDYVSMMCNEQCYSLAVEKLLNIEAPIRAKYIRTLFAEITRILNHIMAVGTHALDVGALTPFFWLFEEREKMMEFYERVSGARMHAAYIRPGGVSLDMPLGLMDDIYEFASKFGERLDEVEDVLTTNRIWVQRTKDVGVVTAQDALNYGFSGVMLRGSGIKWDLRKTQPYDAYDKVEFDVPIGTNGDCYDRYLIRVEEMRQSLRIIDQCLNQMPPGEVKTDDAKLTPPSREEMKTSMEALIHHFKLFTQGYAVPPGATYTAVEAPKGEFGVYLVSDGGSKPYRCKIKAPGFAHLAALEKIGKNSMLADIVAIIGTLDVVFGEIDR-