Monarch geneset OGS2.0

DPOGS203997
TranscriptDPOGS203997-TA1221 bp
ProteinDPOGS203997-PA406 aa
Genomic positionDPSCF300005 + 1461325-1462545
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0138640.089.16% 
BombyxBGIBMGA002989-TA0.077.78% 
DrosophilaCG1970-PA6e-15260.05% 
EBI UniRef50UniRef50_O753061e-13757.91%NADH dehydrogenase [ubiquinone] iron-sulfur protein 2, mitochondrial n=824 Tax=cellular organisms RepID=NDUS2_HUMAN
NCBI RefSeqXP_308864.31e-15160.55%AGAP006891-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582866832e-15060.55%AGAP006891-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582866837e-14760.55%AGAP006891-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00166511.4e-157oxidoreductase activity, acting on NADH or NADPH
GO:00551141.4e-157oxidation-reduction process
GO:00512872.1e-104NAD binding
GO:00480382.1e-104quinone binding
KEGG pathwayaga:AgaP_AGAP0068913e-151 
 K03935 (NDUFS2)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[22-406] IPR0102191.4e-157NADH dehydrogenase I, subunit D
[136-406] IPR0011352.1e-104NADH-quinone oxidoreductase, subunit D
Orthology groupMCL26564 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203997-TA
ATGGAGAAGTATTATAGGGTAATATATCATGGCAGAGTAAGACCGGTGGAGCGTAAATTACGAAACATGTGGATAAACTTTGGACCTCAACATCCTGCTGCACATGGAGTGCTTCGACTTATTTTGGAATTAGATGGCGAATTAGTTGTCAGAGCAGATCCTCATATTGGTTTTCTACACAGAGCTAGTGAAAAATTAATGGAACACAAACATTATACACAAAGTTTGCCATATGTTGATCGCTTTGATTATGTGTCAACCCTAGCAAATGAACAAGGATTTGCAATTGCTGTCGAAAGACTTCTTAACATAGAAGTTCCTCCGAGAGCTCAAGCTATAAGAGTATTGTGTAGTGAACTTTCTCGCATAGCTAATCATCTATTAAATATTTCTGGCACTATTCTTGATGCAGGAGGAATAACACCATTTTTTTGGATGTGTGAGGAGCGAGAGAAGATATATGAGTTATCTGAACGACTTTGTGGTGCTCGAATTCATTGTGCTTATGTCAGACCAGGAGGTGTATCCCAAGACATTCCTATAGGTTTCATGGATGATATACATGAGTTTTGTATGAAACTCGGTGAACGGTGTGACGAAACTGAAGATATCGCTACTGGTAATAGGTTGTATTACGCAAGAACTGCAGGGGTTGGCGTTGTTACTGCTCACGATGCTATATATCATGGCTTTAGTGGACCAATGCTTAGAAGTACAGGAGTTAAGTGGGATTTAAGAATTGCATTTCCTTACGATGGTTACGATCTTTATGACTTTGACGTTCCCATAGGCACTTTTGGGGACAGTTTTGATAGACATCTTCTCCGTTTAATGGAATTACGGCAATCAATTCGAATAATTAACCAAGTAATTGACACGATGCCAGAAGGCGAAGTTAGAACAGACGATTCTAAAGTTTCACCGCCATTGAGATCAGAAATGAAAACTTCTATGGAAGCGCTTATTCATCACTTTAAATTATGTAGCGAAGGCTACGTTGTTCCTCCAGGAGCAACCTATACTGGTGTTGAATGTCCTAAGGGAGAATTAGGTTTCTATATGGTTGGAGATGGTACTTCTAAGCCATATCGAGTTGGTATACGATCTTGTTCTTATAACCATCTAGCGGGCATTGCATTTATGGGTAAAGGTTTACTTCTCGCTGATATATCTATTCTTATTGCAACCATCGATATTGTGTTTGGAGATATCGACCGTTAA

Protein sequence:

>DPOGS203997-PA
MEKYYRVIYHGRVRPVERKLRNMWINFGPQHPAAHGVLRLILELDGELVVRADPHIGFLHRASEKLMEHKHYTQSLPYVDRFDYVSTLANEQGFAIAVERLLNIEVPPRAQAIRVLCSELSRIANHLLNISGTILDAGGITPFFWMCEEREKIYELSERLCGARIHCAYVRPGGVSQDIPIGFMDDIHEFCMKLGERCDETEDIATGNRLYYARTAGVGVVTAHDAIYHGFSGPMLRSTGVKWDLRIAFPYDGYDLYDFDVPIGTFGDSFDRHLLRLMELRQSIRIINQVIDTMPEGEVRTDDSKVSPPLRSEMKTSMEALIHHFKLCSEGYVVPPGATYTGVECPKGELGFYMVGDGTSKPYRVGIRSCSYNHLAGIAFMGKGLLLADISILIATIDIVFGDIDR-