Monarch geneset OGS2.0

DPOGS215520
TranscriptDPOGS215520-TA1200 bp
ProteinDPOGS215520-PA399 aa
Genomic positionDPSCF300467 - 24632-26856
RNAseq coverage990x (Rank: top 13%)
Annotation
HeliconiusHMEL0096235e-17175.94% 
BombyxBGIBMGA014059-TA1e-18077.19% 
DrosophilaND42-PB1e-11453.80% 
EBI UniRef50UniRef50_P919292e-11253.80%NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 10, mitochondrial n=20 Tax=Endopterygota RepID=NDUAA_DROME
NCBI RefSeqXP_002104364.12e-11354.08%GD20916 [Drosophila simulans]
NCBI nr blastpgi|1955727624e-11254.08%GD20916 [Drosophila simulans]
NCBI nr blastxgi|1947430642e-11054.35%GF18063 [Drosophila ananassae]
Group
Gene OntologyGO:00055245.5e-103ATP binding
GO:00167735.5e-103phosphotransferase activity, alcohol group as acceptor
GO:00061395.5e-103nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
KEGG pathwaydan:Dana_GF180636e-113 
 K03954 (NDUFA10)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[1-398] IPR0158284.2e-148NADH:ubiquinone oxidoreductase, 42kDa subunit
[7-392] IPR0026245.5e-103Deoxynucleoside kinase
Orthology groupMCL14230 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215520-TA
ATGGCGACCTTTATTAGAACTACGTTTGTGAAGGTGATCACCCCTGTTCACGGTGGAAAAATCGCTGGATGTACCATTATTCAGAACAGGAGCATAATGGGCAAGGCTCTTCGAGAGTCTCTGCCGCCTCGACCCCCTAAACCCGCACCATTCGACTATGTCAACAAAGACTACACCTGGTTACGTAGCTTATTCGATCGTACCACCCATAGATTTGATGAAAATACGAAGGTTTTAGTAGTTGAAGGGCCAGTAGCTGCAGGAAAAACTGAATTCGCAGCCGCTCTCGCTGAGGATCTTGGTATGAAACATTTTCCAGAGGCCAACATGGATATTCACTACATAAGACCAAATGGTGTCGACCTTCGTATCTTTGATAAAGATATTCCAGAAGACACCAGAACCTTTGACCATGTGAACTTCAACTGTCATCCTAATCACCGTCTGGCTGGAAATTTCCAGATCATGATGTACATGGCCCGGTACGGCCAGTACATTGATGCTCTGGCTCATCTTCTTAATACCGGCCAAGGAGTTGTCCTTGAAAGGTCACCATACTCTGATTTTGTGTTCCTAGAAGCTATGTTCTCCCAAAAATATGTCAGCAGAGGCATCAAGTCTGTTTATTATGAGCTCAGAGCCAATACCATTGAGGAACTCATGAGACCACATCTGGTTATATATCTAGATGTGCCAGTTGACAAAGTAAGCGAAGCTATCAAGAAACGTGGGCTCAAACATGAAGTTGATGGTAAAGCCTTAACACCTGCATTCCTCACCGAGATGGAGCATCAGTACAAGAACAAATATCTGAGGGACATTGCTACTCATGCTGAACTTTTAGTTTACGACTGGACTGGCGGTGGTGATGTCGAAGTGGTTGTTGAGGACATTGAACGTCTGGACTTTGACAAGTACACCGAGAGAGAGGAGCCCAAGATGAAGGACTGGAGACTCCCGAGGGAGGTTGAGTGGGCCGACCAGAGGCAAATATTCACCAACAACAAACACTACCTCATGAACCTCTTCAACATACCAAGGACCGATGTACCCGAATTGATCACACAGGCTGATGACGGTTACATGAGGGATAAGGTAATCTACGACCATCCGGCCTTCCAATACACGGAGGGCTACACCCCGAATGACAAGGGACTGCTTCTCAAGAACAAAGTCCCTAAATACCATGAATTTGTCTAA

Protein sequence:

>DPOGS215520-PA
MATFIRTTFVKVITPVHGGKIAGCTIIQNRSIMGKALRESLPPRPPKPAPFDYVNKDYTWLRSLFDRTTHRFDENTKVLVVEGPVAAGKTEFAAALAEDLGMKHFPEANMDIHYIRPNGVDLRIFDKDIPEDTRTFDHVNFNCHPNHRLAGNFQIMMYMARYGQYIDALAHLLNTGQGVVLERSPYSDFVFLEAMFSQKYVSRGIKSVYYELRANTIEELMRPHLVIYLDVPVDKVSEAIKKRGLKHEVDGKALTPAFLTEMEHQYKNKYLRDIATHAELLVYDWTGGGDVEVVVEDIERLDFDKYTEREEPKMKDWRLPREVEWADQRQIFTNNKHYLMNLFNIPRTDVPELITQADDGYMRDKVIYDHPAFQYTEGYTPNDKGLLLKNKVPKYHEFV-