Monarch geneset OGS2.0

DPOGS211948
Transcript        DPOGS211948-TA  1188 bp
Protein           DPOGS211948-PA  395 aa
Genomic position  DPSCF300011 + 939166-941705
RNAseq coverage   2153x (Rank: top 6%)
Annotation
Heliconius      HMEL017761        1e-177  75.19%
Bombyx          BGIBMGA000897-TA  4e-180  75.51%
Drosophila      CG6020-PA         2e-135  66.48%
EBI UniRef50    UniRef50_B4LGC0   2e-136  67.78%  GJ11526 n=6 Tax=Endopterygota RepID=B4LGC0_DROVI
NCBI RefSeq     XP_002008276.1    2e-138  68.06%  GI13402 [Drosophila mojavensis]
NCBI nr blastp  gi|195127640      5e-137  68.06%  GI13402 [Drosophila mojavensis]
NCBI nr blastx  gi|195127640      1e-134  68.06%  GI13402 [Drosophila mojavensis]
Group
Gene Ontology     GO:0005488  4.6e-28  binding
                  GO:0044237  1.7e-12  cellular metabolic process
                  GO:0003824  1.7e-12  catalytic activity
                  GO:0050662  1.7e-12  coenzyme binding
KEGG pathway      dmo:Dmoj_GI13402  7e-138
                  K03953 (NDUFA9) maps to:
                      Huntington's disease
                      Oxidative phosphorylation
                      Alzheimer's disease
                      Parkinson's disease
InterPro domain   [56-294]  IPR016040  4.6e-28  NAD(P)-binding domain
                  [56-267]  IPR001509  1.7e-12  NAD-dependent epimerase/dehydratase
Orthology group   MCL13124  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211948-TA
ATGGCTTCACTAGCTTTTACAGGCAATTTGGTCTCCAAAATCATTCCTAAAAAGGGACATATAAGCATTGTTTATATACAATCTGGGCAATACAGCAGTGATCAAAATCTAGCTGCTTATAAACGTGGTACCGGTGGCCGCAGCAGCTTCAATGGGGTAGTAGCTACTGTCTTTGGATGCACAGGATTTGTTGGTCGCTATGTGTGCAACAAACTTGGCAAGATTGGAACTCAGATGATTTTGCCTTATAGAGGAGACTTCTATGATGCAGCTCGTTTGAAAGTGTGCGGTGACTTGGGACAGGTTTTGTTTACACCTTATGACCTTAGAGATGAAGAGTCCATCAGGAGAGCAGTGAAATACTCCAACGTTGTCATCAATCTTGTTGGAAGAGACTATGAAACCAAAAACTTTAAATACAAGGATGTCCATGTGGATGGACCACGCCTTCTCGCCAGAATCAGCAGAGAAATGGGTGTGGAGAGATTCATTCACCTCTCTTACTTGAATGCTGAACCCAATCCCAAACCTTTAGTGATGAAGTCACCGTCCATGTTCAAGGTATCCAAGTACCACGGTGAGCTGGCCGTCAGAGAAGAGTTCCCTACAGCCACCATCATCCGAGCCTCCGACATTTATGGTTCAGAAGATAGATTTATTAGGTCGATAGCGTCAATCTGGCGTCGTCACAACCGCTACATGCCGCTGTACCGCCACGGCATGGACACGGTGAAACAGCCCGTGTACGTGTCGGACGTGGCGCAGGGCATCGTGAACGCGGCGCGGGACCCCGACACGCGCTGCCAGGTCTACCAGGCGGTGGGGCCCAAAAGATATTTACTGCATGATCTAGTCTGGTGGTTCTTCCGGTTGATGCGTAAAGATGAGAAGTGGGGCTTCAAAACGTTTGATATGAAGTATGACCCTGTCCTCTCTATCAAAGTGGCTATGGCTAACATGTCTCCTGCTTATCCCTTTGGGTCTCTTCATTGGGAGGGTCTGGAGAAGGAAGCGACCACAGATAATGTTGTGAAAGGTGTTCCAACTCTCGAGGACTTGAATGTGACTCTCACACACATGGAAGACCAAGTTCCGTGGGAGCTCAGACCCTACCGCGCCCACCAGTACTACATCGATCAAATTGGAGAGTTCCCAAGACCACCAAACCCAAAAGTTGTTAATTATTAA

Protein sequence:

>DPOGS211948-PA
MASLAFTGNLVSKIIPKKGHISIVYIQSGQYSSDQNLAAYKRGTGGRSSFNGVVATVFGCTGFVGRYVCNKLGKIGTQMILPYRGDFYDAARLKVCGDLGQVLFTPYDLRDEESIRRAVKYSNVVINLVGRDYETKNFKYKDVHVDGPRLLARISREMGVERFIHLSYLNAEPNPKPLVMKSPSMFKVSKYHGELAVREEFPTATIIRASDIYGSEDRFIRSIASIWRRHNRYMPLYRHGMDTVKQPVYVSDVAQGIVNAARDPDTRCQVYQAVGPKRYLLHDLVWWFFRLMRKDEKWGFKTFDMKYDPVLSIKVAMANMSPAYPFGSLHWEGLEKEATTDNVVKGVPTLEDLNVTLTHMEDQVPWELRPYRAHQYYIDQIGEFPRPPNPKVVNY-
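
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a complete coding sequence read in frame from its first base, that the standard genetic code (NCBI translation table 1) applies, and that the trailing "-" in the protein record marks the stop codon. The function name `translate` and its `stop_symbol` parameter are illustrative and not part of the database. Only the first eight codons and the last five codons of DPOGS211948-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), built from the
# conventional T, C, A, G ordering of bases at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, one codon at a time.

    Stop codons are written as `stop_symbol` (assumed to be "-" here,
    matching the last character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last five codons of DPOGS211948-TA, copied from the record.
cds_start = "ATGGCTTCACTAGCTTTTACAGGC"
cds_end = "GTTGTTAATTATTAA"

print(translate(cds_start))  # compare with the start of DPOGS211948-PA
print(translate(cds_end))    # compare with the end of DPOGS211948-PA

# Length bookkeeping: 1188 bp is 396 codons, i.e. 395 residues plus one stop.
transcript_bp = 1188
print(transcript_bp // 3 - 1)
```

Under these assumptions the two fragments translate to the first and last residues of the protein record (MASLAFTG and VVNY followed by the stop marker), and the 1188 bp transcript length agrees with the listed 395 aa.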