Monarch geneset OGS2.0

DPOGS206088
TranscriptDPOGS206088-TA1212 bp
ProteinDPOGS206088-PA403 aa
Genomic positionDPSCF300028 - 45187-50527
RNAseq coverage149x (Rank: top 53%)
Annotation
HeliconiusHMEL0122674e-3130.25% 
BombyxBGIBMGA011217-TA3e-6653.11% 
DrosophilaCG4020-PA6e-3432.21% 
EBI UniRef50UniRef50_D9D5H49e-7251.23%Fatty-acyl CoA reductase 1 n=2 Tax=Obtectomera RepID=D9D5H4_OSTNU
NCBI RefSeqXP_967757.15e-5241.23%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|2984029093e-7151.23%fatty-acyl CoA reductase 1 [Ostrinia nubilalis]
NCBI nr blastxgi|2984029092e-7651.65%fatty-acyl CoA reductase 1 [Ostrinia nubilalis]
Group
Gene OntologyGO:00166202e-24oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
GO:00551142e-24oxidation-reduction process
KEGG pathwaynvi:1001176835e-36 
 K13356 (FAR)maps-> Peroxisome
InterPro domain[257-350] IPR0042622e-24Male sterility
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206088-TA
ATGGTGGGTTGGATCGAGAACCTCAACGGGCCGGTTGCTATATTGATAGCGAGTGGCAAAGGTATTCTTCACACAATGTACACAGACCCCAACCTCATCTCAGACTACATGCCAGTGGACATAGCTATAAAAGCTTTCATAGCTGCCGCGTGGGCAAGAGGTACCAAAAAGCTGGAACCAACAGACGACATACATCTGTATAACTGCTCATCAAGCGAAATCAAAGCGTTGACTATGGGTCAGATAGTCGAATTGGGGATGGAAATAAGTAAAAAGATCCCCTTGGATTCGTTAGTATGGCATCCCTGCGGAGGCTTAACTTCATCCAAGCTAGTTAATTACGTAAAGGTCCTGCTATTACACCTACTACCGGCTCTGTTAGTTGATGGGATTTTGAAACTAATAGGAAAGAAGCCCATGGCTGGCGACAGATTGGACTTGAATTTCTGGCTGGTTGAAGTTGAAGTTTTGTTTAGTATTCTTCACACTATGTACACAGACCCCAACCTCATCTCAGACTACATGCCAGTGGACATAGCTATAAAAGCTTTCATAGCTGCCGCGTGGGCAAGAGGTACCAAAAAGCTGGAACCAACAGACGACATACATCTGTATAACTGCTCATCAAGCGAAATCAAAGCGTTGACTATGGGTCAGATAGTCGAATTGGGGATGGAAATAAGTAAAAAGATCCCCTTGGATTCGTTAGTATGGCATCCCTGCGGAGGCTTAACTTCATCCAAGCTAGTTAATTACGTAAAGGTCCTGCTATTACACCTACTACCGGCTCTGTTAGTTGATGGGATTTTGAAACTAATAGGAAAGAAGCCCATGTTGACGAAGGTTCAGCGGCGTATATACGTAGCGAACTTAGCTCTAGAGTACTACGTCACCCAGCAGTGGACCTTCAAAAACGTGAACATAGTTAAACTTCGGTCGAAAATCAAGGAAGAGGATTTGAAAGAATTCTTCTATGAAATGGAAACTATTGATATACATGAATATTTCATGAACTCCTGCTACGGCGGAAAGCTGTACATATTGAAAGAGAAACTTGAAGATCTGCCGGCAGCGAGAATACATTACAGAAGAATGGAGCTCCTGCATAAAGTCGTGATGATAATCTTCAAGCTGTCTATCCTCTGGTTCATTTACAACACCAGCTTCTTCAGGGATGTCATGAATATGCTCTACGCCGTGTTTGTTGGTTGA

Protein sequence:

>DPOGS206088-PA
MVGWIENLNGPVAILIASGKGILHTMYTDPNLISDYMPVDIAIKAFIAAAWARGTKKLEPTDDIHLYNCSSSEIKALTMGQIVELGMEISKKIPLDSLVWHPCGGLTSSKLVNYVKVLLLHLLPALLVDGILKLIGKKPMAGDRLDLNFWLVEVEVLFSILHTMYTDPNLISDYMPVDIAIKAFIAAAWARGTKKLEPTDDIHLYNCSSSEIKALTMGQIVELGMEISKKIPLDSLVWHPCGGLTSSKLVNYVKVLLLHLLPALLVDGILKLIGKKPMLTKVQRRIYVANLALEYYVTQQWTFKNVNIVKLRSKIKEEDLKEFFYEMETIDIHEYFMNSCYGGKLYILKEKLEDLPAARIHYRRMELLHKVVMIIFKLSILWFIYNTSFFRDVMNMLYAVFVG-