Monarch geneset OGS2.0

DPOGS213661
TranscriptDPOGS213661-TA942 bp
ProteinDPOGS213661-PA313 aa
Genomic positionDPSCF300219 - 534895-544816
RNAseq coverage187x (Rank: top 49%)
Annotation
HeliconiusHMEL0157252e-13874.36% 
BombyxBGIBMGA010611-TA6e-13878.08% 
DrosophilaCG9743-PA5e-9154.42% 
EBI UniRef50UniRef50_Q6US805e-14176.62%Desaturase n=3 Tax=Endopterygota RepID=Q6US80_SPOLI
NCBI RefSeqXP_967943.12e-11165.55%PREDICTED: similar to stearoyl-coa desaturase [Tribolium castaneum]
NCBI nr blastpgi|345386512e-14076.62%desaturase [Spodoptera littoralis]
NCBI nr blastxgi|345386518e-14276.62%desaturase [Spodoptera littoralis]
Group
Gene OntologyGO:00551149.2e-47oxidation-reduction process
GO:00167179.2e-47oxidoreductase activity, acting on paired donors, with oxidation of a pair of donors resulting in the reduction of molecular oxygen to two molecules of water
GO:00066292.9e-17lipid metabolic process
KEGG pathwaytca:6563136e-111 
 K00507 (SCD, desC)maps-> Biosynthesis of unsaturated fatty acids
    PPAR signaling pathway
InterPro domain[14-34] IPR0158769.2e-47Fatty acid desaturase, type 1, core
[45-251] IPR0058042.9e-17Fatty acid desaturase, type 1
Orthology groupMCL16313 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213661-TA
ATGAAATATGAAGCTCTAGAATTTAAACCACGAATTAAATGGCCGGATTTATTAGTTCAGCTGTGCTTACATTCGACAACAGTTTATGGCTTCTACCTCATACTAGTTAATCGAGTGAAATTATATACTATATTATTTGTTTTTTTTACTATATACACATCGGGCTTCGGTATTACAGCCGGCGTGCACAGGCTGTGGTCGCACAGAGCGTACAAAGCGAATCTGCCTTTAAGAATTTTACTCGCACTGCTATTCACTGTCACTGGACAGCGCGATATTTATACATGGGCCTTGGATCATCGGGTTCATCATAAGTACACTGAAACCGTGGCGGATCCTCACGACGTGAGACGTGGGTTCTGGTTCGCTCACGTTGGCTGGCTGGTGCTGACCCCCCACCCTGCCGTTGAAGATCGAAGGATCTCATTGAGAAAAACGTCGCTGGACTTGCTCGCTGATCCAGTTGTTAGATGGCAGAAAATATTTTTTATTCCATTATTTGTACTACTGAATGTCTTCCTACCGATCGCTATTCCTGTTTATTTTTGGCACGAGAGCTACATCAACAGTTTCGTGTTGAGCTTCGTTACGAGATTCACAATCACGCTGAACATTGCCTACAGCGTGAACAGTTTCGCTCACATGTGGGGAAATAAACCTTACGACAGATTCATAAAATCTGTGGAGAACAGAATAGTCAGTTTGGCAGCTTTGGGCGAGGGGTGGCACAATTACCACCACGTGTTCCCCTGGGACTATCGGACTTCGGAACTAGGAATGATTAATATATCCACAACCTTCATAGATGCGTTCGCAAAAATTGGATGGGCTTATGATCTGAAAGTAGCTACGAATGAGATGATAAGGAACCGAGCGAGACGGAATGGTGATAGAAGTCTGCGACATCTTGAGGAACCAGATCCGAGTGTCTCATCCGAATAG

Protein sequence:

>DPOGS213661-PA
MKYEALEFKPRIKWPDLLVQLCLHSTTVYGFYLILVNRVKLYTILFVFFTIYTSGFGITAGVHRLWSHRAYKANLPLRILLALLFTVTGQRDIYTWALDHRVHHKYTETVADPHDVRRGFWFAHVGWLVLTPHPAVEDRRISLRKTSLDLLADPVVRWQKIFFIPLFVLLNVFLPIAIPVYFWHESYINSFVLSFVTRFTITLNIAYSVNSFAHMWGNKPYDRFIKSVENRIVSLAALGEGWHNYHHVFPWDYRTSELGMINISTTFIDAFAKIGWAYDLKVATNEMIRNRARRNGDRSLRHLEEPDPSVSSE-