Monarch geneset OGS2.0

DPOGS210986
TranscriptDPOGS210986-TA927 bp
ProteinDPOGS210986-PA308 aa
Genomic positionDPSCF300004 + 135840-137179
RNAseq coverage550x (Rank: top 23%)
Annotation
HeliconiusHMEL0250032e-11061.97% 
BombyxBGIBMGA006471-TA1e-12166.23% 
Drosophiladesat2-PA3e-8555.47% 
EBI UniRef50UniRef50_A4ZKB88e-9453.54%Desaturase n=8 Tax=Ostrinia RepID=A4ZKB8_OSTNU
NCBI RefSeqXP_313877.41e-9156.32%AGAP004572-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479721721e-9455.00%AGAP004572-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479721721e-9555.00%AGAP004572-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00551147.5e-45oxidation-reduction process
GO:00167177.5e-45oxidoreductase activity, acting on paired donors, with oxidation of a pair of donors resulting in the reduction of molecular oxygen to two molecules of water
GO:00066291.2e-18lipid metabolic process
KEGG pathwaydme:Dmel_CG59252e-83 
 K00507 (SCD, desC)maps-> Biosynthesis of unsaturated fatty acids
    PPAR signaling pathway
InterPro domain[19-39] IPR0158767.5e-45Fatty acid desaturase, type 1, core
[2-205] IPR0058041.2e-18Fatty acid desaturase, type 1
Orthology groupMCL18019 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210986-TA
ATGTGGACTTCATTACTTCTATTCACAAGTACAGAAGGTATCACCATTGGAGCACATAGACTATGGTCACACAGAACATTTAAAGCAACGCCTTTACTAAAGACCATTTTAATGATATTCCAAACTCTTGCTGGACAGAACTCAATTTTCACTTGGTGTCGAGACCACCGTCTACATCATCGCTACTCAGACACGGATGCGGATCCCCATAACGCAAAAAGAGGATTTTTTTTTAGCCATATTGGATGGCTCCTATGCAAGAAGCACCCATATGTAAAGGAATTAGGAAAAAGAATTGATATGAGTGACCTACAAAATGACTGGATGATAATGGCACAAAAAAAATATTACTACTACCTATATTTAATATTTGCTGTGATAATCCCTGTCTCAGTACCATACTATTATTTCGGAGAATCCCTAAAGAATTCGCTGTTGGTATGTTACTTCGCCAGATATGTTTTTCAATTAAATGGAACTTGGTTAGTTAATAGTGCCGCTCATCTCTATGGAACACGACCATATGATAAGAAACTTCAACCCGTTGAGTCTTGGTTCGTCTCGTTTATAAGTTTTGGTGAGGGTTGGCATAACTATCACCATGCATTCCCTTGGGACTACAAAGCTGCTGAGCTATCTATGCATTTCAATCAATCAGCTAAATTTATAAGGATCTTTGAAAAATTAGGACTGGCTTATGATTTGAAAACAGCATCGCCAGAAATGGTGCAACGTCGCATAATTCAAACAGGAGATGGAACGCATTACGCTCTCGGTAATGACGACGATAGAAATGCCGTCACTTGTATTGGTTATAAGCATCCTATAAATCCTACATATAATGTGAAATATCAAGCTCCTACTGCCACGCTGGGAAATAGAGGTTTACCCCTTAACCACCAAGATGATTACTTAGAACCTGAATAA

Protein sequence:

>DPOGS210986-PA
MWTSLLLFTSTEGITIGAHRLWSHRTFKATPLLKTILMIFQTLAGQNSIFTWCRDHRLHHRYSDTDADPHNAKRGFFFSHIGWLLCKKHPYVKELGKRIDMSDLQNDWMIMAQKKYYYYLYLIFAVIIPVSVPYYYFGESLKNSLLVCYFARYVFQLNGTWLVNSAAHLYGTRPYDKKLQPVESWFVSFISFGEGWHNYHHAFPWDYKAAELSMHFNQSAKFIRIFEKLGLAYDLKTASPEMVQRRIIQTGDGTHYALGNDDDRNAVTCIGYKHPINPTYNVKYQAPTATLGNRGLPLNHQDDYLEPE-