Monarch geneset OGS2.0

DPOGS204541
TranscriptDPOGS204541-TA1062 bp
ProteinDPOGS204541-PA353 aa
Genomic positionDPSCF300297 - 23017-24899
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0120522e-10551.01% 
BombyxBGIBMGA005471-TA0.082.68% 
Drosophiladesat1-PA8e-10752.37% 
EBI UniRef50UniRef50_D2KCJ11e-12155.91%Acyl-CoA delta-9 desaturase isoform n=6 Tax=Neolepidoptera RepID=D2KCJ1_9NEOP
NCBI RefSeqXP_002053763.12e-10754.14%GJ23166 [Drosophila virilis]
NCBI nr blastpgi|2813334354e-12155.91%acyl-CoA delta-9 desaturase isoform [Hepialus pui]
NCBI nr blastxgi|2813334351e-12256.16%acyl-CoA delta-9 desaturase isoform [Hepialus pui]
Group
Gene OntologyGO:00551141.4e-61oxidation-reduction process
GO:00167171.4e-61oxidoreductase activity, acting on paired donors, with oxidation of a pair of donors resulting in the reduction of molecular oxygen to two molecules of water
GO:00066296.2e-11lipid metabolic process
KEGG pathwaydvi:Dvir_GJ231667e-107 
 K00507 (SCD, desC)maps-> Biosynthesis of unsaturated fatty acids
    PPAR signaling pathway
InterPro domain[44-64] IPR0158761.4e-61Fatty acid desaturase, type 1, core
[72-283] IPR0058046.2e-11Fatty acid desaturase, type 1
Orthology groupMCL23318 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204541-TA
ATGCCTCCTCAAGCCGACCCTACTCCGAGTGGAGTGCTATTTGAAACCGACACCCAAACTCCAGACCTAGGTCTTGATAAAGATGTATCGATTTTAAAGAAAGCCAGACCTAGGAAATATGAATACGTTTGGTTTAATATTATGTGGTTCTTATTCTTACACGTCGCATCCGCTTACGGTATTTATTTAGTTTTCACTTCCGCAAAATGGCAGACTAATGTTTTTGCATTCCTTGTACATCTTTGTTGCGCTATTGGTATTGGCGCCGGATCTCATAGGCTGTGGACACATCGAAGTTTTAAAGCAAAAACTCCTCTTCGCATTCTTCTTATGATTTGGCAGACTATGGGTTTCCAAGACTGTATATTTGAATGGGCTCGGGACCATCGTACTCATCACAAGTACGCAGACACGGATGCTGATCCTCATAACGCTGAGCGTGGTCTTTTCTACTCACACATGGGCTGGCTTTGTGTTAAGAAATCCCCAGAAGTATTAGAAGGTGGACGCCGCATAGAACTTAAAGACCTCTACGACGATCCAGTCGTAATGCATTACTTGAAAATGATGCCAATCCTGTGCTTCGTGTTTCCGACTATCATTCCTGTGTATTTCTGGAACGAGACGTGGTTGAACGCTTTCCTAATCCCCACAATTCTGCGGTACACGTGCGGCATTAATATAGTGTGGAGCGTAAACAGTTTTGCTCACGTCTTTGGCTACCGACCATATGACAAGTCGTTGAACCCACGAGAGAACATCGCAGTATGGATGTTCTGTGTAGAAGGCTTTCATAATTATCACCACACTTTCCCCTGGGACTACCGCGCGACTGAACACCCGATACTTAATATGCTGACTCCCACAATCATGTTCATTGACCTTATGGCTAAGCTGGGACAAGCTTACGATCTGAAAACGGTAACACCAGAAATAATCAAACAAAGAGCTCAACGCACTGGCGACGGTACACATCATCTGTGGGGTTGGGATGATCCCGAATTTACTGAGAAAATGAAGTTACGCTACGGAATAACAAATTCCGAGAGAAAAACAGCGTAA

Protein sequence:

>DPOGS204541-PA
MPPQADPTPSGVLFETDTQTPDLGLDKDVSILKKARPRKYEYVWFNIMWFLFLHVASAYGIYLVFTSAKWQTNVFAFLVHLCCAIGIGAGSHRLWTHRSFKAKTPLRILLMIWQTMGFQDCIFEWARDHRTHHKYADTDADPHNAERGLFYSHMGWLCVKKSPEVLEGGRRIELKDLYDDPVVMHYLKMMPILCFVFPTIIPVYFWNETWLNAFLIPTILRYTCGINIVWSVNSFAHVFGYRPYDKSLNPRENIAVWMFCVEGFHNYHHTFPWDYRATEHPILNMLTPTIMFIDLMAKLGQAYDLKTVTPEIIKQRAQRTGDGTHHLWGWDDPEFTEKMKLRYGITNSERKTA-