Monarch geneset OGS2.0

DPOGS212154
TranscriptDPOGS212154-TA1059 bp
ProteinDPOGS212154-PA352 aa
Genomic positionDPSCF300038 + 572889-573947
RNAseq coverage108x (Rank: top 60%)
Annotation
HeliconiusHMEL0125356e-17682.05% 
BombyxBGIBMGA006734-TA1e-16377.49% 
DrosophilaCG12170-PA1e-12962.43% 
EBI UniRef50UniRef50_Q7QCU61e-14167.32%3-oxoacyl-[acyl-carrier-protein] synthase n=8 Tax=Endopterygota RepID=Q7QCU6_ANOGA
NCBI RefSeqXP_312104.43e-14267.32%AGAP002809-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123734441e-14168.64%hypothetical protein AND_17425 [Anopheles darlingi]
NCBI nr blastxgi|3838569601e-13669.60%PREDICTED: 3-oxoacyl-[acyl-carrier-protein] synthase, mitochondrial-like [Megachile rotundata]
Group
Gene OntologyGO:00090583.3e-187biosynthetic process
GO:00038243.3e-187catalytic activity
GO:00081522.6e-62metabolic process
KEGG pathwayaga:AgaP_AGAP0028097e-142 
 K09458 (fabF)maps-> Fatty acid biosynthesis
InterPro domain[2-351] IPR0007943.3e-187Beta-ketoacyl synthase
[154-352] IPR0160392.6e-62Thiolase-like
[2-200] IPR0160381.7e-61Thiolase-like, subgroup
[3-186] IPR0140303.7e-48Beta-ketoacyl synthase, N-terminal
[195-310] IPR0140316.6e-33Beta-ketoacyl synthase, C-terminal
[3-352] IPR0208417.5e-13Polyketide synthase, beta-ketoacyl synthase domain
Orthology groupMCL13666 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212154-TA
ATGATGTCAAAATCTAATTTAAAATTAATGGCCCCAGCTACTTGCTTAGCTCTTTTTGCCACTGAAGAAGCTTTAAATGATGCAAAGTGGGTGCCACAACTAGAATCAGATATGGAATCAACAGGAGTCACTTTAGGAATGGGTATGATAGATCTTAAGGATGTTTGTGATACAAATAATGCTCTACAATCAGGCTACAATAAAGTAAGTCCATTTTTCGTACCAAGAATACTACCAAACATGGCAGCAGGACATATAAGTATAAAATATGGATTTAGGGGGCCAAATCATGCTGTTTCAACAGCATGTGCAACTGGAGCACATTCTATTGGTGATGCCTTTAGATTTATCAGAAATGGAGATGCCGATGTCATGGTAAGTGGTGGAGCAGAGGCCTGTATAAGTCCATTAGCCATAGCAGGGTTCTGTCGGTTAAGAGCTTTAAGCACCTCATTCAATGATAAACCTACCATGGCTTCTAGGCCCTTTGATAAAAATAGAGATGGATTTGTTATGGGCGAGGGAGCTGCTGTTTTAGTTTTAGAAGAATACAAGCATGCATTGCAAAGAAATGCTAAGATTTATGCTGAAATTTTAGGATATGGTTTGTCGGGTGATGCGGCCCATATAACTACACCTCGGGTAGATGGCAGTGGTGCCATACTCTCCATGAACAGAGCCCTACAGGATAGTAATATAAGTTACAAAGAAATTTCATATATAAATGCACATGCAACATCAACGCCAGTCGGTGATGGAATAGAATCCACTGCTATTAGAACCTTGTTCAAAGAAAAAGCTAAAGACATACTGATCTCTTCAACAAAAGGAGCCCATGGACATTTACTTGGGGCTGCGGGAAATCTTGAGACAGCTTTCACCGTTCTAGCAATATCCGAGGGAGTTGTGCCTCCTACATTAAACTTAGACAATCCCCTTGATGATCTGAATTATGTAGCTAAAGTCCCACAGAAATGGACTAAAGATAGAAGAATAGCATTAAAAAACTCTTTTGGATTCGGCGGCACGAATGCAACACTCTGTATTTCCAGTGTTTAA

Protein sequence:

>DPOGS212154-PA
MMSKSNLKLMAPATCLALFATEEALNDAKWVPQLESDMESTGVTLGMGMIDLKDVCDTNNALQSGYNKVSPFFVPRILPNMAAGHISIKYGFRGPNHAVSTACATGAHSIGDAFRFIRNGDADVMVSGGAEACISPLAIAGFCRLRALSTSFNDKPTMASRPFDKNRDGFVMGEGAAVLVLEEYKHALQRNAKIYAEILGYGLSGDAAHITTPRVDGSGAILSMNRALQDSNISYKEISYINAHATSTPVGDGIESTAIRTLFKEKAKDILISSTKGAHGHLLGAAGNLETAFTVLAISEGVVPPTLNLDNPLDDLNYVAKVPQKWTKDRRIALKNSFGFGGTNATLCISSV-