Monarch geneset OGS2.0

DPOGS210607
TranscriptDPOGS210607-TA891 bp
ProteinDPOGS210607-PA296 aa
Genomic positionDPSCF300168 + 13935-15745
RNAseq coverage566x (Rank: top 22%)
Annotation
HeliconiusHMEL0059021e-3165.29% 
BombyxBGIBMGA014412-TA4e-7163.59% 
Drosophilabeg-PA4e-6441.89% 
EBI UniRef50UniRef50_E3X9X51e-6641.08%Putative uncharacterized protein n=3 Tax=Coelomata RepID=E3X9X5_ANODA
NCBI RefSeqXP_001865020.13e-7145.45%malonyl CoA-acyl carrier protein transacylase [Culex quinquefasciatus]
NCBI nr blastpgi|1700586676e-7045.45%malonyl CoA-acyl carrier protein transacylase [Culex quinquefasciatus]
NCBI nr blastxgi|1700586674e-6845.45%malonyl CoA-acyl carrier protein transacylase [Culex quinquefasciatus]
Group
Gene OntologyGO:00081521.8e-32metabolic process
GO:00167401.8e-32transferase activity
GO:00038242.1e-11catalytic activity
GO:00054882.1e-11binding
KEGG pathwaycqu:CpipJ_CPIJ0149349e-71 
 K00645 (fabD)maps-> Fatty acid biosynthesis
InterPro domain[187-281] IPR0012271.8e-32Acyl transferase domain
[1-281] IPR0160351.6e-30Acyl transferase/acyl hydrolase/lysophospholipase
[16-252] IPR0140434.4e-14Acyl transferase
[111-188] IPR0160362.1e-11Malonyl-CoA ACP transacylase, ACP-binding
[1-296] IPR0208014.7e-09Polyketide synthase, acyl transferase domain
Orthology groupMCL15254 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210607-TA
ATGGGTCGTAATTTACAAGAAGTACCTGCTGCTAAAGAGCTGTACGAACTTGCCTCAAGCATTGTCGGGTGGGACGTGGGACGAGTCTGTCGCGAGGGTCCGGCCGAGGAGCTCGACCGCCGCTGTCAGACGGCCGTGGTGGTGACCGCGCTGGGAGCCCTGGAGATGGCGCGCGACACGCGGCCCGCCGCCGTGGAACGAGCTCGCGCCGCCGCCGGCTTCTCGCTCGGGGAGATCACCGCGCTCGTGTTCGCGGACGCGCTCAGCTTCGAGTCGGCGCTGCGGCTCGTAGAGCTGCGAACGGCGGCCATGGAGGCGGCAGCGCGCGAGCGCACCGGCGGTATGCTCACGGTGTGGCTCGCGCCCGACGCCAAGGTCAAGGAGCTGCTGGCTCGAGCGAGAGACCGCGCGCGGTCCCCCGACCTTCCGGAACCCGTCTGCCTCATCGCCAACTACCTTTTCCCCGGCTGCAAGGTCCTGGCGGGAGACGAACAGGCATTGAAGTTCGTGGAGGCGGAGGGCCGCAAGTGGGGCGTGAAGCGCTCGGCCCGCGTGCGGGTGTCGGGCGCCTTCCACTCGCCTCTGATGGCGCGGGCGGAGGAGGCGGTTAGGGAAGCGGTCAACATGTGTTCGGTGCGCGAGCCGCGGCTTCCCGTGACGTCGTGTGTGGACGCTCGCGCCGCGCGGTCGGTGGCGGGAGTCAAGCGCCGCCTGGTGCGTCTCACCACATCTCCAGTTCGCTGGGAGCAGGTCCTCCATGTGTTGTACGCGCGGCCCCCGGACACGCCGCAGCCTCTCACCCTAGCGCTGGGTCCGGGCGGGGCGCTACGCTCTACACTCAAACTCGTCAACGCGCGGGCCTGGGACTCCTCGATTCAGATCGACGTCTGA

Protein sequence:

>DPOGS210607-PA
MGRNLQEVPAAKELYELASSIVGWDVGRVCREGPAEELDRRCQTAVVVTALGALEMARDTRPAAVERARAAAGFSLGEITALVFADALSFESALRLVELRTAAMEAAARERTGGMLTVWLAPDAKVKELLARARDRARSPDLPEPVCLIANYLFPGCKVLAGDEQALKFVEAEGRKWGVKRSARVRVSGAFHSPLMARAEEAVREAVNMCSVREPRLPVTSCVDARAARSVAGVKRRLVRLTTSPVRWEQVLHVLYARPPDTPQPLTLALGPGGALRSTLKLVNARAWDSSIQIDV-