Monarch geneset OGS2.0

DPOGS210892
TranscriptDPOGS210892-TA1257 bp
ProteinDPOGS210892-PA418 aa
Genomic positionDPSCF300045 - 800484-803196
RNAseq coverage326x (Rank: top 35%)
Annotation
HeliconiusHMEL0132937e-14357.68% 
BombyxBGIBMGA003771-TA5e-17272.61% 
DrosophilaCG31683-PA8e-11952.66% 
EBI UniRef50UniRef50_F4WZ351e-13156.96%Group XV phospholipase A2 n=10 Tax=Endopterygota RepID=F4WZ35_ACREC
NCBI RefSeqXP_966553.12e-13254.73%PREDICTED: similar to phosphatidylcholine-sterol acyltransferase (lecithin-cholesterol acyltransferase) [Tribolium castaneum]
NCBI nr blastpgi|3071880733e-13155.34%1-O-acylceramide synthase [Camponotus floridanus]
NCBI nr blastxgi|3071880737e-13060.82%1-O-acylceramide synthase [Camponotus floridanus]
Group
Gene OntologyGO:00083741.2e-157O-acyltransferase activity
GO:00066291.2e-157lipid metabolic process
KEGG pathwaytca:6549915e-132 
 K06129 (LYPLA3)maps-> Lysosome
    Glycerophospholipid metabolism
InterPro domain[9-415] IPR0033861.2e-157Lecithin:cholesterol/phospholipid:diacylglycerol acyltransferase
Orthology groupMCL11820 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210892-TA
ATGTATTATGTGAAAAACTATTTCGTTTTTTTGTTGACATTTAGCTTATTTGCTCAAAACTCGCTCGGGTTTTCACCAGTTATTTTAATTCCTGGTGATGGAGGGAGTCAACTGGAGGCTAAAGTTAATAAAACGAATGTTGTGCATTACATTTGTGCTAAGACCTCAAATGATTTCTACAATGTGTGGCTCAACCTGGAACTGCTGGTACCTTTTGTGATAGATTGTTGGATAGACAATGTACGACTGGAATACAATAATGTTACACGAAAGACGAACAACCCTCAGGGGGTGGAGATTAGAGTACCAGGCTGGGGTAACCCAGAACCTGTTGAGTGGTTAGATCCATCTCATGATTCTGAGGGTGCTTACTTTAATACCATCGGAGATGCTTTAGTCAAAATGGGTTATGTAAGAAATGTGTCTTTGAGAGGGGCTCCATATGATTTCCGAAAGGCACCAAATGAAAACGGTGAGTTCTTTGTGAAGTTAAAGACCCTTGTAGAGGATACATACGCAATGAATAACAAGACAGCCGTGACACTGCTAGTACACAGTATGGGAGGGGCGATGGCACTGCAATTTCTTCAGCTGCAAAGCCAGAGCTGGAAGACCCAGCACATACGGAGGATGATCTCCCTCTCCACACCATGGGGAGGTGCTGTGAAAGCACTCAAAGTATTTGCTATAGGTGATGACCTCGGCTCTATGATGCTGTCTCCCAGTACCTTGCGGGCTCAGCAGATCACGTACCCGTCGTTAGCCTGGCTGCTGCCGTCGACTCGTCTGTGGGGCCCCCACGAGCTGCTCGTCACCACCGACAAATACAACTACACCATCAACGACCTGCAGAAGCTCTTCAACGATATGGAGTTACCGAACGCTTGGGAGATGCGTCGGGACACAGAGAAGTACTCCAGCGACATGGCCGCGCCGGGAGTCGAACTGCACTGTATATATGGATATAACATCTCTACCGTTGAGAGACTGGACTATAAGCCCGGCACTTGGTTGGACGGAAAACCTAATCTGGTGTTCGGTGACGGCGACGGTACGGTCAACCTGCGCTCACTGTCATACTGTGAGCGGTGGGGAAAGACGAGGCGGCGTGCGGGCCTGTGGAGACCCTTACAACGAGGAACTCGGGCCCCGCAGGCCTCGCTGAAGGCCCTGCCACTTAGCAACGCGGAACACCTCAAGATTCTCCACGACCCTCGCGTGGTACAATACATCACCACCGTCATGGCCATACCGTGA

Protein sequence:

>DPOGS210892-PA
MYYVKNYFVFLLTFSLFAQNSLGFSPVILIPGDGGSQLEAKVNKTNVVHYICAKTSNDFYNVWLNLELLVPFVIDCWIDNVRLEYNNVTRKTNNPQGVEIRVPGWGNPEPVEWLDPSHDSEGAYFNTIGDALVKMGYVRNVSLRGAPYDFRKAPNENGEFFVKLKTLVEDTYAMNNKTAVTLLVHSMGGAMALQFLQLQSQSWKTQHIRRMISLSTPWGGAVKALKVFAIGDDLGSMMLSPSTLRAQQITYPSLAWLLPSTRLWGPHELLVTTDKYNYTINDLQKLFNDMELPNAWEMRRDTEKYSSDMAAPGVELHCIYGYNISTVERLDYKPGTWLDGKPNLVFGDGDGTVNLRSLSYCERWGKTRRRAGLWRPLQRGTRAPQASLKALPLSNAEHLKILHDPRVVQYITTVMAIP-