Monarch geneset OGS2.0

DPOGS210321
TranscriptDPOGS210321-TA1398 bp
ProteinDPOGS210321-PA465 aa
Genomic positionDPSCF300025 - 855955-871250
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0051272e-7087.50% 
BombyxBGIBMGA011952-TA6e-3255.86% 
DrosophilaCG32699-PB4e-9741.82% 
EBI UniRef50UniRef50_F4WJD47e-13450.10%Lysophosphatidylcholine acyltransferase 2 n=6 Tax=Bilateria RepID=F4WJD4_ACREC
NCBI RefSeqXP_969176.22e-12354.14%PREDICTED: similar to CG32699 CG32699-PB [Tribolium castaneum]
NCBI nr blastpgi|2700040213e-14255.58%hypothetical protein TcasGA2_TC003327 [Tribolium castaneum]
NCBI nr blastxgi|2700040216e-13855.58%hypothetical protein TcasGA2_TC003327 [Tribolium castaneum]
Group
Gene OntologyGO:00084157.7e-16acyltransferase activity
GO:00081527.7e-16metabolic process
GO:00055096.4e-11calcium ion binding
KEGG pathwaytca:6576365e-123 
 K13510 (LPCAT1_2)maps-> Glycerophospholipid metabolism
    Ether lipid metabolism
InterPro domain[103-213] IPR0021237.7e-16Phospholipid/glycerol acyltransferase
[333-451] IPR0119926.4e-11EF-hand-like domain
Orthology groupMCL10489 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210321-TA
ATGCGGTTGGTGGTGGTAGCTGCTGGTTTCCACCGCGTGCGTGTTCTGGGCCGTCACCACCTACCGTCCAGTCCGAGGGATGCGCCGGTGGTGGTGATGGCACCACACTCCTCCTTCTTCGACGCGATCGCCATCGTCTGCCTTGGCGCTCCAAGTGTCGTCGCTAAAGCTGACACCGCAAGGCTTCCATTCATTGGACGTACGTCGTTGAAACCGCTAATATGCTTTATCGGCAAGATGTCCTACTTGGCTGGTGGGATGGCGATATCTATCCGTGGTCGCCAGGCTAGCCGCAAGGAGGCTCCCATTCTGGTAGTGGCACCGCATTCCTCGTTTTTGGACAGCTGTATAGTGTATGCAACGAGGATGTCTTCTGTCATAGTGAGGAAGGAAAGCATGGACAATTATGTGGGAAAGTTAATAAACTACACGCAGCCGGTGTATGTGTGGCGGGATGACCCTAACTCGAGACAGAACACCATCAAAGAGATTATAGAACGAGCGACCTCCAAAGAAGACTGGCCTCAGGTGCTAATTTTCCCCGAAGGAACTTGCACGAACCGCTCTTGTCTCATCACATTCAAGCCTGGTGGCTTCTACCCTGGGGTACCGGTACAGCCTGTGACGATCAGGTATCCTAACGCTAAAGACACGGTCACTTGGACCTGGGAAGGACCTGGCGCATTGAAGCTCCTATGGCTGACGTTAACTCAAGTGCATAGCTCCTGCGAAATCGAGTTCCTTCCGGTCTACTATCCCAGTGAAGAAGAGAAAAAGGATCCCAAGCTGTACGCGAGAAACGTTAGAGATGTTATGGCCAAAGCTCTGGGGGTGCCAGTTCTAGACTACACATACGACGACTGTAGATTGATCGCCCGCGCCAAGCAGCTCGGCATACCGGGCGGCGCGCTATGTAGGGAAGTCAGCGAACTTAGGGCCCACTTGGGTTTGGACAGTTCCCCTTCAGATATGGAGCCGTCTCCAGGCGCCTGGTATGACTTGCAGCAGTTCGCGCGTCGTCTCGGCACAGACGCGCAGAACCCTCACGCTCGACGTCTCTTCGACATTTACAAACAGAGGTCCAACGGCCTGGTCTTCTTCCCGGACTACCAGCTGTGTGCGTGTTATCTGTCTCTTCAGCATGAGCCTGTTGCGACGATTCTACAACACGCCTTTAAGCTGTACGAGGCGTCAGGTCGATTGAGTCGTAACCAGTTCGAGCACGTCGCCGCCCGCTGTCTCGGCCTGTGTGTGGAGGACGCGGGTCACGCCTTCACTCAGGCGGATATTGACGAGAAGACGTTCATAACATACGATGACTTCATAAACTTCGCTCAGAAAAAGGCGGAATTCTCATTTATCTTCACAAGTGATTCAACAAAACCGAAGACTCAGTGA

Protein sequence:

>DPOGS210321-PA
MRLVVVAAGFHRVRVLGRHHLPSSPRDAPVVVMAPHSSFFDAIAIVCLGAPSVVAKADTARLPFIGRTSLKPLICFIGKMSYLAGGMAISIRGRQASRKEAPILVVAPHSSFLDSCIVYATRMSSVIVRKESMDNYVGKLINYTQPVYVWRDDPNSRQNTIKEIIERATSKEDWPQVLIFPEGTCTNRSCLITFKPGGFYPGVPVQPVTIRYPNAKDTVTWTWEGPGALKLLWLTLTQVHSSCEIEFLPVYYPSEEEKKDPKLYARNVRDVMAKALGVPVLDYTYDDCRLIARAKQLGIPGGALCREVSELRAHLGLDSSPSDMEPSPGAWYDLQQFARRLGTDAQNPHARRLFDIYKQRSNGLVFFPDYQLCACYLSLQHEPVATILQHAFKLYEASGRLSRNQFEHVAARCLGLCVEDAGHAFTQADIDEKTFITYDDFINFAQKKAEFSFIFTSDSTKPKTQ-