Monarch geneset OGS2.0

DPOGS204393
TranscriptDPOGS204393-TA816 bp
ProteinDPOGS204393-PA271 aa
Genomic positionDPSCF300002 - 1464928-1475097
RNAseq coverage845x (Rank: top 15%)
Annotation
HeliconiusHMEL0038322e-4969.06% 
BombyxBGIBMGA007880-TA1e-3644.81% 
Drosophilafu12-PB2e-5742.80% 
EBI UniRef50UniRef50_D2SNX11e-12376.75%Acyltransferase n=2 Tax=Obtectomera RepID=D2SNX1_HELVI
NCBI RefSeqXP_559898.33e-7454.90%AGAP009415-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2609080275e-12376.75%acyltransferase [Heliothis virescens]
NCBI nr blastxgi|2609080273e-11777.04%acyltransferase [Heliothis virescens]
Group
Gene OntologyGO:00084154.1e-34acyltransferase activity
GO:00081524.1e-34metabolic process
GO:00160203.9e-32membrane
GO:00086543.9e-32phospholipid biosynthetic process
GO:00038413.9e-321-acylglycerol-3-phosphate O-acyltransferase activity
KEGG pathwayaga:AgaP_AGAP0094151e-73 
 K13509 (LPAAT, AGPAT1, AGPAT2)maps-> Glycerolipid metabolism
    Glycerophospholipid metabolism
InterPro domain[91-207] IPR0021234.1e-34Phospholipid/glycerol acyltransferase
[75-205] IPR0045523.9e-321-acyl-sn-glycerol-3-phosphate acyltransferase
Orthology groupMCL16976 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204393-TA
ATGTCCGCCTTCAACGTGGTGTTCGGTTGCGTAATGGCACTTCTAATAATTTTGTTTACAATTAGCTCTATAGCTAGATATTATATAAAGTTTACACTCTTTATTGTGCTTTCACTAATATTCGCATCGGTTCCTTTGCCGCTAATGTTCTTCAGACCCTTCAGTCCAAAAAATGCTCTTATACCAGCATATTTCTTGCGACTAACAGCTCGTTTCCTTGGGATCCGCTGGACGGTTCAAGGACTTGAGAATGTGGATGATTCTCGTGGCGCTGTAGTTTTATTAAACCATCAGAGTAGTTTGGATTTGTATGTATTGGCAGTGCTATGGCCGCTTATGGAGCGATGCACAGTGGTTGCCAAGCGTTCTCTCCAGTACTCCGTTCCCTTCGGCACAGCCACCTGGATGTGGGGAACCGTCTTCATAGATCGCGGTGCACAATCCGCCCGTTCTGTTCTCAACAAACAAACTGACGCCATCAAATGTCATAAGCGGAAATTATTACTATTTCCCGAGGGCACGCGGCACGGAGGTGACAAGCTTCTGCCCTTCAGGAAGGGAGCGTTCCACGTGGCTTTAAATGCAGAGGCTCCGATCCAACCAGTCGTTGTATCAAAATATCATTTCTTGGATTCCGATCGCCACCGATTTGGATCAGGTGAAATTATAATAACGATTCTGCCACTGATAGAAACGAAGGGTCTGACCAAGGATGACATCGATGCTTTGGTGGAGACAACCCAGGCTAAAATGCAAGAGGAATTTACAAACACGTCAGCCGAGACGTATTCGAGGCTCTCTCTTAAGAGTAATTGA

Protein sequence:

>DPOGS204393-PA
MSAFNVVFGCVMALLIILFTISSIARYYIKFTLFIVLSLIFASVPLPLMFFRPFSPKNALIPAYFLRLTARFLGIRWTVQGLENVDDSRGAVVLLNHQSSLDLYVLAVLWPLMERCTVVAKRSLQYSVPFGTATWMWGTVFIDRGAQSARSVLNKQTDAIKCHKRKLLLFPEGTRHGGDKLLPFRKGAFHVALNAEAPIQPVVVSKYHFLDSDRHRFGSGEIIITILPLIETKGLTKDDIDALVETTQAKMQEEFTNTSAETYSRLSLKSN-