Monarch geneset OGS2.0

DPOGS216044
TranscriptDPOGS216044-TA624 bp
ProteinDPOGS216044-PA207 aa
Genomic positionDPSCF300067 - 279642-280372
RNAseq coverage472x (Rank: top 26%)
Annotation
HeliconiusHMEL0089203e-11088.89% 
BombyxBGIBMGA009022-TA2e-10485.51% 
DrosophilaGXIVsPLA2-PA4e-6855.45% 
EBI UniRef50UniRef50_F4W4H58e-7164.58%Group XIIA secretory phospholipase A2 n=12 Tax=Endopterygota RepID=F4W4H5_ACREC
NCBI RefSeqXP_973207.26e-7765.89%PREDICTED: similar to AGAP011108-PA [Tribolium castaneum]
NCBI nr blastpgi|2700122432e-7766.82%hypothetical protein TcasGA2_TC006362 [Tribolium castaneum]
NCBI nr blastxgi|2700122434e-8066.03%hypothetical protein TcasGA2_TC006362 [Tribolium castaneum]
Group
Gene OntologyGO:00160426.8e-105lipid catabolic process
GO:00055096.8e-105calcium ion binding
GO:00055766.8e-105extracellular region
GO:00046236.8e-105phospholipase A2 activity
KEGG pathwayame:4096145e-70 
 K01047 (PLA2G)maps-> GnRH signaling pathway
    Fc epsilon RI signaling pathway
    MAPK signaling pathway
    Linoleic acid metabolism
    alpha-Linolenic acid metabolism
    Arachidonic acid metabolism
    Vascular smooth muscle contraction
    Glycerophospholipid metabolism
    Long-term depression
    Ether lipid metabolism
    VEGF signaling pathway
InterPro domain[1-197] IPR0107116.8e-105Phospholipase A2, group XII secretory
[89-176] IPR0160902.3e-16Phospholipase A2
Orthology groupMCL12944 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216044-TA
ATGGATATACCATACAGAAAGATTTTTATTTATATTCTCACCTTTGCCGCCTACGCTTACACGGGCATTGGTTCTACCATGTTAAGAAATCTCAAGGATGCAGTCTTGTCTGCGGAATCAGTATTTGGTGATGTATTTCGAAATGTGATTACGGTTGCTGAGAAATTTAAATCTCTTCATGATGTTTTTGATGCTGCTGTAGAAGAGGATTGCATATTTACATGTCCAGAAGGACATAAACCTGTGAGAAATAGAAATCATGTGCCAAAATCTGATGGTTGTGGCTCGCTCGGGTTTGAAATATCATCAGATTATTTGCCCATTGAGCAAATGACCAGGTGTTGTGATGCACATGATATTTGCTATGACACCTGCAATAGTGGTAAAGAGGCTTGTGATTTAGAGTTCAAGAGATGTTTATACAATTACTGTGATACTTATAAATCTGTGAATGTTGCCGGTGATGCTATTACCAAAGGTTGTAAAGGTGCAGCAAAAGTACTTTTTACAGGAACTTTAACACTTGGCTGTAAGTCCTATTTAGATGCCCAGAAGAATGCATGTTATTGTCCTCCGACCAAAAACAAATATAGAAAGTATACTACTGGAAATGATGAACTATAG

Protein sequence:

>DPOGS216044-PA
MDIPYRKIFIYILTFAAYAYTGIGSTMLRNLKDAVLSAESVFGDVFRNVITVAEKFKSLHDVFDAAVEEDCIFTCPEGHKPVRNRNHVPKSDGCGSLGFEISSDYLPIEQMTRCCDAHDICYDTCNSGKEACDLEFKRCLYNYCDTYKSVNVAGDAITKGCKGAAKVLFTGTLTLGCKSYLDAQKNACYCPPTKNKYRKYTTGNDEL-