Monarch geneset OGS2.0

DPOGS207444
TranscriptDPOGS207444-TA1122 bp
ProteinDPOGS207444-PA373 aa
Genomic positionDPSCF300051 - 724939-737644
RNAseq coverage1118x (Rank: top 11%)
Annotation
HeliconiusHMEL0123241e-5379.37% 
BombyxBGIBMGA009914-TA4e-6477.16% 
DrosophilaCG42237-PA1e-6944.41% 
EBI UniRef50UniRef50_Q7Q4S01e-7349.66%AGAP000899-PA n=4 Tax=Culicidae RepID=Q7Q4S0_ANOGA
NCBI RefSeqXP_316877.42e-7449.66%AGAP000899-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479647114e-7349.66%AGAP000899-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479647112e-7349.83%AGAP000899-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160423.9e-102lipid catabolic process
GO:00055093.9e-102calcium ion binding
GO:00046233.9e-102phospholipase A2 activity
KEGG pathwaydme:Dmel_CG30091e-21 
 K01047 (PLA2G)maps-> GnRH signaling pathway
    Fc epsilon RI signaling pathway
    MAPK signaling pathway
    Linoleic acid metabolism
    alpha-Linolenic acid metabolism
    Arachidonic acid metabolism
    Vascular smooth muscle contraction
    Glycerophospholipid metabolism
    Long-term depression
    Ether lipid metabolism
    VEGF signaling pathway
InterPro domain[78-359] IPR0012113.9e-102Phospholipase A2, eukaryotic
[264-363] IPR0160906.2e-34Phospholipase A2
Orthology groupMCL16532 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207444-TA
ATGTCCTTCCGACGACTGCTCAGCGCGGTGCGGTCGGCCCGCGACGCGCTCACGAGGGGCGCGCCCATGGCCGACGGTTTGTTCAAGTCGCTCGTATCATTGGTGGTGATCCTCTGCATCGCCGTGTCGTCCCCGCAGTCGGCGTCCGCGAAGCCCTTCGCCTTCAGCTTTCCGGCGAGCTGGTCCCTGGCTGGGCTGGTGGGAGGTGTGGGCGCTCGCTCCGAGGACCACCGGAGAGAACCCGCGGGAGTGAAGCCCTACACCGAGCGGAGAGTCTCCAACGACACCCTCCGTATGATCTACTTCCACGACCAGACGGTGGCCGTGGTAGAGCTCGGCTTGGACAAGCTGCTCCTCAACTGCGAGCTCATCGAGACATATGATGAGGACGACACGAGCCGCCTCCTGCGCCAGTTGAGCAGCATCAACCGACCTCTGGCCATCAACTTCCCTCAGATGACCAAGTTAATGAGCCAGTGTCAGCAGTTTTTTGAATTATTAGTGCTTAAGCTTTTATTAAAACCTAAGAAGGAAAGCGTAAGTGATGAGGACGACACGAGCCGCCTCCTGCGCCAGCTGAGCAGCATCAACCGACCTCTGGCCATCAACTTCCCTCAGATGACCAAGTTAATGAGCCAGTGTCAGCAGGTCGACGGAGTGGAGGGGTCGGAGGGATGGGCGGCGTCCCGGCGTAGGGCGGACTGGCGAGAACGAGGAGCAGCGAGACTGCGGGCAGGCGGGCAACACGCGGGGCTGCTGGGAGGCAGTCCGCTGTCACTGCTACAGGGGATAATACCCGGCACTAAATGGTGTGGGACGGGCGACATCGCGGCGGACTACCACGACCTGGGCTCCGACCGGCCCCTGGACCGCTGCTGCCGCACGCACGACCTGTGTCCCAGCAAGGTCCGCGCCTTCTCCACTCGCTACAACCTCACCAACAACTCCCTCTACAGCAAGTCGCACTGCACCTGCGACGACATGCTTTTCGAGTGTTTGAAGGCGACCAACACGTCCGCCTCTCACCTCATGGGGCACATCTATTTCAATATAGTCCAAGTGCCCTGCTTCGAGGACCTTCCCTCCGGCCGGCGGTTCAGAGAAGCGAAGCAAGGCTTCTGA

Protein sequence:

>DPOGS207444-PA
MSFRRLLSAVRSARDALTRGAPMADGLFKSLVSLVVILCIAVSSPQSASAKPFAFSFPASWSLAGLVGGVGARSEDHRREPAGVKPYTERRVSNDTLRMIYFHDQTVAVVELGLDKLLLNCELIETYDEDDTSRLLRQLSSINRPLAINFPQMTKLMSQCQQFFELLVLKLLLKPKKESVSDEDDTSRLLRQLSSINRPLAINFPQMTKLMSQCQQVDGVEGSEGWAASRRRADWRERGAARLRAGGQHAGLLGGSPLSLLQGIIPGTKWCGTGDIAADYHDLGSDRPLDRCCRTHDLCPSKVRAFSTRYNLTNNSLYSKSHCTCDDMLFECLKATNTSASHLMGHIYFNIVQVPCFEDLPSGRRFREAKQGF-