Monarch geneset OGS2.0

DPOGS210832
TranscriptDPOGS210832-TA1023 bp
ProteinDPOGS210832-PA340 aa
Genomic positionDPSCF300027 - 110669-113067
RNAseq coverage600x (Rank: top 21%)
Annotation
HeliconiusHMEL0212494e-14378.33% 
BombyxBGIBMGA003920-TA1e-14568.53% 
DrosophilaCG18258-PA1e-2931.64% 
EBI UniRef50UniRef50_D2SNY87e-5963.74%Neutral lipase (Fragment) n=1 Tax=Heliothis virescens RepID=D2SNY8_HELVI
NCBI RefSeqXP_002101470.13e-3031.54%GE17651 [Drosophila yakuba]
NCBI nr blastpgi|2609080633e-5863.74%neutral lipase [Heliothis virescens]
NCBI nr blastxgi|2609080631e-5763.74%neutral lipase [Heliothis virescens]
Group
Gene OntologyGO:00038241.1e-45catalytic activity
GO:00066291.1e-45lipid metabolic process
KEGG pathwaydme:Dmel_CG56652e-24 
 K01059 (LPL)maps-> Glycerolipid metabolism
    Alzheimer's disease
    PPAR signaling pathway
InterPro domain[138-311] IPR0007341.1e-45Lipase
[84-313] IPR0138183.1e-37Lipase, N-terminal
Orthology groupMCL24956 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210832-TA
ATGAAGTGTGTTTTGATTTTGTTTTGTTATTTCTCCTACATTAGTGCGTGGGGTCGCGGGGATTTAGAAAAATATGGTCCTTTCCAGTTAGCACTGCATTCGAAATTAATTAAATGCGATCACGACAGAAATCTGAATTTAGACGTCAGCGGCATAGACGTATATTTCTACGATTTTCCAAGAAACGACGTGGAAACTTTTACAATTGACAACGCTGCGAGAGGAATCCTCGACATAAAGGAACTCGATAAAACGAGGAAATTCATTATATTCGTTGCTGGATACAAATCCAATATCAATAAAAGAACCGAGGAAAGAGTTAGGGATACTTTTAGAAATTATCCAAACAGCTACTTGATTATCCTCGACCATTCAGAATACACGAACGATAAGCAAGGAAATATCAAAAGCTATGAAAGATCAGTTAAATACGTATTTTATATTGGAAGGGCATTAGCTCATATGCTAGTACGCTTAGAGGAAGGCGGCATATCTCCCAAAAATATACACTGCATCGGTCATAGTTTGGGTTCCCAGATTTTAGGCAATACTGGAGAAATCTTTTATAATATAACTGGGAAGAAGATTGCAAGGATTACGGCTCTGGACCCAGCCGGGCCTTGTTTTTCTAATAGCCTAATACAAGAACAAGTGAGGTCTGGCGTTGCAGATTATGTTGAAGTATATCACTGTAATGCAGGGGGATTGGGGACAACTAGTGTTCTAGGAGACGTAGATTTCTTCGTAAACAAAAAGGGCCAAAGCCAACCGAAATGCGGGACTCCACTAATACCAGGTGTATTCGACTCCTCGAAGGCAGCGAAATGTAACCACAGAGCCTGCATCGATCTTTGGACAGCGACGGTCGCAAATCCAAATTGGTATTTGGCCTGGAAATGTGATTCGTATAAAATGTTCAAAAATGGTGCGTGTGCTGCTAACGACGTCACCATCGCTGGTTTCTGGAATCCTGGTAATGCGACAGGTGTTTACTACTTCAGCACTAATGGCTATGACTACTAA

Protein sequence:

>DPOGS210832-PA
MKCVLILFCYFSYISAWGRGDLEKYGPFQLALHSKLIKCDHDRNLNLDVSGIDVYFYDFPRNDVETFTIDNAARGILDIKELDKTRKFIIFVAGYKSNINKRTEERVRDTFRNYPNSYLIILDHSEYTNDKQGNIKSYERSVKYVFYIGRALAHMLVRLEEGGISPKNIHCIGHSLGSQILGNTGEIFYNITGKKIARITALDPAGPCFSNSLIQEQVRSGVADYVEVYHCNAGGLGTTSVLGDVDFFVNKKGQSQPKCGTPLIPGVFDSSKAAKCNHRACIDLWTATVANPNWYLAWKCDSYKMFKNGACAANDVTIAGFWNPGNATGVYYFSTNGYDY-