Monarch geneset OGS2.0

DPOGS212972
TranscriptDPOGS212972-TA1113 bp
ProteinDPOGS212972-PA370 aa
Genomic positionDPSCF300057 + 573689-577035
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0057806e-12561.35% 
BombyxBGIBMGA011617-TA4e-11959.38% 
DrosophilaCG4582-PA1e-2630.00% 
EBI UniRef50UniRef50_UPI000224713A2e-2935.27%UPI000224713A related cluster n=1 Tax=unknown RepID=UPI000224713A
NCBI RefSeqXP_001605911.12e-2934.88%PREDICTED: similar to lipase [Nasonia vitripennis]
NCBI nr blastpgi|3454882207e-2935.27%PREDICTED: pancreatic lipase-related protein 2-like [Nasonia vitripennis]
NCBI nr blastxgi|3454882201e-2735.86%PREDICTED: pancreatic lipase-related protein 2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00038241.2e-52catalytic activity
GO:00066291.2e-52lipid metabolic process
KEGG pathway 
InterPro domain[94-340] IPR0007341.2e-52Lipase
[94-340] IPR0138182.1e-41Lipase, N-terminal
Orthology groupMCL21155 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212972-TA
ATGGTGATGAGACCGGTGATGGATTTGGTGAATATGTTGGTGATATGTATCCTGGTTGTTATCGGTAGTGGACATTGTATGTATAGTAGCAAGGAGCTCGAAGGTTACCCAGCAGGATTTATGGCAGATTGCCCTGGTTCAGATAAGGAAGCGATCATCACGAAGAACACTTTAAAGAACTTGTCGATGACAGTAGTTAGCACGAGCAATCCAATGAAAGAAATACGAAGGAAATATAATTACTATCAAATGAAGGAGCTAGCTAAAGATCCGTTGTTAGATTTTAACAAACGTACCGTGCTCTACATAGGAGGTTTTCTAGACAGCCCCAACTTTCCCATACCAGGGAGGGTAGCGAAGGTCTACAATAGTATAGGATATAACGTCTTGTTATTAGACACCAGCTATTTTACAACATATGAATATCCAAGAGCGTCACGTCTAGCTCGCCCCGTCGGTGTTCACGGAGCGAAAATGTTGTTCGAACTCAAAAAACAAGGTCTAGACCCGAAGAAGTTAGATGTAGTTGGCCTCAGCCTTGGAGTTCACACTGCCAGCTTTATAGCCAAAAACTTTAGACTCATAACTGGCGTCAATATATCAAGAATAACTGCGTTGGATCCTTCGGGACCTTGCTTTAGGAATTTAGGTCCAGAAGACAGAATCGACAAATCCGATGCCGATTTCGTTGAAATGATTGCCACTAATATAGACGGTTACGGCATGGCGGCCCCCGTGGGTCATGTCAATTATTACGTAAACGGCGGAGAGTTTCAACCTGGAGAATTAGTTTGGTTCCTCTGCATGACCCTGTGTAGTCACATCCGGTCTTATATGGTGTGGCTGTCAGCTCTGCAGCACCCCAATTCATTTATCGCCATGCAGTGTGACTCCGTACAACAGGCGCGATTCAAAAAATGTCGGGAAAGGAAGCCTTTGGTGACTAATTTGATGGGACTAAAGGTGGATAAGAAAAATGAAGGCATATTTTATCTAGCCACTTCCGCTGCATATCCTTATTATTTAAGCGAGAGAGGCTTAAAACAAAACTCTGACCTTTTCAAGTCATTAGCAGCGTCATTTAACAAAGAAAAACTTGTTAAAGTTAGATGA

Protein sequence:

>DPOGS212972-PA
MVMRPVMDLVNMLVICILVVIGSGHCMYSSKELEGYPAGFMADCPGSDKEAIITKNTLKNLSMTVVSTSNPMKEIRRKYNYYQMKELAKDPLLDFNKRTVLYIGGFLDSPNFPIPGRVAKVYNSIGYNVLLLDTSYFTTYEYPRASRLARPVGVHGAKMLFELKKQGLDPKKLDVVGLSLGVHTASFIAKNFRLITGVNISRITALDPSGPCFRNLGPEDRIDKSDADFVEMIATNIDGYGMAAPVGHVNYYVNGGEFQPGELVWFLCMTLCSHIRSYMVWLSALQHPNSFIAMQCDSVQQARFKKCRERKPLVTNLMGLKVDKKNEGIFYLATSAAYPYYLSERGLKQNSDLFKSLAASFNKEKLVKVR-