Monarch geneset OGS2.0

DPOGS200226
TranscriptDPOGS200226-TA1053 bp
ProteinDPOGS200226-PA350 aa
Genomic positionDPSCF300414 + 40045-42896
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0175325e-10457.81% 
BombyxBGIBMGA008382-TA3e-12558.75% 
DrosophilaCG5665-PA2e-3734.57% 
EBI UniRef50UniRef50_Q06VE98e-5235.43%Putative uncharacterized protein n=1 Tax=Trichoplusia ni ascovirus 2c RepID=Q06VE9_TNAVC
NCBI RefSeqXP_001605911.19e-4237.78%PREDICTED: similar to lipase [Nasonia vitripennis]
NCBI nr blastpgi|1163268183e-5135.43%hypothetical protein TNAV2c_gp132 [Trichoplusia ni ascovirus 2c]
NCBI nr blastxgi|1163268186e-5135.43%hypothetical protein TNAV2c_gp132 [Trichoplusia ni ascovirus 2c]
Group
Gene OntologyGO:00038243.8e-82catalytic activity
GO:00066293.8e-82lipid metabolic process
KEGG pathwaydme:Dmel_CG56652e-35 
 K01059 (LPL)maps-> Glycerolipid metabolism
    Alzheimer's disease
    PPAR signaling pathway
InterPro domain[105-343] IPR0007343.8e-82Lipase
[100-342] IPR0138186.9e-58Lipase, N-terminal
Orthology groupMCL22180 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200226-TA
ATGACACTGCGTATTATAAAAGTGTTGTACTTCATGGCAGCAATAAATGTCTACACTAGCGGTGTCTCCAGACAGCTAGGTAGATTTAGTGTCAGTTCTTTATTGGATTCTCTGCCTGTTCTGCAACCAATCGTGCAAGCTGGATCCAATAGATGTGAAGGTTTAAAAAGCTATTTCGGAATAACATATGAACAGTTGTTACAAAAGAATACAACAAAATTTGATGATATAAGCTTAGATCACATAACCAGAAACGGTAAAGTAAAATATAATCTTAATAACACCAATAACTTAAAGAAGATAATGAGAAATACGAGGAATGTCATAATTATTATTCATGGATATATGGAAAGTTCCGACGGTCTAATGGTGAACCGTGTGGCTCCAGAATTTTTAAAGAAAAAGTATGTAGGCGTCTTCGCGATGGATGGTAGTAATGTTTTTAGTTACGAGTATTTTCGAACTTCTACCTATGCACGCTTCTTGGGTGACAAACTTGGTGATTTATTAAGTGAATTGATTAAAAAAGGAGTTGATCCATCAAAAATTACCCTTGTTGGACATAGTCTGGGAGCCCATATAGCTGGTGTTGCGGGAAATAAAGTCAAGCAAAATACTAATAAGCTTCTGCGTCGAATAACAGGTCTTGATCCAGCGGGTCCTTGTTTTAGTAACGTGCATTTAGATGGTAGACTGGATAAACAAGACGCTGAATACGTTGACGTACTTCACACTAACGCTGGTTTATTGGGGTTAAATTTGCCTGTAGGACACAAGGATTTTTATCCGAATAGTGGCATGTACCAGCCCGGTTGCTTTTTGTCTACTTGTGATCATAGCCGAGCTTGGGAGTTTTATGCCGAATCAATGAACAATTCGGACAACTTCCCAGCACGCAAATGTGAAAATTGGACTGCATTTAAAAATGGTATGTGCACGAAGAATGAGATAGCATATATGGGGTTTAACTCTGAACCAGGCTCTCCAGGTTCGTATTTTTTATCAACTGCATCCTCGTCTCCGTATGGTTTGGGTGCATCTGGAAGCGGGTGA

Protein sequence:

>DPOGS200226-PA
MTLRIIKVLYFMAAINVYTSGVSRQLGRFSVSSLLDSLPVLQPIVQAGSNRCEGLKSYFGITYEQLLQKNTTKFDDISLDHITRNGKVKYNLNNTNNLKKIMRNTRNVIIIIHGYMESSDGLMVNRVAPEFLKKKYVGVFAMDGSNVFSYEYFRTSTYARFLGDKLGDLLSELIKKGVDPSKITLVGHSLGAHIAGVAGNKVKQNTNKLLRRITGLDPAGPCFSNVHLDGRLDKQDAEYVDVLHTNAGLLGLNLPVGHKDFYPNSGMYQPGCFLSTCDHSRAWEFYAESMNNSDNFPARKCENWTAFKNGMCTKNEIAYMGFNSEPGSPGSYFLSTASSSPYGLGASGSG-