Monarch geneset OGS2.0

DPOGS213951
TranscriptDPOGS213951-TA1308 bp
ProteinDPOGS213951-PA435 aa
Genomic positionDPSCF300226 + 24473-29949
RNAseq coverage375x (Rank: top 32%)
Annotation
HeliconiusHMEL0152784e-15874.51% 
BombyxBGIBMGA003368-TA4e-12668.91% 
DrosophilaCG6847-PA7e-4336.40% 
EBI UniRef50UniRef50_D2SNY52e-9777.31%Neutral lipase (Fragment) n=1 Tax=Heliothis virescens RepID=D2SNY5_HELVI
NCBI RefSeqXP_001122243.12e-8348.01%PREDICTED: similar to Pancreatic lipase-related protein 2 precursor (Secretory glycoprotein GP-3) [Apis mellifera]
NCBI nr blastpgi|2609080566e-9777.31%neutral lipase [Heliothis virescens]
NCBI nr blastxgi|2609080562e-9677.31%neutral lipase [Heliothis virescens]
Group
Gene OntologyGO:00038243.9e-101catalytic activity
GO:00066293.9e-101lipid metabolic process
KEGG pathwaynvi:1001136318e-78 
 K12591 (RRP6, EXOSC10)maps-> RNA degradation
InterPro domain[65-376] IPR0007343.9e-101Lipase
[68-375] IPR0138182.2e-65Lipase, N-terminal
Orthology groupMCL10341 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213951-TA
ATGACGACTGATGAACTTGAAAACCACATAAGTCCGGAAGACCTAGAAGAGTTCCTCAAATGGTGCAAAAATAATACTGCAACAGAAATTTCCCCGAATCATGATGTCGATTACGCTAGTTTCGAGGATACGACCACAATCCTCCCCGTACCAGAAACAGTGGACTGTTTCGGCTTGGGCAGCCTCGTTGGTTTTTTCAACATGAGGCCGGACTCTTCAAATGACGTCAATACTCATTTTTATTTTTCATCTCGTCAAAAACCGGATAGAGTACAGGTTTATCCAGGAACACAGTTTGGTCTGGAGTGGGTGGATTTTAGGCCGGACAGAAGGACCATATTGATTGTCCATGGCTTCATGAGCCACAGCAACGCGTCCTGGGTTTTGGACATGACGCGGGCGTTTTTAGAATGGAGGGATGTGAACGTGATAGCTGTGGATTGGAGTAAAGGTGGCAACACTTGGAAGTATTGGAGAGCAGTGGCCAATACTAGGAGAGTAGGATCTGATGTTGTTGGGTTCATGAAACAGCTTATGACGGCGACTGGTGCTAATGTTAAGGATTTCCATTTCATAGGTCACAGTCTCGGCGCTCACATCGTGTCCTACGTGTCTTATCATATTGGGAGGGTTGCTAGAATAACAGGTCTAGACCCAGCGCAGCCATGTTTCAGAACATCGAGTCGTGTGGAGCGCTTGGATGAAACTGACGCAGACTTTGTAGACGTCATACACACTAACGGTAGACTATTGAAAAGGATTGGATTTGGACTCCCAGATCCTATTGGTCATGCAGATTTCTATCCCAACGGCGGAATGAAACAACCCGGTTGTAAGAACGAGACCAGAACTATTTGGTCCACACTCTTCCCTGGTTCAGTTGCCAGATTACAGCAGGCTATATGCAGCCATGGGCGAGCGTATCTTCTCTTTACAGAATCTCTGATTAACAATAACTGTAGCTTTATAGCACATAACTGGAATTTAACATACGAAGGGGTCAACGCAAGCATATCAGCTGCTTGTGATAGAGCCGCCCCGTGTTCTGAGATGGGCATCCGAGCTGATCAAAAACGAGTCTATAAAGGGGCTTATTTCGTACTCACCACCGAAAAAGAACCTTATTGCCCAACAGACTACGAGCGGACACATCCCCAGCCAGATTTATTACTAGATTTGCGTAAAGGGTTCAGAATAGACCGGTCGTCAAAAGCAACAACACCGGCACCAGCGTTCGCTGACCCTCAAGACATAACAACCACTACCGCGAAGAACTGGTTCAGGAGAATTATTGACAAATTGGGATAA

Protein sequence:

>DPOGS213951-PA
MTTDELENHISPEDLEEFLKWCKNNTATEISPNHDVDYASFEDTTTILPVPETVDCFGLGSLVGFFNMRPDSSNDVNTHFYFSSRQKPDRVQVYPGTQFGLEWVDFRPDRRTILIVHGFMSHSNASWVLDMTRAFLEWRDVNVIAVDWSKGGNTWKYWRAVANTRRVGSDVVGFMKQLMTATGANVKDFHFIGHSLGAHIVSYVSYHIGRVARITGLDPAQPCFRTSSRVERLDETDADFVDVIHTNGRLLKRIGFGLPDPIGHADFYPNGGMKQPGCKNETRTIWSTLFPGSVARLQQAICSHGRAYLLFTESLINNNCSFIAHNWNLTYEGVNASISAACDRAAPCSEMGIRADQKRVYKGAYFVLTTEKEPYCPTDYERTHPQPDLLLDLRKGFRIDRSSKATTPAPAFADPQDITTTTAKNWFRRIIDKLG-