Monarch geneset OGS2.0

DPOGS210216
TranscriptDPOGS210216-TA1227 bp
ProteinDPOGS210216-PA408 aa
Genomic positionDPSCF300196 - 623547-638726
RNAseq coverage329x (Rank: top 35%)
Annotation
HeliconiusHMEL0137742e-7867.28% 
BombyxBGIBMGA002541-TA0.078.93% 
DrosophilaCG7365-PA2e-6339.94% 
EBI UniRef50UniRef50_UPI00017921A27e-13059.61%UPI00017921A2 related cluster n=1 Tax=unknown RepID=UPI00017921A2
NCBI RefSeqXP_968657.18e-13462.81%PREDICTED: similar to AGAP011513-PA [Tribolium castaneum]
NCBI nr blastpgi|3323760954e-13459.68%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|910777681e-12962.81%PREDICTED: similar to AGAP011513-PA [Tribolium castaneum]
Group
Gene OntologyGO:00167881.8e-20hydrolase activity, acting on ester bonds
GO:00066291.8e-20lipid metabolic process
GO:00167877.2e-09hydrolase activity
KEGG pathwaytgu:1002325356e-44 
 K06269 (PPP1C)maps-> Meiosis - yeast
    Regulation of actin cytoskeleton
    Vascular smooth muscle contraction
    Insulin signaling pathway
    Focal adhesion
    Long-term potentiation
    Oocyte meiosis
InterPro domain[79-348] IPR0010871.8e-20Lipase, GDSL
[274-356] IPR0138317.2e-09Esterase, SGNH hydrolase-type, subgroup
[73-359] IPR0138302.6e-08Esterase, SGNH hydrolase-type
Orthology groupMCL17714 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210216-TA
ATGTGGTTGTTTGGCTTGCTGTTGTTAGTCAGCGAAATAAACGCGTTTAATGTCAGCCTCACAGGTTTTAACGATGTGTGGTCGGTACACTACCCTCCTCGCAAGTCGAACGTGCGTCGTCAGCGCGCCTACCCTCCGTCGATGCCGTTCCCGTGTCACGACTCCGTTTCGTGGGGCCGCAGCAGACAGGTCCCTACCTCCGTGCACCGCCTCAGACCAGGGGATATCGACGTCGTGGCAGCGATCGGAGACTCCTTGGTGGCGGGGAGCGGTGCTTTAGAAGAATTCGCCCTCGGAGCTTTCGTTGAATATAGAGGAATATCCTGGTGTGCAGGCGGTGACAGCACCTGGCGCGAGTTCCTCACTCTCCCCAACATCCTGAAGGAGTACAACCCTCAGCTGCGAGGGTACTCAACGGGGACGGGCGAGTGGCTCGCAAAAAACTCGCGACTGAACGTGGCCTTCCCCGTCGCCTCCGACCAGGACGCCTACAAACAAGCTAAGATCCTGGTGGCCAGGATGCGTTCGTCTCCTGACGTGGACATGAGCCATCATTGGAAGATGGTGACAGTGTTCATAGGGGCCAATGACCTATGTTCAGCCTCGTGTCTGTCCCCGGTGGCGTGGTCGCCGGCCGCTCACGCCAGGAAACTGGCGCGGGCCCTGGATTACCTCCACCAACACCTACCGCGGACTATCGTCAACCTTATACCGGTGTTAGACGTGTCGGTGTCGGTGCGCGTGCTCCGGCCCCTGATGTGTCGTCTGATGCATCCCTTGTTCTGCACGTGTTTCCACCGCGGCGGCGGGGAGCTGCGGGACCTGGTGCGCCAGGCCCGCCTCTACCAGGACGCGGAGATGGCGCTGGTGGAGAGCGGCCGCTACGACTCCCGCGCGGACTTCACGGTGGTGGTGCAGCCGTTCATGCGTCTGTTCAACGCGCCGTCGTCGGAGCCTCCCCTGCCGCTCGTCATACACCAGTCCTACATCACACACGACTGCTTCCACTTCTCCCAGAAGGGACACGCGCTGGCTGCGAACCTGCTGTGGAACAACCTCCTGGAGCCGGTGGGGAACAAGTCTTCTCCGTCGCCGCCGTCGTTGATGAAGACGTTCCGCTGTCCGTCCAGGCGCGCTCCGTTCATCTTCACCAACTTAAACTCCAAGGAGTTCCTCCTCACCGGCTCGCAGCCCGGAGCGGAGGATGACTACAATCGAACTTGGTGA

Protein sequence:

>DPOGS210216-PA
MWLFGLLLLVSEINAFNVSLTGFNDVWSVHYPPRKSNVRRQRAYPPSMPFPCHDSVSWGRSRQVPTSVHRLRPGDIDVVAAIGDSLVAGSGALEEFALGAFVEYRGISWCAGGDSTWREFLTLPNILKEYNPQLRGYSTGTGEWLAKNSRLNVAFPVASDQDAYKQAKILVARMRSSPDVDMSHHWKMVTVFIGANDLCSASCLSPVAWSPAAHARKLARALDYLHQHLPRTIVNLIPVLDVSVSVRVLRPLMCRLMHPLFCTCFHRGGGELRDLVRQARLYQDAEMALVESGRYDSRADFTVVVQPFMRLFNAPSSEPPLPLVIHQSYITHDCFHFSQKGHALAANLLWNNLLEPVGNKSSPSPPSLMKTFRCPSRRAPFIFTNLNSKEFLLTGSQPGAEDDYNRTW-