Monarch geneset OGS2.0

DPOGS212603
TranscriptDPOGS212603-TA1041 bp
ProteinDPOGS212603-PA346 aa
Genomic positionDPSCF300245 - 96321-102553
RNAseq coverage67x (Rank: top 67%)
Annotation
HeliconiusHMEL0024696e-12562.69% 
BombyxBGIBMGA005196-TA6e-9348.70% 
DrosophilaCG17292-PB2e-3934.62% 
EBI UniRef50UniRef50_B4LS983e-4234.57%GJ16163 n=4 Tax=Drosophila RepID=B4LS98_DROVI
NCBI RefSeqXP_553025.31e-4535.06%AGAP009104-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582998353e-4435.06%AGAP009104-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582998351e-4335.06%AGAP009104-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00038241.5e-54catalytic activity
GO:00066291.5e-54lipid metabolic process
KEGG pathwaydme:Dmel_CG56654e-23 
 K01059 (LPL)maps-> Glycerolipid metabolism
    Alzheimer's disease
    PPAR signaling pathway
InterPro domain[84-319] IPR0007341.5e-54Lipase
[61-316] IPR0138183.1e-43Lipase, N-terminal
Orthology groupMCL25111 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212603-TA
ATGTCGATATTAAAAATGTGTAGGTGTAATTATTGGATTGCATTTTTTATTTTAGGTTTATCTTTGGAACACACGCAGACTGCTTGGTTAAAATGTTACAGAGGTTCCATGGACAACTATATGATGACATCACTGAGAAATCCAACTCCACTGTTGGGAGATCCGTGCATTGACAGGAACCTTCGTACGATGATTTATACATTTGGATATCGTGGGAAGCCAAACGGCCCAGCGACCACCGCTGTCTTACAAACATATTTGTCAACCGGCAAGAAGAATGTCATTCTATTGGATTGGCAGGAGGAGGCTAGGAGTGGCCTTTTAGGAATTCCACTGGGATATGCCTTATCTGCTGTCCCAAACGCGAAAAGGGTTGGTCAAGAGCTCGGTGCAGCTTTGATAATGTTATCTCAAGCCGGTCTTAATATGAGTGAGATTCATCTCACGGGACATTCTTTAGGAGCTCACGTCATGGCGTATGCGGGAAGATGGGCCCGAGAAAGAGGCCATGCTATATCAAGAATAACTGGTCTAGATCCAGCAAGAGCGTTGTTCGAGGGAGCCTTCGCTATAAAGACGGGTTTGGATCGCACGTGTGCCAAGTTCGTGGACATAATCCATACAAACCCTGGTAACTATGGTACAAGCAAGTCAAGTGGGACGGTGGACTTGTGGCCGAACTACTCGCCCGACGACGGCATGCAGCCGGGATGTCCGCTAGGAAGTTACAGCATGTTTACACCCGAGGATCTTTGTAGCCACGACCGATCGTGGCGCTACCTGGTGGAGTCGGTGCGCAACGGAACTTCCTTTATGGCGGCCTCTGCCGACAGTTTCGATACATGGCTAGGAATGGACAATCCACCGGCAACTATTTATATGGGGGATTTGGCGAATACACGGGCTCGGGGCAACTTTTTCTTCACCACGAACTCTCAACCGCCATACGGTCGAGGTATGACGGGCGTTCTGCCTGAAGGGCAACGGGCAAGAAGAAATTCAGCTTCAATCACATCACTACTTGGATTATTGAAGCGGTAG

Protein sequence:

>DPOGS212603-PA
MSILKMCRCNYWIAFFILGLSLEHTQTAWLKCYRGSMDNYMMTSLRNPTPLLGDPCIDRNLRTMIYTFGYRGKPNGPATTAVLQTYLSTGKKNVILLDWQEEARSGLLGIPLGYALSAVPNAKRVGQELGAALIMLSQAGLNMSEIHLTGHSLGAHVMAYAGRWARERGHAISRITGLDPARALFEGAFAIKTGLDRTCAKFVDIIHTNPGNYGTSKSSGTVDLWPNYSPDDGMQPGCPLGSYSMFTPEDLCSHDRSWRYLVESVRNGTSFMAASADSFDTWLGMDNPPATIYMGDLANTRARGNFFFTTNSQPPYGRGMTGVLPEGQRARRNSASITSLLGLLKR-