Monarch geneset OGS2.0

DPOGS206593
TranscriptDPOGS206593-TA2721 bp
ProteinDPOGS206593-PA906 aa
Genomic positionDPSCF300048 - 1709696-1721846
RNAseq coverage262x (Rank: top 41%)
Annotation
Heliconius% 
BombyxBGIBMGA008324-TA3e-3372.84% 
DrosophilaCG11055-PC2e-15252.09% 
EBI UniRef50UniRef50_Q7QCC98e-16353.70%AGAP002567-PA n=1 Tax=Anopheles gambiae RepID=Q7QCC9_ANOGA
NCBI RefSeqXP_975636.17e-15856.22%PREDICTED: similar to hormone-sensitive lipase [Tribolium castaneum]
NCBI nr blastpgi|3479680943e-16253.70%AGAP002567-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479680944e-15654.08%AGAP002567-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160424.8e-113lipid catabolic process
GO:00082034.8e-113cholesterol metabolic process
GO:00162984.8e-113lipase activity
GO:00167871.5e-30hydrolase activity
GO:00081521.5e-30metabolic process
KEGG pathwayxla:1001747965e-101 
 K07188 (E3.1.1.79, LIPE, HSL)maps-> Insulin signaling pathway
InterPro domain[26-327] IPR0104684.8e-113Hormone-sensitive lipase, N-terminal
[341-503] IPR0130941.5e-30Alpha/beta hydrolase fold-3
Orthology groupMCL14818 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206593-TA
ATGGAATTCGCGGAGCCCCCTGCGAAGGCATCATTATGCTGTGAGGATACTCCCGAAGGTTCTCCGCCGACGTATGCAATGTATGAGGCTTTAAAGGAATCGTGTCAGAATAATGCTAGTTATTTTCAACCTGATGACAGTGAGAATGGACAGCGATTGTACCAGGGCTTCATGACACTGATAGATCATATAGATACAGTTTGGCCTCTCGTGGATCACGTTCGAAAGGTGGCTCCGTTGTATGACTTCGATGCCAAATCTCCCGGCAATGGCTACCGTAGCTTCGTGTCCGTCGTGGATTCTTGCGTTTTGAATGGACTGAAATTGAGCCGTCAAGTATGCACCGGACGTGACGCGCTACTTTTTAGAAAAGGTTATTTTGTTAAGGAAGTTGAGTCTAACGGTCAGCTGCTAGCGTCTCTAGGAACTTGTCTCCACCACCTCCAGACCTTACTCTCGTGGGCTCCACCCGGGGAACTCTTCCCTACAGAACCCCATTCGCCCGAAGAACTGTTCTCACAAGCGGATACCATCAATCAGTATTGCTTTTACGGAAGATGTCTCGGATTTCAGTTTCTTCCATCCATGAGGAATATATTAAAGGGCATATCTATATGCATGGCCGGTTTCTCGGAAGCGTATTACAGTCACGGTAACTTGATCAGCTCAATGTGGACCGGGGGACAGTATCTCATAGACCCGGAGATGAGGGCGCGTCGTATCGTCAATATATCACAATCGGCGAGCGTTGAATTCTGCAAAGCTTTTTGGTTCCTCGCTGAGAGTGAAATAATGCGTAGAGTGCCGAGCTTGATGTCGTCGACGGTGGCTGTTAATAAATTAATAACAATACCACCAGAACCGCTGGCGGTGACGACGAAAGACGGGAAACAGCTGACAGTCCTGCCCCCAGAAGTACACATAGGGTTACAGGGATTAAATGTTAGGCTCATCAGTGTTAATAAAAGAATGGGAATGTCTGATGAAAGCTCATCGAATCTACCGCCAGCTGAGGGGGTGGTTTTTCATTGTCACGGAGGCGGTTTCGTCGCGCAGAGTTCGAAATCCCATGAGACGTATCTAAGAGAATGGGCCGCGAAATTGAACATGCCGATACTGTCCGTAGATTACAGCCTGGCCCCGCAGGCGCCGTTCCCGAGGGCTCTGGAGGAAGTGTATTACGCTTATTGCTGGTTGTTAAACAATTTCAAGCTAATCGGCACCACTGGTAAACGTATAGTATTTGCTGGGGATTCAGCCGGTGCGAACCTCATAGCCGGTTGCACCCTCAAGATCCTGTCTTCAGGACTTCGTACCCCCGAGGGTTTGTTCATGGCCTACGCCCCCTTACTAGTGAGCTTCATACCGAGTCCCGCAAGACTGCTGTGTCTCATGGATCCCTTACTTCCCTTCGGGTTTATGATGAGGTGTCTCAAAGCCTACGCAAGCCCCAACACTAAAGGTAAGGATGAAAAGCATCCGAACAAAGTGAACACGCCGTCGAACGCTACAAGCCCTGTCGAGGGCAACGGATTCCTTAGAGTGAGCCCATCCCAAGGTAAACGTATAGTATTTGCTGGGGATTCAGCCGGTGCGAACCTTATAGCCGGTTGCACCCTCAAGATCCTGTCTTCAGGACTCCGTACCCCCGAGGGTTTGTTCATGGCCTACGCCCCCTTACTAGTGAGCTTCATACCGAGTCCCGCAAGACTGCTATGTCTCATGGATCCCTTACTTCCCTTCGGGTTTATGATGAGGTGTCTCAAAGCCTACGCAAGCCCCAACACTAAAGGTAAGGATGAAAAGCATCCGAACAAAGTGAACACGCCGTCGAACGCCACAAGCCCTGTCGAGGGCAACGGATTCCTTAGGGTGAGCCCGTCCCAAGAGGGAATAAGCTCAGGACCGTCATCGTTCGAGGAGGTCTCGCCATCTGACCTCGCAGAACTTCAGGCGCACAAGTCCGGCAGTGAGAGGAGACAGTCTGCCGACACCACCATCAGCGGAGGGTCGCTGCTGAGCGAGCACACCGCCACCGGTATATCACCGACGGAGGACAAATCACAACAGTACGTATCAGACTTCCTCGACAAATACGTGTTCAATAGCGACACGGACTCTGAAGGGCGCAAATTGTCTGTTGTCAAAGCTAATAAGAAGTTACAGAGGGACACCGAATCTGAAAGCACACTCGTCGGTGAGCCTCCCCTCATACAAGACCAGGAACACAGAGATAAGAAAAGGATAAAGGCGCGTATAAGTGAAGCAGCTACCGGTTTAATGGGCGCCATGTCGTCAAGGCTAGCGTACATAACCGGTTCAAATAACATAAGGCCCACCCAAGAGGAGTTGTCAGTCCGTTCGAACCTGGACGCGCTGATAGCCCGCAGTCCGTCCGACGAGTTCATATTCTCTGTGCCACGTGACCCGCTCCTGTCGCCGTACTGGGCGGACGACGATCTACTAAAGAGGTTCCCACCCGTGAGGTTGTTGACTGTACATTTAGATCCTTGCCTTGACGACTGCGTGATGTTTGCTAAAAAACTTAAAGGTTTGGGCAACGAGGTGGGTATCGATGTCTTAGAGGGGCTGCCTCATGGATTCCTTAATTTCTCTCTTATGGCCAAAGAAGCGAACGAAGGTTCAAAACTTTGCGTGGAGCGCATAAAACAGTTGTTGGACTTGGAAAATCCTACGACGCCCGAGAACAATCATTTATGA

Protein sequence:

>DPOGS206593-PA
MEFAEPPAKASLCCEDTPEGSPPTYAMYEALKESCQNNASYFQPDDSENGQRLYQGFMTLIDHIDTVWPLVDHVRKVAPLYDFDAKSPGNGYRSFVSVVDSCVLNGLKLSRQVCTGRDALLFRKGYFVKEVESNGQLLASLGTCLHHLQTLLSWAPPGELFPTEPHSPEELFSQADTINQYCFYGRCLGFQFLPSMRNILKGISICMAGFSEAYYSHGNLISSMWTGGQYLIDPEMRARRIVNISQSASVEFCKAFWFLAESEIMRRVPSLMSSTVAVNKLITIPPEPLAVTTKDGKQLTVLPPEVHIGLQGLNVRLISVNKRMGMSDESSSNLPPAEGVVFHCHGGGFVAQSSKSHETYLREWAAKLNMPILSVDYSLAPQAPFPRALEEVYYAYCWLLNNFKLIGTTGKRIVFAGDSAGANLIAGCTLKILSSGLRTPEGLFMAYAPLLVSFIPSPARLLCLMDPLLPFGFMMRCLKAYASPNTKGKDEKHPNKVNTPSNATSPVEGNGFLRVSPSQGKRIVFAGDSAGANLIAGCTLKILSSGLRTPEGLFMAYAPLLVSFIPSPARLLCLMDPLLPFGFMMRCLKAYASPNTKGKDEKHPNKVNTPSNATSPVEGNGFLRVSPSQEGISSGPSSFEEVSPSDLAELQAHKSGSERRQSADTTISGGSLLSEHTATGISPTEDKSQQYVSDFLDKYVFNSDTDSEGRKLSVVKANKKLQRDTESESTLVGEPPLIQDQEHRDKKRIKARISEAATGLMGAMSSRLAYITGSNNIRPTQEELSVRSNLDALIARSPSDEFIFSVPRDPLLSPYWADDDLLKRFPPVRLLTVHLDPCLDDCVMFAKKLKGLGNEVGIDVLEGLPHGFLNFSLMAKEANEGSKLCVERIKQLLDLENPTTPENNHL-