Monarch geneset OGS2.0

DPOGS215038
Transcript        DPOGS215038-TA  1863 bp
Protein           DPOGS215038-PA  620 aa
Genomic position  DPSCF300208 - 518711-523188
RNAseq coverage   832x (Rank: top 15%)
Annotation
Heliconius       HMEL005471        0.0     59.27%
Bombyx           BGIBMGA005667-TA  1e-177  53.12%
Drosophila
EBI UniRef50     UniRef50_C9S261   0.0     61.32%  Putative acylpeptide hydrolase n=4 Tax=Nymphalidae RepID=C9S261_9NEOP
NCBI RefSeq      XP_970931.1       4e-127  41.00%  PREDICTED: similar to acylpeptide hydrolase [Tribolium castaneum]
NCBI nr blastp   gi|261335927      0.0     61.32%  putative acylpeptide hydrolase [Heliconius melpomene]
NCBI nr blastx   gi|261335927      0.0     61.64%  putative acylpeptide hydrolase [Heliconius melpomene]
Group
Gene Ontology    GO:0006508  5.9e-12  proteolysis
                 GO:0008236  5.9e-12  serine-type peptidase activity
KEGG pathway
InterPro domain  [510-617]  IPR001375  5.9e-12  Peptidase S9, prolyl oligopeptidase, catalytic domain
                 [362-394]  IPR011042  9.9e-09  Six-bladed beta-propeller, TolB-like
Orthology group  MCL12734  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215038-TA
ATGGGGGATGTGTGGAGAATGGGCTCACATATGGAAAGTGTGATAAAAGCCTATAAGACTCTTAGCAAAATCCCATCCATAGTAGGGGGAACATTAAATAGTAATGGTAATAGGATTATATCGAAGTGGTCGGTACGTAACATTGACAAAGGCAAAAACACAAGATATCAGATTGAATATTTTTTGGATAAGAACTTAGATGTCACCAATGAGAGTTACTTTGGTGTGGATGTTAGTAATGAATTGCTAGTTGCATATTCACCGAATGAGACTTACAAAGCTGTTATATCCGAGGAAAAGGATGAAAAGGATAGCAAAAAGAAGTTCTTTTTGGAAATTTGGACAATGAATTGTCTTAGCCGTTCCATCGACCTCACCGCACTTGACATACATGGAGATGTCTATGCAGACTCCGAATTCGGATGCCTCGATTGGTCTCCCGATGAAAAGAAAATCGTCTATGTGGCGGAGAAGAAAGTTAAGAAATCCGAGCCTTACATTAAAAGGAAGCCGGCAGCTGGCACACCTGATGATAAAACAGTACCTGGGGAGGAGCATTTGTACAAGGAAGATTGGGGGGAACAATTGACATCCAAAATACAAGGGGTGATTGTGGTCTGTGATGTTGATAGTGAAACTTTCACTGTATTGGATAATCTGCCAGACGATTGGTGTCCCGGTCAGGTACGGTTTGCTCCTGATGGGAAGAGTGTGGTGGGGGTGGCCTGGGAGACTGGACTTAGACGTCTAGGTCTTATCTATTGTACTAACCGATATAGCTTCGTATTCAGTCTAACGCTAGACGGAGTGCTCAAGAAGTTAAGCCAAGTGACTTACTCAGTGCGTTCACCACGAGTGTCACCTCACAGAGTAGTGTGGCTACAGAGGTATGCTGGGGGTCCTCATCACTCCTGTCACCAGCTCGTGGGACTGACGTACCAGCAGATAGAGTCCATGAAAAATGTTGAAGTTGAACCAACGATCATAACGGATCTTGTCGAGACCGAGAGGAAGATATCAAACGATTTCTTCTACGGAATTTTCTGCCAGGGACTACCTTTAATGTGTTTCGTCAAAAACAAACAAGGGCTAAAAACGGACGACGAGAGAATAGTGTTCAGTACGCAACAGCAAAACGAAATTAGGAGTTACGTAGTGCACGTTGAGAGCGGTAACATGGTAGACATATCGCACAAGAAGGATGGCCCGGGGTCCACCACCGTGCTGTGTGTGAGGTCGGATGTAGTGCTGGCTACGTTTTCAAACTTAAGAACACCGTCACAGTTGTTCGTCGCTAGACTTCCACCAACAGGTCACGAGGCTGGCATCGAATGGGTGCCGGTGAGTAAACCACACACTCTGCCGAGTTCAATATCTCAAGGGAAAATACAGTACATGCACTTGGACCACAACAACGATGACAAAGTCTCCAAGTTCACGGCGATGTACTTCGGGCCGGACCAGCAAGGCATATATCCTCTGGTGGTGTGGCCACACGGCGGACCGCACTCCGCCTTCTCCAACACCTACTCCTTGGAAGCTGCGTTCTTCAACCTCATCGGATTCGCTACGTTGTTAATAAACTACCGCGGTTCAGCGGGCACTGGGAACGGTTCGATCTGCTATTTGCCGAGTCGCATAGGGACAGCTGATGTTCTGGACTGCAAACTCGCCACCGACAAGGCCATAGATATGTTCCCAGTTAACGATAAGAAGCTGTTGCTGTATGGCGGTTCTCACGGCGGGTTCCTGGTCGCGCACCTCAGCGGGTTGTTCTATGACTTCTACCACGCCGCCGTGCTGAGGAATCCTGTGATAGACCTCGCTTCAATGATCCACACCACTGACATCGCTGATTGGTGA

Protein sequence:

>DPOGS215038-PA
MGDVWRMGSHMESVIKAYKTLSKIPSIVGGTLNSNGNRIISKWSVRNIDKGKNTRYQIEYFLDKNLDVTNESYFGVDVSNELLVAYSPNETYKAVISEEKDEKDSKKKFFLEIWTMNCLSRSIDLTALDIHGDVYADSEFGCLDWSPDEKKIVYVAEKKVKKSEPYIKRKPAAGTPDDKTVPGEEHLYKEDWGEQLTSKIQGVIVVCDVDSETFTVLDNLPDDWCPGQVRFAPDGKSVVGVAWETGLRRLGLIYCTNRYSFVFSLTLDGVLKKLSQVTYSVRSPRVSPHRVVWLQRYAGGPHHSCHQLVGLTYQQIESMKNVEVEPTIITDLVETERKISNDFFYGIFCQGLPLMCFVKNKQGLKTDDERIVFSTQQQNEIRSYVVHVESGNMVDISHKKDGPGSTTVLCVRSDVVLATFSNLRTPSQLFVARLPPTGHEAGIEWVPVSKPHTLPSSISQGKIQYMHLDHNNDDKVSKFTAMYFGPDQQGIYPLVVWPHGGPHSAFSNTYSLEAAFFNLIGFATLLINYRGSAGTGNGSICYLPSRIGTADVLDCKLATDKAIDMFPVNDKKLLLYGGSHGGFLVAHLSGLFYDFYHAAVLRNPVIDLASMIHTTDIADW-
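
The two records above can be cross-checked against each other: 1863 bp is 621 codons, that is, 620 residues plus one stop codon. The transcript begins ATG GGG GAT GTG TGG (M G D V W) and ends GAT TGG TGA (D W, stop), which agrees with the first and last residues of the protein. The following is a minimal Python sketch of that check, not the pipeline used to build the geneset. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon. The names translate and matches_record are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard genetic code (assumed here), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def matches_record(cds: str, protein: str) -> bool:
    """Compare a translated CDS with a protein record.

    Assumption: the record writes the stop codon as '-', while
    translate() writes it as '*'.
    """
    return translate(cds) == protein.upper().replace("-", "*")


if __name__ == "__main__":
    # First five codons and the stop codon of DPOGS215038-TA.
    print(translate("ATGGGGGATGTGTGG" + "TGA"))
```

To check the full record, pass the complete DPOGS215038-TA sequence and the DPOGS215038-PA sequence to matches_record.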