Monarch geneset OGS2.0

DPOGS202235
TranscriptDPOGS202235-TA1404 bp
ProteinDPOGS202235-PA467 aa
Genomic positionDPSCF300149 + 597200-601128
RNAseq coverage1047x (Rank: top 12%)
Annotation
HeliconiusHMEL0091711e-13468.99% 
BombyxBGIBMGA013534-TA4e-14865.22% 
DrosophilaCG2493-PA4e-12647.51% 
EBI UniRef50UniRef50_B7S8R41e-13450.97%Prolylcarboxypeptidase n=13 Tax=Eumetazoa RepID=B7S8R4_9HYME
NCBI RefSeqXP_002424063.16e-13852.30%Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3214681463e-13851.64%hypothetical protein DAPPUDRAFT_225197 [Daphnia pulex]
NCBI nr blastxgi|3214681464e-13851.52%hypothetical protein DAPPUDRAFT_225197 [Daphnia pulex]
Group
Gene OntologyGO:00065081.2e-202proteolysis
GO:00082361.2e-202serine-type peptidase activity
KEGG pathway 
InterPro domain[19-464] IPR0087581.2e-202Peptidase S28
Orthology groupMCL12865 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202235-TA
ATGTTCAGAGTAATATCGCTTGTATTGTTCATAAATTATGTGACCTGTGACTACAAGTTCGAGACGAAATGGTTCAATGTGCCCCTGGATCACTTCGGATTCCAGAGAAACGAAACTTTCAACATAAAATATCTGATCAACGAGGAGTATTGGGACAAGGGAGGCGGACCGATATTCTTCTATACAGGAAATGAGGGACAAATTGAGGTATTCGCGAAACACACCGGCTTTATGTGGGACATCGCTGAGGAATTTAAAGCGAAATTGGTATTTGCAGAACATAGATACTATGGTCAATCAATGCCCTTCGGTAATAAGTCACTGGACAACGAGCACATTGGCTACTTGACATCCGAACAGGCGTTAGCTGATTACGCAGACCTCATAAACTATCTACAGGGAAACAAACAGAGACCGACATACCCCGTCATTGCTTTTGGAGGATCTTACGGTGGAATGCTCTCCGCCTACATACGCATCAAGTACCCTCACCTGGTGACGGGCGCCATAGCCGCCTCGGCCCCGATCCACATGTACCCCGGGATGGTGCCGTGCGAAGTGTTCCACAGGATTGTGACTTCCAGCTTCAAAATAGCGGATGAAAAATGCGTTAAAAATATAAGAAGCTCGTGGGGTGTTCTTAGAAAATTTCTCGAAAGTCAAAACAATACCGATTGGCTTCACAAGAACTGGAACCTGTGCGAGCCCGTGAAGCCTGCGGATGTGAACACCTTGATGGAGTTCCTCCAGTCGATGTACGAAACCCTCGCGATGGTGAACTATCCCTTCCCGTCGGACTTCCTGCTGCCCCTGCCCGCGCAGCCGGTGCGAGTAGTGTGTCAGTACTTGAACGAGACCCTCAGTGGACAAAAACTCATTGAGGCTATTGGTAAGGTGATCAAAGTGTACAGCAACTATGATGGCAAAGCCCCCTGTGTCGACTACAAGAAGGGAGACGACTTCGGCAATCTTGACGCTAGCGGATGGGACTATCAGGCGTGCACAGAGATGATAATGCCGATGTGCACTACCGGAAACCAAGATATGTTCGAGCCCTCCCCTTGGAACTTCACCAAATACGCTGAAGACTGTCACAGGAAATACAACGTGTACCCGCGACAGGAGGCGGCTCGGATACAGTACGGAGGAGACAGGCTTCGAGCGGCGACCAATATTGTGTTCAGTAATGGACTGCTGGATCCCTGGGCGGGAGGCGGCATCCTGAATAGTATCAGTAATTCAGTGAAGGCAGTTGTTATCATCGACGCGGCCCATCACCTTGACCTGATGCCTTCCAACCCAGCTGATCCCAATTCAGTAAAACTCGCCAGAAACATACACAAACAGAACATAGACAAATGGATACGAGAGTTCCGCACGGAACGCTCCGACAGACACCATTAG

Protein sequence:

>DPOGS202235-PA
MFRVISLVLFINYVTCDYKFETKWFNVPLDHFGFQRNETFNIKYLINEEYWDKGGGPIFFYTGNEGQIEVFAKHTGFMWDIAEEFKAKLVFAEHRYYGQSMPFGNKSLDNEHIGYLTSEQALADYADLINYLQGNKQRPTYPVIAFGGSYGGMLSAYIRIKYPHLVTGAIAASAPIHMYPGMVPCEVFHRIVTSSFKIADEKCVKNIRSSWGVLRKFLESQNNTDWLHKNWNLCEPVKPADVNTLMEFLQSMYETLAMVNYPFPSDFLLPLPAQPVRVVCQYLNETLSGQKLIEAIGKVIKVYSNYDGKAPCVDYKKGDDFGNLDASGWDYQACTEMIMPMCTTGNQDMFEPSPWNFTKYAEDCHRKYNVYPRQEAARIQYGGDRLRAATNIVFSNGLLDPWAGGGILNSISNSVKAVVIIDAAHHLDLMPSNPADPNSVKLARNIHKQNIDKWIREFRTERSDRHH-