Monarch geneset OGS2.0

DPOGS201916
TranscriptDPOGS201916-TA2475 bp
ProteinDPOGS201916-PA824 aa
Genomic positionDPSCF300507 + 26829-37779
RNAseq coverage413x (Rank: top 29%)
Annotation
HeliconiusHMEL0085830.059.61% 
BombyxBGIBMGA009184-TA2e-16348.62% 
Drosophila26-29-p-PA1e-7231.15% 
EBI UniRef50UniRef50_D2KMR24e-16148.62%Putative peptidase n=1 Tax=Bombyx mori RepID=D2KMR2_BOMMO
NCBI RefSeqXP_001605879.12e-7432.84%PREDICTED: similar to homologue of Sarcophaga 26,29kDa proteinase [Nasonia vitripennis]
NCBI nr blastpgi|2813982061e-16048.62%putative peptidase [Bombyx mori]
NCBI nr blastxgi|2813982066e-16648.77%putative peptidase [Bombyx mori]
Group
Gene OntologyGO:00082343.6e-86cysteine-type peptidase activity
GO:00065084.9e-70proteolysis
KEGG pathwaynve:NEMVE_v1g1811813e-48 
 K01365 (CTSL)maps-> Lysosome
    Phagosome
    Antigen processing and presentation
InterPro domain[501-803] IPR0131283.6e-86Peptidase C1A, papain
[587-804] IPR0006684.9e-70Peptidase C1A, papain C-terminal
Orthology groupMCL25583 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201916-TA
ATGTGGGTGAGGACTGTAATCGCTGTGGCCTGTTTCGGCCTGGTAGTAACGGCTCAGTCTCGATATTCATCAGAAGAAAGATCATACAGACGGAACCGTCCATCGCGGCCGAGTCAATGGAGAGATCTTGCTGACGAATACGACAGAGATGAAGACAGAAGTCCACAAACACCGAGCATTAAATTTCCACTTCAAGCAATACCTTATGAATTACCTATAGGGGTTCGTGGTGCCAACTTACAGCTACTGGTACTCCCAATACCAACGTCAGCACTCAATCCTGGAGCTAGTCAGAAGTCAGAATGTAAACTAACACCAGAACCCGAGGAACCACAAATCGATGTGAGATGGGGTGTCCCAAGTATAGCATCAAGTCCGCCTGCTGGTATTTTGTCAAGTCCGCCTGTTGGTATATTATCAAGGCCAGCTGCTCCTAACACCATGAGTAACATTCCATTAAAAGCTACAAATCGTGATGAAGTTCCAATCAAGGCACCTTCGTATCCATCGTACATTAACGTAGAAAAATTGGATCAACCTGTGAGCCCACCGGAAATTCCTCCTTCTGTGTCATCGAAGGAAATATTAGAGCAATTCCTTGCACAGGCGCCTCAAGCCCCTCAACTGCCACCTGTACTGTTATCAGAATATGACAAAAGTGAACGTATACCGCATCCAGAAGAAGAGGATGATGAAATGAAGTATAACTTCATAGAAGAAAGTGCAACAGTGCCAACCGTCAAAACTAGTTTGCCATTTCAACAAAACATTTACAGTCGAGGAAGAAAAGTTCAACCCCAACCATCAAGGCCCCGTTTCTCACAATCAGTGGTGATACGAGGCGAACTTTTCGTCCCTAGAGCTGACTTCACAGAACCTTATACAGCCTGGTGGGATGCCTCGAGCGGATCGTCTAAGGTTGACTTCCATGGTGGATCCACAAGCACATACAGGACAGTGATGGCCGACAAAAGGCTTCAGCGATTGGAAATGCGAGTGGATCGCTCCGGAGAACGCGCTGTTCGTCGTTGTGGGAAAGCCTCCTCCTACCAAAACCATCCCCTTGACATCACCCATCCCGCGTTACCAGACATCGATTTATTTAACTTCGTTGGTTATGAGCCTGACGGGCGTGTGGAGCGTTGGCAGCACACAGTCGTTGGTCGAGAAGGAGAGCTTGGTGCTATTCAAGGGGAGTCCCTTAGTCTAAAACATGATCTGCTTCTCATACGTGATCCTGAAGACAAAGCAACGCCTATACTTTACACAGTGTCAGTGAACAGTTCTATCCTTGGTCCAAACGCTGACAGCTATGAGCATCGTTATTTGGATGTACGTCGCCACGAACAGAACGCTGATTTCTTCATACCAAAAATAAATGATCTATGTGACACTGTTGAACTGTTGAATGTTTCTCTACCGAATCATTTATCTCGCCTCGAACCTCTAAGGGAATACATACAACCTCATCGCGACCAGAAATACGAGGCGGCGATACGAGATTTCAAGGTGAAATACAACCGTCGTTACATAGACAGCTCCGAAGAAGCTGTGAGAACGACGTTACTTATGCAGCACAAGCGTTTCATTTCATCCGGTAACCGAGAGGGGGCCACTTTCGAGCTAGGGGTGAACTTTCTAGGAGACAGGCTGGACAACGAGCTGGGACAACTGCGCGGCGTGAAGCTAGAGGAGGAAAGAACGCAAGCGGAGCAATTCCCTCATTCAAGAAGTATGCTGAGAAAGGAAAGCGCTAAACTACCCGATAATTTCGACTGGAGGGTTAAAGGCGGAGTCTCACCTGTAGGATTCCAGGGTAAGTGTTCATCGTGTTGGTCGTTCGCGGTATCTGGTGCTGTGGAAGGGGCGCTGTTCGCAAGGACCGGCAAGCTCGTGCCCCTGTCACAACAATGCCTCGTGGATTGTGCACATCCATTTGGAGGCAAAGGTTGCAAAGGCACTTGGCCAAGCCATGCGTATGACTACGTCAAAAACCGGGGTCTGCCTGCCCTCGACGAGTACCCATCATATAAAGCGAAGGTCGAACAATGCGCAGAAAAATCAGTTCGTCCCGTAACACGTATTAGTGGACACGTTAACGTTACCGAAAATAGCCTTTCAGCTCTTAAGGTGGCCATACGAGACCACGCGCCCACCGTGGTCATCGTTGACGCGAAACTCAAGAGCTTTGTGTTTTACAAGCATGGAATAAATAAGATAAATAAATATTGCAGCGGTAAGACGCGTCCGCGTTTGAATCACGCTGTGTTAGCTGTGGGTTGGGGAGACCAGAATGAAGAGCACTTTATCCTGAAGAACTCCTGGTCGGAGTCGTGGGGCGAGCGCGGCTTCATGCGAATACACGCAAGGTCCAACACATGCGGTGTACTCTCCAGACCCAGCTACCCTCGTCTCGAAGACAGTGACGTCCTAAAACTGCCTGGACGTCCCTCAGCCAAGGACGTCCGCCAGTGA

Protein sequence:

>DPOGS201916-PA
MWVRTVIAVACFGLVVTAQSRYSSEERSYRRNRPSRPSQWRDLADEYDRDEDRSPQTPSIKFPLQAIPYELPIGVRGANLQLLVLPIPTSALNPGASQKSECKLTPEPEEPQIDVRWGVPSIASSPPAGILSSPPVGILSRPAAPNTMSNIPLKATNRDEVPIKAPSYPSYINVEKLDQPVSPPEIPPSVSSKEILEQFLAQAPQAPQLPPVLLSEYDKSERIPHPEEEDDEMKYNFIEESATVPTVKTSLPFQQNIYSRGRKVQPQPSRPRFSQSVVIRGELFVPRADFTEPYTAWWDASSGSSKVDFHGGSTSTYRTVMADKRLQRLEMRVDRSGERAVRRCGKASSYQNHPLDITHPALPDIDLFNFVGYEPDGRVERWQHTVVGREGELGAIQGESLSLKHDLLLIRDPEDKATPILYTVSVNSSILGPNADSYEHRYLDVRRHEQNADFFIPKINDLCDTVELLNVSLPNHLSRLEPLREYIQPHRDQKYEAAIRDFKVKYNRRYIDSSEEAVRTTLLMQHKRFISSGNREGATFELGVNFLGDRLDNELGQLRGVKLEEERTQAEQFPHSRSMLRKESAKLPDNFDWRVKGGVSPVGFQGKCSSCWSFAVSGAVEGALFARTGKLVPLSQQCLVDCAHPFGGKGCKGTWPSHAYDYVKNRGLPALDEYPSYKAKVEQCAEKSVRPVTRISGHVNVTENSLSALKVAIRDHAPTVVIVDAKLKSFVFYKHGINKINKYCSGKTRPRLNHAVLAVGWGDQNEEHFILKNSWSESWGERGFMRIHARSNTCGVLSRPSYPRLEDSDVLKLPGRPSAKDVRQ-