Monarch geneset OGS2.0

DPOGS207085
TranscriptDPOGS207085-TA1686 bp
ProteinDPOGS207085-PA561 aa
Genomic positionDPSCF300001 + 2773222-2785946
RNAseq coverage2478x (Rank: top 5%)
Annotation
HeliconiusHMEL0041430.068.05% 
BombyxBGIBMGA013049-TA3e-11965.08% 
DrosophilaCG31326-PA2e-4127.94% 
EBI UniRef50UniRef50_Q5MPB50.064.29%Hemolymph proteinase 19 n=1 Tax=Manduca sexta RepID=Q5MPB5_MANSE
NCBI RefSeqXP_970870.25e-10639.24%PREDICTED: similar to hemolymph proteinase 19 [Tribolium castaneum]
NCBI nr blastpgi|564184190.064.29%hemolymph proteinase 19 [Manduca sexta]
NCBI nr blastxgi|564184190.064.44%hemolymph proteinase 19 [Manduca sexta]
Group
Gene OntologyGO:00038241.2e-75catalytic activity
GO:00042528e-55serine-type endopeptidase activity
GO:00065088e-55proteolysis
KEGG pathway 
InterPro domain[297-559] IPR0090031.2e-75Peptidase cysteine/serine, trypsin-like
[308-554] IPR0012548e-55Peptidase S1/S6, chymotrypsin/Hap
[339-354] IPR0013142.8e-10Peptidase S1A, chymotrypsin-type
Orthology groupMCL18391 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207085-TA
ATGATTGGTGCGATCAGTATTCTAGTCATAGGTCTTTTTGTGCCAACACACGAACAAAGTACAACTATATCGCCATGCCCCAACGTGTTTATGTACGAACCCTCCGGCACAGAACCGGGAAGATGGTACGGGGTTGTCAATCTGTCAACAGATAGCACCTTACACTCTCTATGGTTGAATATCGTGCTTGATAGCAAGGCTGATATTTTAGGGAATTGGGTAGGAGATGTAACGACCACAGACAATATAGATTTCAAAATTGAAAACACAAGAATGAAAATACATCCTGGCCCGGCTGTGGCAGTTCGTTTCTTCGTACAATACAGCCCTCTCAATAAAGCACCACGCTTGAGTGCTATCAGACTAAATGGTAGAGAAATCTGCAATGCCAAAACACCACAACCAGCGATTGAAGTGGTTGAAACCGCGAGGCCTGATCCAACATCAATCAGACCAAGGCCTGAAACCTCAAGGCCTGTAGACCGGCCGGTAGATAGGCCAATAGATCGCCCTGTTGAAAGACCGATCGATCGTCCGGTAGACAGGCCCAGCGTCCAAACAAGACCTGTTCTGAGACCTGAAGATAAAACAAAACCTGTGTATGGAAGACCTTCTGGGGATGGACCTGTGTATGTACCAACATCTAATCCACCAATCGAGCAGAGCAATGTTAACCCATATAGCATCGGTGGTGGCCAGGTACCAGCTCAGAGTGTAACCCATTCAAGGCCAACATACCAGTTGACCACGACTTCTTATACATCCACCACGCCCGAAGACGAAGATAGTGACAGCGATGCTGACCCTTCAGAATACTTCAACGGCGGTCAACTACTGGTCACACCGGTACCCAGCGGCCAAGGATATGTACAGCCTAAAAATGAACAATGCGGTAAAGTGCTCCGAAACAATCCGAATCCTCTGGTGGTGAACGGCAAGCCGACGCTCGAAGGACAATGGCCCTGGCAGATAGCCCTTTATCAAACACAGACGGTGGATAGCAAGTACATTTGCGGCGGTACTCTCGTCTCCCACAAGCACGTGGTGACGGCAGCGCACTGCGTCACCCGCAAAGGTTCCAGTCGTACTGTGAACAAGAACACCCTCACCGTGTACTTGGGAAAACACAACCTCCGGACCTCTGTAGAGGGAGTTGAAATCAGACTTGTGGGTGAGATAACTGTCCACCCTCAGTACAACGCGTCCTCGTTCAGTCGTGATCTCAGCATCCTCAAGCTCCGCAAAGCCGTCGAGTACACAGAATTCATACGTGCCGCCTGCCTCTGGCCGGAGAACCAGATCGATTTGACGAACGTCATCGGCAAAAAGGGCTCCGTGGTAGGGTGGGGTTTCGACGAGACGGGAGTCGCAACTGAAGAACTGACACTAGTGGAGATGCCGGTGGTGGATCAAGAAACTTGCATCCGCTCTTACAGCGAGTTCTTCGCCAGATTCACTTCTGAGTACACATACTGCGCTGGATATAGAGATGGCACGTCAGTGTGTAATGGTGACAGCGGTGGGGGTATGGTGTTCGAGATGCAAGGATCGTGGTATCTGAGAGGCCTGGTATCCCTCTCAGTGGCGAGACAAAACGAATACAGATGTGACCCAACACACTACGTAGTATTTACAGACTTAGCCAAATTTTTATCTTGGATAAAGCAGCATGTAACTAGCGTCTAA

Protein sequence:

>DPOGS207085-PA
MIGAISILVIGLFVPTHEQSTTISPCPNVFMYEPSGTEPGRWYGVVNLSTDSTLHSLWLNIVLDSKADILGNWVGDVTTTDNIDFKIENTRMKIHPGPAVAVRFFVQYSPLNKAPRLSAIRLNGREICNAKTPQPAIEVVETARPDPTSIRPRPETSRPVDRPVDRPIDRPVERPIDRPVDRPSVQTRPVLRPEDKTKPVYGRPSGDGPVYVPTSNPPIEQSNVNPYSIGGGQVPAQSVTHSRPTYQLTTTSYTSTTPEDEDSDSDADPSEYFNGGQLLVTPVPSGQGYVQPKNEQCGKVLRNNPNPLVVNGKPTLEGQWPWQIALYQTQTVDSKYICGGTLVSHKHVVTAAHCVTRKGSSRTVNKNTLTVYLGKHNLRTSVEGVEIRLVGEITVHPQYNASSFSRDLSILKLRKAVEYTEFIRAACLWPENQIDLTNVIGKKGSVVGWGFDETGVATEELTLVEMPVVDQETCIRSYSEFFARFTSEYTYCAGYRDGTSVCNGDSGGGMVFEMQGSWYLRGLVSLSVARQNEYRCDPTHYVVFTDLAKFLSWIKQHVTSV-