Monarch geneset OGS2.0

DPOGS204359
Transcript: DPOGS204359-TA; 1941 bp
Protein: DPOGS204359-PA; 646 aa
Genomic position: DPSCF300040 - 438431-444055
RNAseq coverage: 131x (Rank: top 56%)
Annotation
Heliconius: HMEL010779; 8e-151; 58.92%
Bombyx: BGIBMGA013737-TA; 1e-57; 33.98%
Drosophila: Sp212-PC; 1e-27; 31.14%
EBI UniRef50: UniRef50_Q5MPB9; 3e-119; 49.53%; Hemolymph proteinase 16 n=1 Tax=Manduca sexta RepID=Q5MPB9_MANSE
NCBI RefSeq: XP_318412.4; 2e-56; 40.81%; AGAP003960-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|56418411; 1e-118; 49.53%; hemolymph proteinase 16 [Manduca sexta]
NCBI nr blastx: gi|56418411; 6e-116; 49.53%; hemolymph proteinase 16 [Manduca sexta]
Group
Gene Ontology: GO:0003824; 7.2e-73; catalytic activity
    GO:0004252; 2.8e-54; serine-type endopeptidase activity
    GO:0006508; 2.8e-54; proteolysis
KEGG pathway 
InterPro domain: [371-644] IPR009003; 7.2e-73; Peptidase cysteine/serine, trypsin-like
    [381-636] IPR001254; 2.8e-54; Peptidase S1/S6, chymotrypsin/Hap
    [412-427] IPR001314; 1.4e-12; Peptidase S1A, chymotrypsin-type
Orthology group: MCL19851; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204359-TA
ATGATATTGAAGGAAGTAAAATGGTTGTCTTTCAAAGTTATAGGGAAATGGGCACCTCACCACGCTCCTTACATAATTAGTTTAAAAATAAATGATGTCGAAAACTGCTACGAGCCTCATCGAACGTATTTCAGTTTATTTGAATTCCTTGGAGTGACAAAGTTTTTTGAGGCACCTGATCAATTGTGTGGACGACGAAAAATAGATAGAAAGTACACTGAACTTATAGTGAATGGCTCTCCCACAAAGTATGGTGACTGGCCGTGGCACGCCGCTATATATCGCTATCGCAAGGCTTCTCTTAAATACATATGTGGGGGTACGCTCATCACCAAAGTTCTAGTCTTAACAGCTGCTCGCTGCGCCACAATAAGGGGAGAAACAGTCATTCCGGAAAGCCTTGGCGTTGTACTCGGCAAATATACGCTATACTCCATCGATATAGCCGTACAATCAAAAGAGGTTGTTGGTTGGGGGTTCGATGGTAATGATGAACTAACATCAGAACTCCATGAAAGTTTGATACCCGGCGTGGATGATAGTACATGTATTAAAAAAGAGCCCATCTTTTATTCTATTATGTTAGATTCTACTATATTATCTAGTATGTTAGATGGCAAAAAATTCTGTGCAGGTTACAGTAATGTTCTAAAATTTACCGTGTGTGCGAAAGCTGTGTGGACGAATGTCGGATCGGCGTATTTCTCAGTAAGACTGTGTAAGGATTCAAACGAAATCTCAGCATCATATGAGTCCGACCAGCCTCCCGAACTTGAAAACGCCTATGACGTTCATGTTTCTATATTATTTCCAAAATATACAACAGTGTACATAAAGTTAGACTCCGAAGGCAGTATTAAGTTGGCGGAAAAAACATATGCAAGAATATATCCCTATGACAATAATGAATTTTCAATTCGTTTTTTTGCTGAACATGATGGTTTGGGTTTCAAAGTCCATGGAAAAAAAATAGGAGTTGTTCCATATATTACTAGTTTGACAATTAATTCTCAAGAATATTGTTCAAAACCAGCCCTGGGATTTCTTGATGGATATGTGTCGGGATATAAAGACTATGCAGAAAGTGAAAATCGTAAGTTAGAAGGGAACTGTGGTAGACGTAAAGTATCGCACACAGAATTAATAGTCAATGGATCTCCCACTAAACCCGGTGACTGGCCCTGGCACACCGCAATATATAGACTGGATAGATCACAAATCAAATATATTTGCGGAGGAACACTTATTTCTAAATATTTTGTATTAACAGCTGCACACTGCACATCAATAAGAGGAGTTGCGCTTCTACCCGAAGTGTTAAGTGTTGTTCTTGGTAAATACAACTTAATTGGTGGGGATTTAGCTTCGGAAGAAAGAGAAATTCATCAAATAATTGTTCATGAGCAATACGACAAAAGATCGTTGGACAACGACATTGCATTACTGAAATTAAAAACCGAAGCAGTATTTACTGACTATATTCAACCAGCTTGCTTGTGGTACAGCAAAGCTTCGGAAAAATTCTCCGGTCGTGAGATAATTGGCAAAGTCGTTGGATGGGGTTTTGATAATACCGATAACTTAGCGCTCAAGCTCCGACAAGCTAGTGTCCCGTTAGTTTCAGATGTTGTTTGTATCAAAAGTAATGCTGTTTTCTATTCAAGGGTGCTCAATGGCAATAAATTTTGTGGTGGAAATCACAATGGTACTTCGGCATGTAATGGTGATAGCGGTGGAGCTTTCCAAGTGTTTATTCCTGATGATGCACAGGATCAAAGTGTGAACGCGTCAGGAGCTTGGCACGTCCGAGGAATCGTGTCGCAGACGATATCAAGATTTGACGTGCCGATATGTGATCCCCACCAATACGTTGTGTTCACGGATGTAGAAAAGTATAGATCTTGGATCGATAAGCATTTAGAGATAAATAATGAAATGTAA

Protein sequence:

>DPOGS204359-PA
MILKEVKWLSFKVIGKWAPHHAPYIISLKINDVENCYEPHRTYFSLFEFLGVTKFFEAPDQLCGRRKIDRKYTELIVNGSPTKYGDWPWHAAIYRYRKASLKYICGGTLITKVLVLTAARCATIRGETVIPESLGVVLGKYTLYSIDIAVQSKEVVGWGFDGNDELTSELHESLIPGVDDSTCIKKEPIFYSIMLDSTILSSMLDGKKFCAGYSNVLKFTVCAKAVWTNVGSAYFSVRLCKDSNEISASYESDQPPELENAYDVHVSILFPKYTTVYIKLDSEGSIKLAEKTYARIYPYDNNEFSIRFFAEHDGLGFKVHGKKIGVVPYITSLTINSQEYCSKPALGFLDGYVSGYKDYAESENRKLEGNCGRRKVSHTELIVNGSPTKPGDWPWHTAIYRLDRSQIKYICGGTLISKYFVLTAAHCTSIRGVALLPEVLSVVLGKYNLIGGDLASEEREIHQIIVHEQYDKRSLDNDIALLKLKTEAVFTDYIQPACLWYSKASEKFSGREIIGKVVGWGFDNTDNLALKLRQASVPLVSDVVCIKSNAVFYSRVLNGNKFCGGNHNGTSACNGDSGGAFQVFIPDDAQDQSVNASGAWHVRGIVSQTISRFDVPICDPHQYVVFTDVEKYRSWIDKHLEINNEM-
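
The transcript and protein lengths listed above can be cross-checked against each other: 646 residues plus one stop codon should account for 3 x 647 = 1941 nucleotides. The following is a minimal Python sketch of that check, together with a translation of the first and last codons of DPOGS204359-TA. It assumes the standard genetic code; the helper names `translate` and `cds_length` are hypothetical and not part of the database. The record writes the terminal stop as `-`, whereas this sketch prints it as `*`.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def cds_length(n_residues: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, optionally plus one stop codon."""
    return 3 * (n_residues + (1 if has_stop else 0))


# 646 aa plus a stop codon should match the 1941 bp transcript length.
print(cds_length(646))

# First eight codons and last five codons of DPOGS204359-TA.
print(translate("ATGATATTGAAGGAAGTAAAATGG"))
print(translate("AATAATGAAATGTAA"))
```

The two translated fragments correspond to the start (`MILKEVKW`) and the end (`NNEM` followed by the stop) of DPOGS204359-PA; passing the full nucleotide sequence to `translate` would extend the same comparison to the whole protein.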