Monarch geneset OGS2.0

DPOGS200027
TranscriptDPOGS200027-TA2415 bp
ProteinDPOGS200027-PA804 aa
Genomic positionDPSCF300337 - 60287-71516
RNAseq coverage45x (Rank: top 71%)
Annotation
HeliconiusHMEL0130645e-10545.21% 
BombyxBGIBMGA012427-TA7e-10357.72% 
Drosophilasnk-PB1e-4334.19% 
EBI UniRef50UniRef50_Q1HPQ61e-10057.72%Serine protease 7 n=4 Tax=Obtectomera RepID=Q1HPQ6_BOMMO
NCBI RefSeqNP_001040537.13e-10157.72%serine protease 7 [Bombyx mori]
NCBI nr blastpgi|564183999e-10558.46%hemolymph proteinase 9 [Manduca sexta]
NCBI nr blastxgi|564183995e-10657.66%hemolymph proteinase 9 [Manduca sexta]
Group
Gene OntologyGO:00038242.8e-72catalytic activity
GO:00042526.2e-70serine-type endopeptidase activity
GO:00065086.2e-70proteolysis
KEGG pathwaydpo:Dpse_GA195436e-40 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[526-799] IPR0090032.8e-72Peptidase cysteine/serine, trypsin-like
[540-794] IPR0012546.2e-70Peptidase S1/S6, chymotrypsin/Hap
[572-587] IPR0013144.4e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL21003 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200027-TA
ATGTCTGCTCTTATAATGTTTTTCATTTTATCAAACATATATTTTGAAGATCCCTGTGACTTCGTCGATCCTGTTCTGCCAGACTTCAGGTATCCTGGGAAAAGAATTTGCGAAGTTAAATGTGATGAATACGTTTGGCAGAGAATAAATCGGGACAAAGCTGCAAAAAGGTTTCAGGAATGTTTGGAATATAATAGAAATCAAGGTGTCCCCGGACCTTATGGTGCTATTGGTGGAAGAGATTCATTGTCAGGAGAATTTCCACATATGGGAGCTATAGGGTGGAGAGCTGTTCAGGGTACTTGGGTATTTAAATGCGGCAGCACACTCATCAGCCCGAAGTTCACTCTTACAGCAGCCCACTGTACCAGAGCGCCGCCAGACCCAAGAACTGTTTCTGAAGTACCACAAATTATAAGGTTCGGAGAAAAAAACATTATTGACGTGTTAGATCCAGTCGATGCGAACATAGTAAGATTTTTTGTCCACCCACAATATAAATCTCCTTCAAAATACAACGATATCGCATTGATTGAAATTGATACGGAGCTAAAGTTCTCTAAAAACATTCAGCCAGCTTGTCTATGGAGCTACATTGACACTAGTGTGCTCGGTTCAAGCGCTACTTTAACAGGGTGGGGAGTTATTGACACAGCTACAGGAAAGACGTCTCCAATTCTGCAAGCAGCAGGCGTAAATGTAATCGACGACGAGCTCTGTAACAGATTGTTGAAACGATCGTGCAGTAGACGATGGTGCGGAGTCAAAGATCAGATCTGTGCAGGAAAACTAGAAGGAGGAGTAGATGCTTGTCAAGGTGATTCCGGTGGACCCTTACAAATAAAAATACCTCTACCGCCATCTGATGAGGGTTCAATGCACTATGTAATAGGTGTGACGTCATTTGGGATAGGTTGCGCTAGACCAAATCTGCCTGGTTTGCTCAGTTCAACTGAGGCTATATATCTTAAAAGAAAATATGTTAAAAGGAATACAGATAAAATAGAGTCGGATCCTTGCGTGCCATACAATGCCACTCTGCCGAACTTCAAGAAATATGGACGAAGAATTAGTGAAGTCAAATGCGATGAATACGTTTGGCAGAGGATGAATAGACAGGAAAAACTCAACAGATGGTTCCGATGTCTCGCGAAACGGAGAGAAAAGGAAGGACCTGACGGGCTTTTCGTTTCGACTGAAGCAATTGGAGGCCGAGATGCGCTGCCAGGGGAATTCCCACACATGGGGGCATTAGGTTGGAAAGCTGTAGAGGGTACTTGGATATTCAAATGCGGTAGTACTCTTATCAGTCCAAAGTTCACTCTTACGGCAGCTCACTGCTCTAAGACACCTCCAGACCCCAAAACGAGTTCCCAAATTCCTCAAATTGTGAGATTTGGAGATAAAAACATAATAGATGTGAATACAGATGAAATAGAGTCGGATCCTTGCGTGCCATACAATGCCACTCTGCCGAACTTCAAGAAATATGGACGAAGAATTAGTGAAGTCAAATGCGATGAATACGTTTGGCAGAGGATAAATCGAGAGGAAAAAGCCAACAGAGAGTTCCGATGTCTCGCGCAACGGAGAGAAGAGGAAGGATCCGAGGAGACTGAAGCGATTGGAGGCCGAGATGCGCTACCAGGGGAATTCCCACATATGGGTGCATTAGGTTGGAAAGCTGTAGAGGGTACTTGGATATTCAAATGCGGTAGTACTCTCATCAGTCCAAAGTTCACTCTTACGGCCGCTCACTGCTCTAAGACACCTCCAGACCCCAAAACGAGTTCCCGAATTCCTCAAATTGTAAGATTTGGAGATAAAAACATAATAGATGTGTTTGCCAATGGATTACCTCCGATAGATGCTAACATAGTCACTATAACTGTTCACCCGCAATATAAGTCACCTTCGATGTATAATGATATAGCACTAGTTAAATTATACAAGGACATTACATTTATGAGTAACGTACAACCAGCTTGCCTTTGGAGCCATTCTGATACTAGTATACTAAGTTCGACAGCAACTTTAACTGGCTGGGGAGTTATCGACACAGCTACAAGAAAGACGTCTCCAATTCTTCAAGCAGCAGTCGTAGATGTAATCGACGATGAGCTCTGTAACAAATTGTTACAACGATCCTGTAGTAGACGATGGTGCGGAGTCAGAGATCAGATCTGTGCAGAAAAACTAGAAGGAGGAGTAGATGCTTGTCAAGGTGATTCCGGTGGACCTTTACAAGTAAAAATACCTTTACCGCCATCTGGTGAGGGTTCAATGCATTACGTTATAGGAGTAACATCGTTTGGAATCGGATGCGCTAGACCAAATCTTCCCGGCGTCTACACGAAAGTTTCCAGCTTCGTGGATTGGATTGAAAGTATCGTGTGGCCTGAAGAAATAATGTAA

Protein sequence:

>DPOGS200027-PA
MSALIMFFILSNIYFEDPCDFVDPVLPDFRYPGKRICEVKCDEYVWQRINRDKAAKRFQECLEYNRNQGVPGPYGAIGGRDSLSGEFPHMGAIGWRAVQGTWVFKCGSTLISPKFTLTAAHCTRAPPDPRTVSEVPQIIRFGEKNIIDVLDPVDANIVRFFVHPQYKSPSKYNDIALIEIDTELKFSKNIQPACLWSYIDTSVLGSSATLTGWGVIDTATGKTSPILQAAGVNVIDDELCNRLLKRSCSRRWCGVKDQICAGKLEGGVDACQGDSGGPLQIKIPLPPSDEGSMHYVIGVTSFGIGCARPNLPGLLSSTEAIYLKRKYVKRNTDKIESDPCVPYNATLPNFKKYGRRISEVKCDEYVWQRMNRQEKLNRWFRCLAKRREKEGPDGLFVSTEAIGGRDALPGEFPHMGALGWKAVEGTWIFKCGSTLISPKFTLTAAHCSKTPPDPKTSSQIPQIVRFGDKNIIDVNTDEIESDPCVPYNATLPNFKKYGRRISEVKCDEYVWQRINREEKANREFRCLAQRREEEGSEETEAIGGRDALPGEFPHMGALGWKAVEGTWIFKCGSTLISPKFTLTAAHCSKTPPDPKTSSRIPQIVRFGDKNIIDVFANGLPPIDANIVTITVHPQYKSPSMYNDIALVKLYKDITFMSNVQPACLWSHSDTSILSSTATLTGWGVIDTATRKTSPILQAAVVDVIDDELCNKLLQRSCSRRWCGVRDQICAEKLEGGVDACQGDSGGPLQVKIPLPPSGEGSMHYVIGVTSFGIGCARPNLPGVYTKVSSFVDWIESIVWPEEIM-