Monarch geneset OGS2.0

DPOGS204144
Transcript: DPOGS204144-TA (684 bp)
Protein: DPOGS204144-PA (227 aa)
Genomic position: DPSCF300643 + 154-3558
RNAseq coverage: 1703x (Rank: top 7%)
Annotation
Heliconius       HMEL022597         3e-85  58.30%
Bombyx           BGIBMGA001745-TA   4e-78  53.87%
Drosophila       snk-PB             1e-35  44.51%
EBI UniRef50     UniRef50_G6D6G6    1e-76  55.06%  Serine protease 7 n=2 Tax=Obtectomera RepID=G6D6G6_DANPL
NCBI RefSeq      XP_969745.2        6e-48  39.10%  PREDICTED: similar to trypsin-like serine protease [Tribolium castaneum]
NCBI nr blastp   gi|364023601       3e-77  56.83%  seminal fluid protein CSSFP025 [Chilo suppressalis]
NCBI nr blastx   gi|364023601       6e-77  56.83%  seminal fluid protein CSSFP025 [Chilo suppressalis]
Group
Gene Ontology    GO:0003824  5.4e-61  catalytic activity
                 GO:0004252  3.6e-48  serine-type endopeptidase activity
                 GO:0006508  3.6e-48  proteolysis
KEGG pathway     dpo:Dpse_GA19543  2e-27
                 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3) maps to Neuroactive ligand-receptor interaction
InterPro domain  [8-223] IPR009003  5.4e-61  Peptidase cysteine/serine, trypsin-like
                 [15-218] IPR001254  3.6e-48  Peptidase S1/S6, chymotrypsin/Hap
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204144-TA
ATGTCTATATCCGCCCCAAAATGTGACTACAACGGTGTGGAGCTGATCGTGGGCGGAGAGGACGCTGATACAGGAGAGTTCCCACACATGCGTCCCGGTCGCTATACTAGTAAAGCTACCGCGTCTTCACGCGAATGTTTTTATATTCAGGTGCCAATAAAGAATATAATAAAGCATCCGCTATACAAGTCTCCGGGCAAGTATCATGATATAGCGTTACTGGAGCTGGCGTCCGACGTGGACTTCGACTCTTCCATCAGACCAGCCTGCCTCTGGTACAGACCCGACTTCCCTGGACACACTAAAGCAGTGGCCACCGGCTGGGGGGTCGTTGACCCGCGTACACAGCGCGCTTCAAACGAATTACAGAAAGTATCGCTGACGCTACTAGAGAACGATTTTTGTAATGTTTTATTAAAGACGAAGAGGAACAGGCTGTGGATGGACGGGTTCACTGCGGACCAGCTGTGTGCTGGAGAACTGAGGGGCGGCAAGGACACCTGTCAGGGCGACTCTGGGTCCCCCCTCCAGGTGGTGTCCCGGGAGAACAAGTGCGTGTTCCATATAGTCGGCATCACGTCCTTCGGTCACAGATGCGCCCAGTCTGGGAGTCCAGCTGTCTACACCAGGGTCTCTTCATACTTGGACTGGATAGAGTCTGTGGTATGGCCGGGGGAGGGATGA

Protein sequence:

>DPOGS204144-PA
MSISAPKCDYNGVELIVGGEDADTGEFPHMRPGRYTSKATASSRECFYIQVPIKNIIKHPLYKSPGKYHDIALLELASDVDFDSSIRPACLWYRPDFPGHTKAVATGWGVVDPRTQRASNELQKVSLTLLENDFCNVLLKTKRNRLWMDGFTADQLCAGELRGGKDTCQGDSGSPLQVVSRENKCVFHIVGITSFGHRCAQSGSPAVYTRVSSYLDWIESVVWPGEG-
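
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a pure coding sequence that ends in a stop codon and is translated with the standard genetic code; the function names `translate` and `lengths_consistent` are hypothetical helpers, not part of any database tool. The two test fragments are the first 30 and the last 21 nucleotides of DPOGS204144-TA, and the stop codon is written as "-" to match the protein record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0; stops are shown as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(stop_symbol if r == "*" else r for r in residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 30 nt of DPOGS204144-TA
    print(translate("ATGTCTATATCCGCCCCAAAATGTGACTAC"))  # MSISAPKCDY
    # Last 21 nt of DPOGS204144-TA, including the TGA stop codon
    print(translate("GTATGGCCGGGGGAGGGATGA"))           # VWPGEG-
    # 684 bp = 3 * (227 aa + 1 stop codon)
    print(lengths_consistent(684, 227))                  # True
```

Passing the full nucleotide sequence to `translate` should reproduce the protein record if the assumptions above hold; a mismatch would point to a frame shift or a non-standard genetic code rather than to an error in this sketch alone.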