Monarch geneset OGS2.0

DPOGS209870
TranscriptDPOGS209870-TA741 bp
ProteinDPOGS209870-PA246 aa
Genomic positionDPSCF300302 - 29519-32231
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0075291e-4446.86% 
BombyxBGIBMGA004425-TA3e-7553.14% 
DrosophilaTry29F-PC4e-1329.59% 
EBI UniRef50UniRef50_P515886e-1429.82%Trypsin n=16 Tax=Schizophora RepID=TRYP_SARBU
NCBI RefSeqXP_002057902.12e-1430.53%GJ18385 [Drosophila virilis]
NCBI nr blastpgi|17177882e-1329.82%trypsin-like enzyme [Neobellieria bullata]
NCBI nr blastxgi|1953985864e-1230.53%GJ18385 [Drosophila virilis]
Group
Gene OntologyGO:00038249.8e-31catalytic activity
GO:00042529.3e-18serine-type endopeptidase activity
GO:00065089.3e-18proteolysis
KEGG pathwaydpo:Dpse_GA218792e-10 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[1-229] IPR0090039.8e-31Peptidase cysteine/serine, trypsin-like
[2-215] IPR0012549.3e-18Peptidase S1/S6, chymotrypsin/Hap
[13-28] IPR0013144.2e-06Peptidase S1A, chymotrypsin-type
Orthology groupMCL25017 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209870-TA
ATGGTGGCGATATTGAAAAAATCAGTTTATATCAGCGCTGGAGCATTAATTGACCAAAGCTGGGTCCTCACGGGTGCTGATTCCTTGTTTATGATTAGAGAAACTACGAGGTTCATACGAGTTCGTCTTGGCAGTGTTAATTACAAGGAGGGGGGTTATCTGACCGGTATTAAGTTCTTCGAAATCCATCCTTACTTCGACGACAGTAAGCCACTTTTTGACGTGGCGCTTATAAAACTACCCGAACCAGTGAGAATGACCCCCAGTTTGAATCCAATAAGACTCCAAAAAAGATATCGTGATGTGATCCCAACTCATTTCACGGTAACCGCGTGGCCGAGACGGGGCAGAAATACATCAGGTCATAAGGAATCCCTGGAGGATATCAAACGACGAAGATTGTTATCCGTAACTCATTTGCATCCGTTGGACAGTGAGCAATGTTCAGACGACCTTGACACTTGGGTGCCTGACTTTAATAATAAGAAATTAATCATGTGTCTTGAGCCACCTATTAATGGTGATCCTTGTGAGAGAGATATAGGCGCTCCGGTTGTTCTTAACGGGATTCTATGGGGCGTGATATCATCTTGGAAGTCTGAAGATTGTGATGTTGATGGTGATTCGATATTTATGTCCTTGGTGTCAGCTGTAGAGATCAGCTCCTGGATTCACTCCACTATTCACGCACATAGATGGACGAAAAAACACACCATAGATTACGACGACAACTTCATTTGA

Protein sequence:

>DPOGS209870-PA
MVAILKKSVYISAGALIDQSWVLTGADSLFMIRETTRFIRVRLGSVNYKEGGYLTGIKFFEIHPYFDDSKPLFDVALIKLPEPVRMTPSLNPIRLQKRYRDVIPTHFTVTAWPRRGRNTSGHKESLEDIKRRRLLSVTHLHPLDSEQCSDDLDTWVPDFNNKKLIMCLEPPINGDPCERDIGAPVVLNGILWGVISSWKSEDCDVDGDSIFMSLVSAVEISSWIHSTIHAHRWTKKHTIDYDDNFI-