Monarch geneset OGS2.0

DPOGS205777
Transcript: DPOGS205777-TA, 1074 bp
Protein: DPOGS205777-PA, 357 aa
Genomic position: DPSCF300144 - 420013-431348
RNAseq coverage: 2299x (Rank: top 5%)
Annotation
Source           Hit                E-value  Identity  Description
Heliconius       HMEL005120         1e-76    55.24%
Bombyx           BGIBMGA010590-TA   8e-77    52.63%
Drosophila       CG31954-PA         1e-39    37.07%
EBI UniRef50     UniRef50_Q9NB92    1e-68    53.60%    Trypsin AiT6 n=14 Tax=Obtectomera RepID=Q9NB92_AGRIP
NCBI RefSeq      XP_001356703.1     8e-41    38.53%    GA16585 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp   gi|8347638         1e-74    56.00%    trypsin precursor AiT9 [Agrotis ipsilon]
NCBI nr blastx   gi|156968295       3e-76    54.40%    protease [Helicoverpa armigera]
Group
Gene Ontology    GO:0004252  4.7e-76  serine-type endopeptidase activity
                 GO:0006508  4.7e-76  proteolysis
                 GO:0003824  3.7e-70  catalytic activity
KEGG pathway     ani:AN2366.2  1e-36
                 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3) maps-> Neuroactive ligand-receptor interaction
InterPro domain  [22-351]  IPR001254  4.7e-76  Peptidase S1/S6, chymotrypsin/Hap
                 [14-247]  IPR009003  3.7e-70  Peptidase cysteine/serine, trypsin-like
                 [53-68]   IPR001314  4.7e-13  Peptidase S1A, chymotrypsin-type
Orthology group  MCL18546  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205777-TA
ATGCGTGTGTTAATACTGTTAGCTGTTCTTGGAGCCGCTTGTGCGGCCCCAAGAAAATCAAACCGTATCGTGGGTGGCCAAAACACCAATATTGAAAGATATCCTTTCATGTCTGGTATGTTAGAAAATTCCTTCTGGGGAATAAGGCAGATGTGTGGTGGCACACTGATAACCAACAGAGCTGTAGTCTCCGCTGCTCATTGTTATTCAGGTCTTTCGCCATCCGCGTTGAGAGTGCGTTTGGGATCAACTTATGCTTCGTCTGGCGGACAAGTCCAAGTGGTTTCTAGAATCATCATGCATCCACAGTATAACTCACGCTTAATTATAAACGACGTCGCAGTAATCAGACTTCAAAACTCTGTTTCAATGTCTAATCAAATTCAAGTTGCACGAATTGCTGGACCACAGTACAACTTACCAGACAATACACGTCTCGATGTTATTGGTTGGGGAGTGACAAGGTACCAAGGAAGACCTTCTGAAGTACTCCAGCACGTTTCCGTTAACGTCATCAACCAAAGAATTTGTGTGGAACGCTACGCACAGCTCCAATCTTTACCCGGTATGGGATCCTGGCCAAGGGTAACTCCTGAGATGATGTGCGCTGGTATTCTGGATGTCGGTGGCAAGGACGCCTGCCAAGGAGACTCCGGCGGACCTGTTGTTCACAGCGGAAACGTACTTGTAGGAATTACTTCCTGGGGATACGAATGCGCTCACCCAACTTATCCGGGCGTTAACTACCAAGGAAGACCTTCTGAAGTACTCCAGCACGTTTCCGTTAACGTCATCAACCAAAGAATTTGTGTCGAACGCTACGCACAGCTCCAATCTTTACCCGGTATGGGATCCTGGCCAAGGGTAACTCCTGAGATGATGTGCGCTGGTATTCTGGATGTCGGTGGCAAGGACGCCTGCCAAGGAGACTCCGGCGGACCTGTTGTTCACAGCGGAAACGTACTTGTAGGAATTACTTCCTGGGGATACGAATGCGCTCACCCAACTTATCCGGGCGTTAACGTACGCGTTTCATCTTACGCCAACTGGATCTCGGCCAACGCAGTTAACTAA

Protein sequence:

>DPOGS205777-PA
MRVLILLAVLGAACAAPRKSNRIVGGQNTNIERYPFMSGMLENSFWGIRQMCGGTLITNRAVVSAAHCYSGLSPSALRVRLGSTYASSGGQVQVVSRIIMHPQYNSRLIINDVAVIRLQNSVSMSNQIQVARIAGPQYNLPDNTRLDVIGWGVTRYQGRPSEVLQHVSVNVINQRICVERYAQLQSLPGMGSWPRVTPEMMCAGILDVGGKDACQGDSGGPVVHSGNVLVGITSWGYECAHPTYPGVNYQGRPSEVLQHVSVNVINQRICVERYAQLQSLPGMGSWPRVTPEMMCAGILDVGGKDACQGDSGGPVVHSGNVLVGITSWGYECAHPTYPGVNVRVSSYANWISANAVN-
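
The transcript and protein lengths in this record are related through the genetic code: 357 codons plus one stop codon account for 1074 bp. The following is a minimal Python sketch of that check. It assumes the standard genetic code (NCBI translation table 1) applies and that the trailing "-" in the protein record marks the stop codon. The names translate and lengths_consistent are illustrative and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" is an assumption that
    matches how the protein record above appears to mark its end).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if residue == "*" else residue)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)
```

For example, translate("ATGCGTGTGTTAATACTG") on the first 18 bases of DPOGS205777-TA gives "MRVLIL", the first six residues of DPOGS205777-PA, and lengths_consistent(1074, 357) returns True.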