Monarch geneset OGS2.0

DPOGS206563
TranscriptDPOGS206563-TA1221 bp
ProteinDPOGS206563-PA406 aa
Genomic positionDPSCF300108 - 636570-639579
RNAseq coverage435x (Rank: top 28%)
Annotation
HeliconiusHMEL0043681e-9346.33% 
BombyxBGIBMGA013746-TA5e-5645.13% 
DrosophilaSp7-PA1e-4631.62% 
EBI UniRef50UniRef50_A0JCK67e-7444.24%PxProphenoloxidase-activating proteinase 3 n=1 Tax=Plutella xylostella RepID=A0JCK6_PLUXY
NCBI RefSeqNP_001036832.19e-6438.52%prophenoloxidase activating enzyme [Bombyx mori]
NCBI nr blastpgi|1179701972e-7344.24%pxProphenoloxidase-activating proteinase 3 [Plutella xylostella]
NCBI nr blastxgi|1179701975e-7644.36%pxProphenoloxidase-activating proteinase 3 [Plutella xylostella]
Group
Gene OntologyGO:00038244.8e-76catalytic activity
GO:00042528.9e-63serine-type endopeptidase activity
GO:00065088.9e-63proteolysis
KEGG pathwaydpo:Dpse_GA159031e-43 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[145-404] IPR0090034.8e-76Peptidase cysteine/serine, trypsin-like
[154-396] IPR0012548.9e-63Peptidase S1/S6, chymotrypsin/Hap
[26-79] IPR0227004.9e-14Proteinase, regulatory CLIP domain
[182-197] IPR0013141.6e-12Peptidase S1A, chymotrypsin-type
[26-80] IPR0066045.5e-10Disulphide knot CLIP
Orthology groupMCL34658 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206563-TA
ATGGATTTCGCTATACTACTATGTGCTGCCTTGTACATCGTTCATTTAACGGAATCTGTTGAAGCTGAAAAAAAATGTAAAGTTCCGAACGGTGACGCCGGTAACTGTGTCAGTATAAATATATGCCCACCACTGAAAAGCCTCTATAAGAAGAAACATAAAACAGTCAATGAAAACAGATTTATAAGACAATCTATATGTGGTTCCCGAGACTCCGATCCAATAAAGATCTGCTGTCCACCACAGTCTTCGTGGGTTGGGTTCGTCCCCACGCCGGTGGTAGTTCCACCATCACCAGTGTATAGACGACCTAAACCAGAACCGACCTTGTTACAACCCAATACACTCATGATACCGTCATTCAACAGCACCGACAGACAAACAGACAAACCGATGAGACCAAAACCTGACCTGCAAGATTACAACAGTGTGTGCGGCATAGACTCATCCAGTGGGAATAGAATTACAAATGGCAACGAAACCGCTGTGGACCAGTATCCTTGGTTGGCTTTATTGGAATATTCAAACGGCTTCCTCGGCTGCGGCGGGAGCTTGATCAGCTCCAGATACGTTCTTACAGCTGCACACTGTCTTAAAAGCTTACAAAACGGAGAGCCATTATATGTCCGTCTGGGGGAGTACAACATCACTTCCTTCCCCACGGACATCGTTGAAATAGACGGCGGTGGTTTTGAAGTTGTCACAGTAACAGTCATAGCCATTAGGGCTATGTATACACATCCGTTGTATTATAGAGACCTGAGACTACACGATATAGGTCTTATTGAAATGGAAGAAGCAGCAAATTTCAGCGATTTCATCAAAGTAATATGTCTTCCCCAAATGGATTACATGCCAATCTTCAACAGCTCAACCATATTCTACGTAGCTGGCTGGGGCAGTGACAATTTTAGTTCTGGCACTGAGGTCAAGATGGAGACCAGCGTCCCATACAAACTCCACAGTCAGTGCCCGTTGGTGATGGAACCGTATCCGATTCACCAGATATGTGCTGGCGGGGAAGGTGGAAGGGACACCTGCAGCGGGGACTCAGGTGGTCCATTGATGTATGAGACTCCGAGCCACAGATACGAAGCTGTAGGGATCGTGAGTTACGGCTCCAGGGATTGTGGTAAAGAAGGGGAACCGGCTGTTTACACTTATGTATACAATTACCTGCCCTGGATAAGGAATATTCTCAGCGGTAATGTGGACCAATGA

Protein sequence:

>DPOGS206563-PA
MDFAILLCAALYIVHLTESVEAEKKCKVPNGDAGNCVSINICPPLKSLYKKKHKTVNENRFIRQSICGSRDSDPIKICCPPQSSWVGFVPTPVVVPPSPVYRRPKPEPTLLQPNTLMIPSFNSTDRQTDKPMRPKPDLQDYNSVCGIDSSSGNRITNGNETAVDQYPWLALLEYSNGFLGCGGSLISSRYVLTAAHCLKSLQNGEPLYVRLGEYNITSFPTDIVEIDGGGFEVVTVTVIAIRAMYTHPLYYRDLRLHDIGLIEMEEAANFSDFIKVICLPQMDYMPIFNSSTIFYVAGWGSDNFSSGTEVKMETSVPYKLHSQCPLVMEPYPIHQICAGGEGGRDTCSGDSGGPLMYETPSHRYEAVGIVSYGSRDCGKEGEPAVYTYVYNYLPWIRNILSGNVDQ-