Monarch geneset OGS2.0

DPOGS201966
TranscriptDPOGS201966-TA2238 bp
ProteinDPOGS201966-PA745 aa
Genomic positionDPSCF300060 - 439278-449615
RNAseq coverage1158x (Rank: top 11%)
Annotation
HeliconiusHMEL0024021e-15971.93% 
BombyxBGIBMGA010546-TA1e-15067.66% 
DrosophilaSp7-PA2e-7641.11% 
EBI UniRef50UniRef50_D5LPT42e-14163.88%Prophenoloxidase activating proteinase 1 n=1 Tax=Biston betularia RepID=D5LPT4_9NEOP
NCBI RefSeqXP_312744.34e-14339.45%CLIP-domain serine protease subfamily B (AGAP003058-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1569684014e-14166.12%prophenoloxidase activating enzyme [Helicoverpa armigera]
NCBI nr blastxgi|2948460611e-14363.88%prophenoloxidase activating proteinase 1 [Biston betularia]
Group
Gene OntologyGO:00038241.8e-84catalytic activity
GO:00042526.7e-76serine-type endopeptidase activity
GO:00065086.7e-76proteolysis
KEGG pathwaydpo:Dpse_GA159033e-73 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[480-741] IPR0090031.8e-84Peptidase cysteine/serine, trypsin-like
[488-736] IPR0012546.7e-76Peptidase S1/S6, chymotrypsin/Hap
[381-434] IPR0066045.5e-17Disulphide knot CLIP
[22-74] IPR0227007.2e-15Proteinase, regulatory CLIP domain
[521-536] IPR0013141.8e-13Peptidase S1A, chymotrypsin-type
Orthology groupMCL16713 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201966-TA
ATGAAGCACATTTTGTTATTACTTGGCTTGTCGCTGCTATGGAATAGAATACTGTCGGATAGCTGTAAAACACCTGTTGGACAAACGAGCCACTGCGTATCTTTGTATGAATGTCCTCAACTCGTCACTGCTTTCGAACAGAAGCCATTAAAAAGTGAAGTAGTTACTTTTTTAAAACAATCACAGTGTGGTTTTGAAGACGATGTTCCGATGGTTTGTTGTGGAGGTCTACCAGATGGAATGGTTCAGGCGACTTCAAATACGCCCAAGCCAGTACAACAATCAAAATTAAGGATGAATGCGGATCCCGTATTCAAAGAAGATTCTTATCTAACAGCTCCAGAGCAATGTGCTGTAGACACGAATGGTGATAGAATATATGGAGGGCAGATTGCAGAAATTGACGAGTTTCCTTGCATGGCCTTACTAGGATACAAATCTGCCACTAAATCAAAATTGGTGTACGACTGTGGGGGAGCGCTGATAAATAGAAGATATGTTCTAACAGCAGCTCATTGCGTCGTTGGAAAAATTGAAACTGAGGTTGGGAAGTTAAACACTGTTCGTTTGGGTGAATACGATTTACAAGCGGACATCGACTGCTCTGACGGTGTGTGTGCTGATCCTGTTCAAGATATCTCTGTGCAGTCAGTATATCCACATCCGGGATTCTCTGACCAAAATATAAACAGAAAAGATGATATCGCCGTGATACGACTCGCTCAGAGAGCTACTTATTCACATTATGTCCAGCCAATATGTCTAGCGCAGAACAGTCCTTTGGATACAATCAGTTATTTCTATGTTGTTGGATGGGGAGCTACAGTCGGTGGCAAAAGCAGTCCAGTTAAGCTGAAATTGCCGTTACCGATATTTGATAAAACTCTATGTGTTCAGAAATATAGAGCCCTTAAAGCAGAGTTAACAACCGGGCAAATCTGTGCCGGTGGAAACTTTTCTAAAGATACATGCAATGGAGACTCTGGTGGTCCTCTTGCTAGGAAGACCGAGTCTGGGATTTGGGAAGCAGTTGGTGTTGTTTCTTTTGGATATGGATGTGGTAGAGATGGCTGGCCAGGCGTGTACACCTCTGTGCCCAATTACTTTGATTGGATACAAGACACCATACTTTCAGACAACTGTAAGACACCTCTCGGAAGAACAAGTCAGTGTATATCACTCTACGATTGCCCTCAGCTTGTTAGTGCATTCGAACAAAGACCACTAAGAAACGATGTCGTTTCATTCCTCAGACAATCACAGTGTGGATTTGAAGGTTATGTGCCAAGAGTTTGCTGTGGACCATTGCCTGATCTAGCGCCACAAAGACCGACAAGCACGGTCAGGCCAACACAACGTCCCAGACCGGATACAAATATCAATAGCAATGTGGATCCTGTGTTCCCTGAAGATTCTAATCTAGCTCCCCGCGACCAATGTGGAATAGATACTAATGGTGATAGGATATATGGTGGACAATTTACAGAATTGGATGAGTTCCCATGGATGGCTTTACTGGGATATAAACCTAATACCAGTCCACGATTGACTTACCAATGCGGAGGAGTACTGATCAATAGAAGATATGTTTTAACCGCTGCCCACTGTGTAGTCGGAAGTATTGAAACCGCAGTAGGAAAATTGAGCACTGTTCGTCTGGGTGAATATGATTTGCAAACGGACATCGACTGCTCTGATGGTCTCTGTGCCGATCCTGTTCAAGAGATCTCTGTGCAATCAGCATATCCAAACCCAGGGTTCTCTGACCAAAATATAAACAGAAAAGATGATATCGCCTTAGTGAGACTTTCGAAGAGAGCTACTTATTCATATTATGTCCAGCCAATATGTTTAGCAGATAACAGTCTTCGTTTAGACGTCGGTACTGACGTGTATGTCGCTGGTTGGGGGAATACTTTGGGAGGCAAAAGTAGTCCAGTGAAACTGAAACTGGCCTTACCTCTATTCAGTAAGTCGCGGTGCGTTCAGAAGTATAGAAGTCTGCAGGCGGAATTGACAAGCGGACAGTTATGCGCTGGTGGAGTTTTCGCCGAAGACGCGTGCAGAGGAGACTCTGGTGGTCCTTTAATGAGGAAATCTCCTTCTGGTATCTGGCAGTCAATTGCGATTGTCTCGTTTGGAAATGGATGTGGCAGAGATGGCTGGCCAGGCGTGTACACCTCTGTGCCCAGCTATTTGGATTGGATACAACAAACTATGCGCTCCTCCAACGTTTAA

Protein sequence:

>DPOGS201966-PA
MKHILLLLGLSLLWNRILSDSCKTPVGQTSHCVSLYECPQLVTAFEQKPLKSEVVTFLKQSQCGFEDDVPMVCCGGLPDGMVQATSNTPKPVQQSKLRMNADPVFKEDSYLTAPEQCAVDTNGDRIYGGQIAEIDEFPCMALLGYKSATKSKLVYDCGGALINRRYVLTAAHCVVGKIETEVGKLNTVRLGEYDLQADIDCSDGVCADPVQDISVQSVYPHPGFSDQNINRKDDIAVIRLAQRATYSHYVQPICLAQNSPLDTISYFYVVGWGATVGGKSSPVKLKLPLPIFDKTLCVQKYRALKAELTTGQICAGGNFSKDTCNGDSGGPLARKTESGIWEAVGVVSFGYGCGRDGWPGVYTSVPNYFDWIQDTILSDNCKTPLGRTSQCISLYDCPQLVSAFEQRPLRNDVVSFLRQSQCGFEGYVPRVCCGPLPDLAPQRPTSTVRPTQRPRPDTNINSNVDPVFPEDSNLAPRDQCGIDTNGDRIYGGQFTELDEFPWMALLGYKPNTSPRLTYQCGGVLINRRYVLTAAHCVVGSIETAVGKLSTVRLGEYDLQTDIDCSDGLCADPVQEISVQSAYPNPGFSDQNINRKDDIALVRLSKRATYSYYVQPICLADNSLRLDVGTDVYVAGWGNTLGGKSSPVKLKLALPLFSKSRCVQKYRSLQAELTSGQLCAGGVFAEDACRGDSGGPLMRKSPSGIWQSIAIVSFGNGCGRDGWPGVYTSVPSYLDWIQQTMRSSNV-