Monarch geneset OGS2.0

DPOGS210568
TranscriptDPOGS210568-TA1239 bp
ProteinDPOGS210568-PA412 aa
Genomic positionDPSCF300408 - 85880-94163
RNAseq coverage55x (Rank: top 69%)
Annotation
HeliconiusHMEL0085050.080.63% 
BombyxBGIBMGA009678-TA2e-16477.78% 
DrosophilaCG8172-PF1e-12681.42% 
EBI UniRef50UniRef50_Q17PV41e-14756.43%Serine protease n=4 Tax=Culicidae RepID=Q17PV4_AEDAE
NCBI RefSeqXP_001660119.12e-14856.43%serine protease [Aedes aegypti]
NCBI nr blastpgi|3504267073e-15261.74%PREDICTED: hypothetical protein LOC100740075 [Bombus impatiens]
NCBI nr blastxgi|3072114694e-15159.87%Serine proteinase stubble [Harpegnathos saltator]
Group
Gene OntologyGO:00038241.2e-92catalytic activity
GO:00042521.4e-90serine-type endopeptidase activity
GO:00065081.4e-90proteolysis
KEGG pathwayecb:1000513062e-43 
 K01344 (PROC)maps-> Complement and coagulation cascades
InterPro domain[160-411] IPR0090031.2e-92Peptidase cysteine/serine, trypsin-like
[169-406] IPR0012541.4e-90Peptidase S1/S6, chymotrypsin/Hap
[200-215] IPR0013149.9e-16Peptidase S1A, chymotrypsin-type
Orthology groupMCL15924 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210568-TA
ATGGATTTATTAGGAAAAATGATACCACAAACCTGTCGTTATAGAGGCGCAAGGTACGCCTGCGCCCTCAGCATCTCGTGCGTGCTGGGTGGTGGGAAACCGTTGGATCTGTGCTCCGGGGGCATGATCTGGTCCTGCTGTGTGGACAAGGAGACGACAGAACGACCAACGGAAGTGGCGCCACCTGTACACAACGCCAGTGACCTTATAGAATTGATCGGGACATTCCCACACGAGACCTTCGACTACCAAACAAACACACAATTCGAAGATCCCATCATCATTTACACGGAAACAAACCGCCCAAAACCGTACAAACCCACAGACCGGCCGCAACCGGCGCAAAACGGTTTGAACCGGCCGAAACCGGTTTGGGGCACAGACTACAATCACTACTACCACGTAAAACCCTCGTACCACGAGAGCACAACACACACAGACGACTTTTACACGGAGAGTGTAGGTGATAGACCTGGCTGCGGTGAACACTACACCCGGTCAAACCGGATAGTCGGTGGGCACTCGACCGGTTTCGGTTCTCATCCTTGGCAAGCCGCCCTCATCAAGTCCGGTTTCCTCAGTAAGAAGCTGGCCTGTGGAGGAGCCCTCATATCTGATCGATGGGTGATCACTGCCGCTCATTGTGTAGCCACTACTCCTAATTCACAACTGCGGGTCCGTCTCGGGGAGTGGGACGTTCGTGATGCTGGGGAGAGATACTCGCACGAGGAGTTTGCTGTCCAGCGCAAGGAAGTCCATCCGTCATACGAACCCTCGGACTTCAGGAATGACGTGGCCTTAGTGCAGTTGGAGAGGGGTGTGGTGTTCAAGCAACATATACTGCCGGTATGTCTACCACAAAAGCAAATGAAGTTAGCCGGCAAAATGGCGACCGTAGCTGGCTGGGGGCGGACAAGACACGGACAGAGCACAGTCCCATCAGTGCTACAGGAGGTGGATGTAGAGGTTATCCCCAACGAGCGCTGTCAGCGTTGGTTCCGAGCCGCCGGGAGACGAGAAACGATTCACGACGTGTTCCTCTGCGCCGGCTACAAAGAGGGTGGTAGAGATTCCTGTCAGGGCGACAGCGGAGGCCCGCTGACCTTGAAATACGAGGGGCGAAGCACCCTCATAGGTCTCGTGTCGTGGGGCATCGGCTGCGGCAGGGAACACCTGCCGGGTGTCTACACTAACATACAGAAATTCGTCCCTTGGATCGACAAGCTAATCAACTCTTAG

Protein sequence:

>DPOGS210568-PA
MDLLGKMIPQTCRYRGARYACALSISCVLGGGKPLDLCSGGMIWSCCVDKETTERPTEVAPPVHNASDLIELIGTFPHETFDYQTNTQFEDPIIIYTETNRPKPYKPTDRPQPAQNGLNRPKPVWGTDYNHYYHVKPSYHESTTHTDDFYTESVGDRPGCGEHYTRSNRIVGGHSTGFGSHPWQAALIKSGFLSKKLACGGALISDRWVITAAHCVATTPNSQLRVRLGEWDVRDAGERYSHEEFAVQRKEVHPSYEPSDFRNDVALVQLERGVVFKQHILPVCLPQKQMKLAGKMATVAGWGRTRHGQSTVPSVLQEVDVEVIPNERCQRWFRAAGRRETIHDVFLCAGYKEGGRDSCQGDSGGPLTLKYEGRSTLIGLVSWGIGCGREHLPGVYTNIQKFVPWIDKLINS-