Monarch geneset OGS2.0

DPOGS208137
Transcript:        DPOGS208137-TA    918 bp
Protein:           DPOGS208137-PA    305 aa
Genomic position:  DPSCF300545 + 19268-25648
RNAseq coverage:   34x (Rank: top 74%)

Annotation (source, best hit, E-value, identity, description):
Heliconius         HMEL014842          1e-37    67.57%
Bombyx             BGIBMGA011211-TA    7e-108   75.50%
Drosophila         CG8299-PA           1e-41    37.04%
EBI UniRef50       UniRef50_D6WWR4     4e-64    49.06%   Serine protease P74 n=1 Tax=Tribolium castaneum RepID=D6WWR4_TRICA
NCBI RefSeq        XP_975289.1         7e-65    49.06%   PREDICTED: similar to adhesive serine protease [Tribolium castaneum]
NCBI nr blastp     gi|91088739         1e-63    49.06%   PREDICTED: similar to adhesive serine protease [Tribolium castaneum]
NCBI nr blastx     gi|91088739         8e-65    49.24%   PREDICTED: similar to adhesive serine protease [Tribolium castaneum]

Group:
Gene Ontology:
  GO:0004252    2.4e-86    serine-type endopeptidase activity
  GO:0006508    2.4e-86    proteolysis
  GO:0003824    5.9e-86    catalytic activity
KEGG pathway:
  ecb:100056650    6e-42
  K01324 (KLKB1) maps-> Complement and coagulation cascades
InterPro domain:
  [54-293]    IPR001254    2.4e-86    Peptidase S1/S6, chymotrypsin/Hap
  [41-298]    IPR009003    5.9e-86    Peptidase cysteine/serine, trypsin-like
  [87-102]    IPR001314    2.2e-10    Peptidase S1A, chymotrypsin-type
Orthology group:   MCL20534    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208137-TA
ATGTTCAGCCAAATAAATCATATGGAATTTTATAAGAACATAACCCGCCCTTCGAAATCGTTCAGCAGTTTCTCTAACAACTACTTCAATCAGTTACCCAAGGGTAAGGAGAAAGTGCAGTGCGGGACGCGGACGGTAGACTATACCCTGCGGAGGGGTAAAATTGTCGGAGGCTCTGAGGCGCCCTATGGAGCATTCCCCTGGCAAGTTGAGATACAGATGTTGGATACAGATAAATTGACGTTCGAACATCACTGTGGTGGCGCTGTGATCGCTGAGCGACTAGTCGTTTCCGCCGCTCATTGTTTTGATAAGCAACCCTTGCACCTCGACCACATTAGGATTCTGGTCGGAGAGCACAGACTGAAGATGCAAGACAAGCATGAGAACAGATTCCTGGCGGAGAAGGTGGTTCCTCACCCGGAGTTTAGAAAGAACGGGCCTCATAGCAACGACATAGCCGTTGTGCTGGTGTCTAAATCTGGTGTTCAGTTCAATTCCCATGTTAGACCGATCTGTCTCCCCGATAAGGTCGAAGCGAACGGCAGGTGGTGTGCTGTCAGTGGTTGGGGCTACCAGGATGAGAGTACGGAGAATTTCTCTCCAGTGCTGAGGGCTGCAGCGGTTCCAGTTTTAGATTTGGCGACCTGCCGCAGAGACAAAGTTCTGGGCAGCTGCAAGCAGACGATATTGGACTCTATGATATGCGCTGGTGCACTGTCGGGGGGTGTGGACGCTTGTAAAGGGGATTCTGGAGGACCGCTGGCATGTCTCGGCCATCGCTGGCAGTTAACTGGCGTGGTCTCTTGGGGCGCGGGCTGCGGTCGACGAGCACGGCCGGGTGTCTACACGAGAGTAGCTTCCTATGTTGACTGGTTAAGACACACGGCCGCCCAGATGGGGCAGAGAATAGCCTAG

Protein sequence:

>DPOGS208137-PA
MFSQINHMEFYKNITRPSKSFSSFSNNYFNQLPKGKEKVQCGTRTVDYTLRRGKIVGGSEAPYGAFPWQVEIQMLDTDKLTFEHHCGGAVIAERLVVSAAHCFDKQPLHLDHIRILVGEHRLKMQDKHENRFLAEKVVPHPEFRKNGPHSNDIAVVLVSKSGVQFNSHVRPICLPDKVEANGRWCAVSGWGYQDESTENFSPVLRAAAVPVLDLATCRRDKVLGSCKQTILDSMICAGALSGGVDACKGDSGGPLACLGHRWQLTGVVSWGAGCGRRARPGVYTRVASYVDWLRHTAAQMGQRIA-
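
The transcript and protein records above should agree with each other: 918 bp is 306 codons, that is, 305 residues plus the stop codon that the protein record writes as a trailing "-". The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative only and are not part of the database or of any library.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this table is the right one for a nuclear-encoded monarch gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its one-letter amino acid ("*" marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight and last four codons of DPOGS208137-TA, copied from the record.
print(translate("ATGTTCAGCCAAATAAATCATATG"))  # MFSQINHM
print(translate("AGAATAGCCTAG"))              # RIA-
# 918 bp -> 306 codons -> 305 residues once the stop codon is excluded.
print(918 // 3 - 1)                           # 305
```

Passing the full `>DPOGS208137-TA` sequence to `translate` and comparing the result with the `>DPOGS208137-PA` string is the complete version of the same check. The two fragments here only confirm that the start and end of the two records line up.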