Monarch geneset OGS2.0

DPOGS215180
TranscriptDPOGS215180-TA2016 bp
ProteinDPOGS215180-PA671 aa
Genomic positionDPSCF300143 - 390001-401216
RNAseq coverage334x (Rank: top 35%)
Annotation
HeliconiusHMEL0092600.081.69% 
BombyxBGIBMGA008665-TA0.067.07% 
Drosophilamas-PB4e-14187.04% 
EBI UniRef50UniRef50_D6W7X10.059.43%Serine protease H51 n=2 Tax=Endopterygota RepID=D6W7X1_TRICA
NCBI RefSeqXP_968416.10.059.43%PREDICTED: similar to AGAP002815-PA [Tribolium castaneum]
NCBI nr blastpgi|910760980.059.43%PREDICTED: similar to AGAP002815-PA [Tribolium castaneum]
NCBI nr blastxgi|910760980.061.29%PREDICTED: similar to AGAP002815-PA [Tribolium castaneum]
Group
Gene OntologyGO:00038243.5e-87catalytic activity
GO:00042525.4e-83serine-type endopeptidase activity
GO:00065085.4e-83proteolysis
KEGG pathwayhsa:38182e-41 
 K01324 (KLKB1)maps-> Complement and coagulation cascades
InterPro domain[415-670] IPR0090033.5e-87Peptidase cysteine/serine, trypsin-like
[426-662] IPR0012545.4e-83Peptidase S1/S6, chymotrypsin/Hap
[454-469] IPR0013146.6e-15Peptidase S1A, chymotrypsin-type
Orthology groupMCL15897 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215180-TA
ATGAGAGTGCTCGCTATATTCCTCCTCCTGGCGGTCCCCATCCTCGGCCAGGATGAATCGTTCGCCAGCTCCTTCCTATCAGGTCTCCTTGACACCCTGGATACTCAAGTAGATGCGAAGAACTGTCCCGGGGTGTGCATGCATGCGCTGGCTTCGCTCATTTGTTCGAATGTTCTGGAGGAAGTGGAGTGTCCCAAGCCCTCCATGAAATGCTGCGTTGACGAACCTCTCGGTAACGACACGATAACGACAACTCGGAAGCCCTTCACGACGCGATACACGACAACAGAAGCCGAGGAGGACGAAGATGTTACGACGCGACCTGACAAATACAAAGACAGTGGAAACATCGCGTGTCCGGGCATGTGTGTGGAGTCGCGGTACAGTCAGTATTGCGAGGCATACCTGGCGTCCACTGGACTGTGCGTGTCTGGAAGACAATGCTGTGTCTCCAGGGATGTATACGGAGAGAAGAGGCCGCCGGATCTCGTGGTGCCCAGCGAGAAGAAACAATCTACTAAGAGGCCAACGACGGCGCTGACGACGACGCCGCAACCAAAACAGAGGCCAGGGCACAAATGCCGCGGCGACTGCATCACGGGGCTGTTCGCCTTGCTGTGCGACCACGTCGACGAGGACGCGACCTGCCCCTCAGACGGAACCTGCTGCATCACCGAGTCCAGCACCGAGCAGGTCACCAGGAGACCCACCACGCCCAGGCCCACCACTCCTGCGCCGCTCCCGCGATGTCCAGGCTACTGCCTGCTCAATCTCATGTCTGCGTTCTGCGAGCGCCCCGCCGTCCGCCTTCACAACACCCAATGTAAACTCAGCGGCTCCATATGTTGTGATAACAGCAGAGTCCCGCCTCGCACCACGCCGCGGCCCACCACCACCACCACCACGACCACGCCGGCGCCGCACGACTCCCGGCCCGACTGCCCCGGCTCGTGCATCGTGTCCCTGCTGTCGTTCACGTGTTTCCGAAACGCTGAGATGACGGACGTCTTCAAGTGTAAAAAGGCTGGGACCCAGTGCTGCGCGCCGAAGTCGAAGGTTCTAGAGGCCATGGGCATCAGCAGGAACGACACCTTCCCGCTGGCCACCACTCACCCACACACGCAGGCCCACACCCCCCACACATACACCCCGCAGCCGTTGCCGTACACGACGGCCATGACTCAGTACGAACCCAGCTACGTCACGACTTTGAGGACTCCCGAAAAGTACAACAAATACGTGTGCGGCGTCAAAGGTACGTCAAGTCGTGCCGGGCGGGTCATGGGCGGCGAGGACGGTTCCCGCGGCGAGTGGTGCTGGCAGGTGGCTCTCATCAACTCCTTGAACCAGTACCTGTGCGGCGCGGCGTTGGTCGGCACGCAGTGGGTCCTCACCGCCGCTCACTGTGTCACCAACATCGTCCGGTCCGGGGACGCGATCTACGTGCGCGTGGGCGACCACGACCTGACCAGGAAGTACGGGTCCCCGGGCGCGCAGACCCTCCGCGTGGCGACCACCTACATCCACCACAACCACAACAGCCAGACGCTCGACAACGACATCGCGCTGCTGAAGTTGCACGGCAAAGCTGAGCTCAAAGAAGGAGTGTGCTTGGTGTGTCTGCCGGCGCGAGGAGTCAGTCATGCGGCGGGGAAGCGGTGCACGGTCACCGGGTACGGCTACATGGGCGAGACGGGTCCGATCCCGCTGCGGGTCCGCGAAGCCGAGCTGCCGATCGTGAACGACGCGGAGTGCATCCGGAAGGTGAACGCGGTGACGGAGAAGATCTTCATCCTCCCCGCCAGCTCGTTCTGCGCCGGCGGAGAGGAAGGCAACGACGCCTGCCAGGGCGACGGCGGCGGCCCCCTCGTGTGCCAGGACGACGGGTTCTACGAGCTGGTGGGGCTGGTGTCGTGGGGCTTCGGCTGCGGCCGGCGCGACGTGCCGGGCGTGTACGTGAAGGTGTCCTCGTTCATCGGCTGGATCAATCAAATAATATCCGTGAACAACCAATGA

Protein sequence:

>DPOGS215180-PA
MRVLAIFLLLAVPILGQDESFASSFLSGLLDTLDTQVDAKNCPGVCMHALASLICSNVLEEVECPKPSMKCCVDEPLGNDTITTTRKPFTTRYTTTEAEEDEDVTTRPDKYKDSGNIACPGMCVESRYSQYCEAYLASTGLCVSGRQCCVSRDVYGEKRPPDLVVPSEKKQSTKRPTTALTTTPQPKQRPGHKCRGDCITGLFALLCDHVDEDATCPSDGTCCITESSTEQVTRRPTTPRPTTPAPLPRCPGYCLLNLMSAFCERPAVRLHNTQCKLSGSICCDNSRVPPRTTPRPTTTTTTTTPAPHDSRPDCPGSCIVSLLSFTCFRNAEMTDVFKCKKAGTQCCAPKSKVLEAMGISRNDTFPLATTHPHTQAHTPHTYTPQPLPYTTAMTQYEPSYVTTLRTPEKYNKYVCGVKGTSSRAGRVMGGEDGSRGEWCWQVALINSLNQYLCGAALVGTQWVLTAAHCVTNIVRSGDAIYVRVGDHDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGKAELKEGVCLVCLPARGVSHAAGKRCTVTGYGYMGETGPIPLRVREAELPIVNDAECIRKVNAVTEKIFILPASSFCAGGEEGNDACQGDGGGPLVCQDDGFYELVGLVSWGFGCGRRDVPGVYVKVSSFIGWINQIISVNNQ-