Monarch geneset OGS2.0

DPOGS206202
TranscriptDPOGS206202-TA1683 bp
ProteinDPOGS206202-PA560 aa
Genomic positionDPSCF300509 + 9570-13895
RNAseq coverage259x (Rank: top 41%)
Annotation
HeliconiusHMEL0144191e-11246.24% 
BombyxBGIBMGA011051-TA2e-6853.02% 
Drosophilasnk-PB2e-3335.43% 
EBI UniRef50UniRef50_D6W6R53e-3933.76%Serine protease P44 n=1 Tax=Tribolium castaneum RepID=D6W6R5_TRICA
NCBI RefSeqXP_969745.26e-4033.76%PREDICTED: similar to trypsin-like serine protease [Tribolium castaneum]
NCBI nr blastpgi|1892336781e-3833.76%PREDICTED: similar to trypsin-like serine protease [Tribolium castaneum]
NCBI nr blastxgi|1892336789e-3833.76%PREDICTED: similar to trypsin-like serine protease [Tribolium castaneum]
Group
Gene OntologyGO:00038246e-47catalytic activity
GO:00042521.6e-32serine-type endopeptidase activity
GO:00065081.6e-32proteolysis
KEGG pathwaygga:4227236e-20 
 K01324 (KLKB1)maps-> Complement and coagulation cascades
InterPro domain[220-415] IPR0090036e-47Peptidase cysteine/serine, trypsin-like
[226-423] IPR0012541.6e-32Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL21027 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206202-TA
ATGGTCAAAATTATCTCCGTTCTGCTCATCATCAGTTTTGCTGACGTGAGAGCTCAGCTCTTTGATTTCCTATCAGCGTTCCAACCTCAGACCTTACGGCCTATCTTCGAACCAACAAAACCGACGAGGACCACGAGGTTCCTCGACACAAATAGAACGCAGAACGCAAGTTGGAAATTGGGTCAGAGTTCAGAACTAACAACCGCGAAAGCAATACGTAAAACACCAAAAAGCAACCGACGAAGCCAAAATAAGACCAAAAAAGTTAAAGAGAACTATGGCCCGGACAAAATAGCGTTCGACGATGGCTACAACTCCATCATCAGTTACGACACGGTCAAACACACGACAGTTAACCCGTTCGCTCGCCCGGGAGTCAAACCCGCGGTAGTGAACGGTCCGCCTATAACGAATAGGCCTGTGACGAGGTCGTTTGCGCCTAGACCAATAGCTGGGCTTCAGCCGCACGACGATAACACAGTGACCCCCGAATTAATCGTGGGACCGGACGAAGATTACATGTCGACGGTGGAGAGGAGGCGTTTCATGGAAGTTACGGAAAAAAAATGTGAGCAGTACGTATCTCTGGACACTGTTCGCGTTGAAGCCATACCCCTCGTACCATCACCACGTCCTGTTGTCGTCAACGTGTCGTCCTGTGCTCGCTCCGTGCCGCTCGTGGTGGGAGGGAGCGTAGTCACGATACAACAGTTCCCACACATGGCCGGTCCTCCGCGAGCAGTCCAGCTCGGATCCTCTCGCCGTGACGATCCCGGGGCGATAGTTCTCCGCGTGTCCTCAGTAACCAGACACCCCAAGTACCGGCCGCCGCAGTCTTACTACGACATCGCCATCGTCAAGATGGTAAAAAATGTCAAATTCTCAGCCGTCATTAAGCCAGCCTGTCTGGGAGTACCACCAGCTCCTGGGAAACACATCATCGCTACAGGCTGGGGGAAGACGGAGTTTGGTGGAGATGAGTCCGAACTGCTAAGAGGTGTTTCACTTCCGGTGTGGAGCCTAGAGGAGTGTGCGTCAGTATTGGGGGAGTCCCGGAAACTGCCGCGCGGACCCGACCGCAGCCAGACCTGCGCCGGGGACAGGAGAGGAGGCAGGGACACCTGCCAGGGTGACTCGGGCGGACCAGCACAGCTCCGAGAGGGCTGCTCCTGGAGGGTCGTCGCTGTAACGTCGACAGGTAGAGCGTGTGGTGCAATCGACACACCAGCTATATACGCCAACGTCCAAATACCCTTCGTGGCCAAGGTCATCTTCGGTGATGAGATCAGGAATGGGGGGAATTTAAACATGGATAACCGAAATCAATGGGGCAGCGAACATAGCAGTGAAAGTGGACAAAACAACCGAAATAGACCAAATGATAGCGCTCAAAATAACTATAACAGACCGAGTGTTGACAGTGTACAAAATAACTATAACAGACCGAGTGTTGACAGTGTACAAAATAACTATAACAGACAAAATGGTGACAGTGTACAAAGTAGCTATAACCGACCGAATAACCAATGGTATCAACAAACGACCAGCAACCCGCTAAAGAATGAAAATTACCGAAATCCAGTATACAGAGGGGACGATACAGAAGATTCGACTATTGGCCCCCATTATTACTTCGAGGACAGCGTTCATAGACCTTACATAGGCAACATCTGGCAGACTTGA

Protein sequence:

>DPOGS206202-PA
MVKIISVLLIISFADVRAQLFDFLSAFQPQTLRPIFEPTKPTRTTRFLDTNRTQNASWKLGQSSELTTAKAIRKTPKSNRRSQNKTKKVKENYGPDKIAFDDGYNSIISYDTVKHTTVNPFARPGVKPAVVNGPPITNRPVTRSFAPRPIAGLQPHDDNTVTPELIVGPDEDYMSTVERRRFMEVTEKKCEQYVSLDTVRVEAIPLVPSPRPVVVNVSSCARSVPLVVGGSVVTIQQFPHMAGPPRAVQLGSSRRDDPGAIVLRVSSVTRHPKYRPPQSYYDIAIVKMVKNVKFSAVIKPACLGVPPAPGKHIIATGWGKTEFGGDESELLRGVSLPVWSLEECASVLGESRKLPRGPDRSQTCAGDRRGGRDTCQGDSGGPAQLREGCSWRVVAVTSTGRACGAIDTPAIYANVQIPFVAKVIFGDEIRNGGNLNMDNRNQWGSEHSSESGQNNRNRPNDSAQNNYNRPSVDSVQNNYNRPSVDSVQNNYNRQNGDSVQSSYNRPNNQWYQQTTSNPLKNENYRNPVYRGDDTEDSTIGPHYYFEDSVHRPYIGNIWQT-