Monarch geneset OGS2.0

DPOGS204450
TranscriptDPOGS204450-TA1182 bp
ProteinDPOGS204450-PA393 aa
Genomic positionDPSCF300002 + 165977-171656
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0040312e-7163.27% 
BombyxBGIBMGA013645-TA6e-15568.56% 
DrosophilaCG14892-PA4e-7440.36% 
EBI UniRef50UniRef50_E0V9714e-8043.86%Transmembrane protease, putative n=1 Tax=Pediculus humanus corporis RepID=E0V971_PEDHC
NCBI RefSeqXP_002422665.17e-8143.86%transmembrane protease, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420032451e-7943.86%transmembrane protease, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420032454e-8043.12%transmembrane protease, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00038241.2e-80catalytic activity
GO:00042521.7e-75serine-type endopeptidase activity
GO:00065081.7e-75proteolysis
KEGG pathwaymdo:1000139812e-19 
 K01324 (KLKB1)maps-> Complement and coagulation cascades
InterPro domain[13-390] IPR0090031.2e-80Peptidase cysteine/serine, trypsin-like
[33-386] IPR0012541.7e-75Peptidase S1/S6, chymotrypsin/Hap
[66-81] IPR0013146.1e-07Peptidase S1A, chymotrypsin-type
Orthology groupMCL16020 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204450-TA
ATGAAAGCAAGTTACGACACAGCCAGCGAGCGACAATGTGGTGTTCCTCTCCGTCGCCACACCAGGCAGCCCCGAGAGAGGTCCGCCAGCCAGCTCAGGATTATCAAGGGAAGGGAGTCCAAGAGAGGAGCCTGGCCTTGGCAGGTTTCTCTCCAGCTGTTACATCCTAACTACGGTCTGATCGGCCACTGGTGCGGCGGAGTGCTGGTTCATCCGCAGTGGCTGCTCACCACCGCGCATTGCGTTCACAACGAGCTGTTCAACCTGCCGCTACCAGCTCTATGGACGGCGGTGCTCGGGGAGTGGGACCGTAACGAACAACGCGGCTCCTTCCTGCCCATCGAGAGGATCATCCTGCATCACCGCTTCCACAACTACCAGCATGATATAGCTCTGATGAAGATGACAAAGTCAGCGGACGTGAGCACGAGGAGTCGCATCCGCGCCATCTGTCTGCCGCCATACGAGCCTGTGGACGACAACATAGAGAGGAGCACCTCCTACACCAGCACACAGGAAGTGAGGCGGAAGACGAGGCCGCCGCGACCCAAGCCCGACACCGCCAACAAATACTTGGAGAAAATCAACAACCTCACGAAGACCGTCCACTCGGCCAAGGACAAGAAGAAGAACACCAGGTATAACGTGCGGGTCTCCAACGACGACGGGCTCAGAGATAGGAAGATAGAGGACAGAGAGGCGGTCTACGACGGAGCCTCCCTGGACAGCGTCGTGAACCTCATACGGAAAGGGAAAGATGTCTCCAGGAGCGACAAGATGATAGCGAGCTACCACGAGATAGACCCCTTCATAGACAACTCCATAGACGCCAAAGAGGAGTGTTACGCCACTGGCTGGGGACGGCAGCAGACCAACGGCAGTCTCACGGACGTGCTGCTGGAGGCCGAGGTGCCGATACTGCCGCTCAAGACGTGCAGGGAGCGGTACTCGCTCAGTCTGCCGCTCAACGACGGACACCTGTGCGCCGGCAGCACGGACGGCAGCAGCGGAGCCTGCGTGGGTGACAGCGGCGGTCCCCTCCAGTGTGTGGTGGGCGGCAGGTGGGTGCTCCGCGGCCTGACGTCGTTCGGGTCGGGCTGCGCCCTGACCGGAGTCCCCGACGTCTACACCAACGTTAGACATTACGTTGCCTGGATCTACGCTCACGTTTACGCTGGGTAG

Protein sequence:

>DPOGS204450-PA
MKASYDTASERQCGVPLRRHTRQPRERSASQLRIIKGRESKRGAWPWQVSLQLLHPNYGLIGHWCGGVLVHPQWLLTTAHCVHNELFNLPLPALWTAVLGEWDRNEQRGSFLPIERIILHHRFHNYQHDIALMKMTKSADVSTRSRIRAICLPPYEPVDDNIERSTSYTSTQEVRRKTRPPRPKPDTANKYLEKINNLTKTVHSAKDKKKNTRYNVRVSNDDGLRDRKIEDREAVYDGASLDSVVNLIRKGKDVSRSDKMIASYHEIDPFIDNSIDAKEECYATGWGRQQTNGSLTDVLLEAEVPILPLKTCRERYSLSLPLNDGHLCAGSTDGSSGACVGDSGGPLQCVVGGRWVLRGLTSFGSGCALTGVPDVYTNVRHYVAWIYAHVYAG-