Monarch geneset OGS2.0

DPOGS215098
TranscriptDPOGS215098-TA4644 bp
ProteinDPOGS215098-PA1547 aa
Genomic positionDPSCF300139 - 383750-410911
RNAseq coverage490x (Rank: top 25%)
Annotation
HeliconiusHMEL0078005e-13162.18% 
BombyxBGIBMGA009610-TA3e-9946.07% 
DrosophilaCG11843-PA2e-2739.80% 
EBI UniRef50UniRef50_Q5MPB33e-9351.63%Hemolymph proteinase 21 n=5 Tax=Bombycoidea RepID=Q5MPB3_MANSE
NCBI RefSeqXP_001842493.12e-5035.77%serine protease [Culex quinquefasciatus]
NCBI nr blastpgi|3796990224e-10451.47%serine protease HP21 precursor [Bombyx mori]
NCBI nr blastxgi|3796990222e-10752.35%serine protease HP21 precursor [Bombyx mori]
Group
Gene OntologyGO:00038249.8e-52catalytic activity
GO:00042522.6e-32serine-type endopeptidase activity
GO:00065082.6e-32proteolysis
GO:00167724.4e-19transferase activity, transferring phosphorus-containing groups
GO:00167731.1e-08phosphotransferase activity, alcohol group as acceptor
KEGG pathwaygga:4227233e-18 
 K01324 (KLKB1)maps-> Complement and coagulation cascades
InterPro domain[1197-1395] IPR0090039.8e-52Peptidase cysteine/serine, trypsin-like
[1204-1478] IPR0012542.6e-32Peptidase S1/S6, chymotrypsin/Hap
[662-921] IPR0110094.4e-19Protein kinase-like domain
[775-941] IPR0004031.1e-08Phosphatidylinositol 3-/4-kinase, catalytic
[1064-1107] IPR0066045.1e-07Disulphide knot CLIP
Orthology groupMCL23338 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215098-TA
ATGTCGCATCGTATGACGTCACATCCTGTAACAGTGCAGAGGTCGTCCGAGGAGCCGAGTATAAAGCGCCCCGCCCGGCCGCAGCTTCCGGCGCACAGCACCCTGCAAGGACTCGCCCGGGGGCTCGCTGCAGGGCTCCGGGGACTTCGCGGCCTCAGCTCGGACTTAGCGCTCAGCTCGTACAACACATATACTGCCTCCCTCCTCCTCCTGTGTAGACTGTACGAGTGTCTGTTGTTGTCGCTGGTGGTCCCACTCGCTGACGGTCTCAGGGTGTCACTGGCGGCGGCGGTGGAGCTCTTCCTGCAGGACATCCGGGAACAGGATACGTTCAGAAACGCCAAATTAGATGTTATTGACATAAGCGCTCCACCGACCACAGTAACCGGAAGTCTGGCCAGTACAGATTCAAATAAAGATCTTGAGCAACATATAAGTTTAGAGGACGTCGTAGAGAACATACTGCAATTTGCCCGAACGGACGATGATTTCTGTACGTCCCTGATAACCCAATTCATACGTCAGTGTGGTAAAATTAGTCCCGGGTTCGATTTCCGCAGCGCCATCTGTTCAGATGTAATGAATTGCATCAAAATAAGTCCAGTAACCGGCAGTGTCATTGAAATTGCCAGGCAGATTGATATTTTGCAGGACATCGACTTATATAAATTAACTGACTTAGTTAACTCGTCCCTGGGCTCTGAAGGCGAGGGTGTGTGTCGACTTCTATATGAAGACGTGCTTCTGAGAAAGGGAGAGGAAATATTTGTTTTGGGACAAAATAAGCCCTGGGATGAAGAAATGTCCGTGGTCAATAACAAATTCTCAATCGAAGACGCGCCATTCTGGACGAGGTGTTTAGTGTCTAAATACAGGGAGCAAGACATACACACATACCAAGTATTAAAAGAACTGGTCCGATGGCCGAAGAGGCAGTTCGACGTCGCCGCAGTACTAGGCTCCCAGCAGTGGGAGGTGACGCAAGACGATATGGACAATGACTGCCTCTCCAAGTGGGCCTTCAGGGCTATGATGGGATCACTGTGCAGTAACAGTTCCAAGGGTCCGTCTCGCTCACAGGTGTCGTGGATGACGAGAGCCAATCACATGAAGATGTACAGCCAGGTGTTGAGATGCTGCGAGGGGATCGACACAAATGATGATAAAATATCGCTAGACATGAGACACCAGGAACTGCTGGCCCTTAGGGGGATGGCGTTACAATCAAACGATAGGCAGGCTCTGGAGAATATTTTGGAGAGAACAGAGTCTATGATACAATCTGGAAGTGAGCACAGTTACGGAGATATGTTGCGCGTATACGAACTGACATTGCGTCTGAGACGAGATTTGAACTCCGTGGAGCGAGTTAACGTGGAAGCGATCGTGAAACACGTGATGGGCGACGTTAGAAACGTTGAAAACGACGTCACCAGGAACCTCGACACGTTGTGTTTGCTGGGAATAACTTGTCTGGAGACCATGTTTGAGAACAGCACAGATATCGACGAACGTTCTTCACTGATGCTGTCAATCTGTGAGACGATCTCCTATCAAAGCTCTCCATCAACAGACGTTGTTCTTGACAAATTGGACTCCTTCGACCGGATCCTGGATGAGAATGTCTCCCAGAGGATACTAAACACGCTGCAACCCCTCCTGAGCACAATGGACCAGTATTCCGTGGACCAGTTGCTGTCCAAGCGCGAGCTGTTCTCTCCTCACCCCGCGCTGCTCGAGCTCCATCAAAACCAGCTCAGCTCGGACAAATATAAGCAGTGTCTGAGCTTAGTCTCTGACCCTGTGTATTTGCTGAAACAATACGTAGGAATGATGCTAGCAGCCGTGCAAACAGACGATGTAACGAAATACAAACAGATATACAAAAATATGAGACAGAGGATTTTTGAGAATCCGTACGTCGGCGCCGATTACATCGTCCTCAACAAGTACAGTAAGCAACTGGAAGGCTGTGACGATTTTGAGACGGACGTACACACGTTGCAGCGGCTATTGAAAGACATACACGCAGACTTACAATCCAGTAAGAGCCGTCTCTCTCTAACAGACATCTGCCCGACTCTATTAGAGGAGAAACAGAGTCGGGCACTCGACAGACTGCTGGCTCTCAAGGACGGAGTCCACTTCATTAAATTCATGGAAAATGTGTCCGTATACCGTGACGCCGTGACACGGCCGGTGTTACTGAGCTATTTGTCGTCAGACGGATTCACACGTCGTTGTATAGTGAAGACAGAGGGTAAGGGACATGCGGCTGCGGTCAGGGTCCGGGGGGCCTTGGAGAAGGCCTGGGAGATCCCACGGGGATATAAGGTGACGCCGCTCACCTCCGACTGTCTTCTGATCGAGTACGTGGAGAATAACACGCGACTCCGGGACATGGTGGACACCGGCGGTGGGGATGCCGGCGTCACTCGGACCGCTGACGAGAATCTCATACTAAACGTACCTCAAGCTATTTCCCAGTTGGAGTCCCTAGCAAAGAGCGTGCCTGCCACTTCACTACGGTCATCCATTGAGTCGGGGTGTCTCACCTTGGAAGAGTTCATAAGGAAGAAGACGGCCTTCACCGAGTCCCTGGGTCATATGACAGCATTCAGCTTTATATGTGGTCTGTCAGACCGTCATTTACAGAACATCCTGTATGACCCGGTCCGAGGCACCGTCTGCGCTGTGGACTGCGGAGCCCTACAGCCCCAGGAGATACCGCCCGCTAGACTCACGAGGAACCTGCTGGCGGTCTGTCGCACCGACGTTCTCGAAGCTCGACTCCAAAGAATGTTGTCCAGACTACGGGAATATCAAGGAATAATACTCCCAGCAGTAAATATATCGCTCAAGAGATCTGGACACCTGGATAAGCTACCCGCCATCCGCGGCAAAATCCAGGGCCGCCTTCTCCAGCACCAGGTCACTAAGGAGTGGATACAAAGGTCAGAGGTCAAATACAAAGAGAAATATCTGGAGCTGCTGGACGAGATCTTCGGCACAGATGACAAATGTTCATACACCGTCGAGGAACAGGTATCAAATCTGTTGCTCCAATCTACCGATCCAAGGATCCTCAGCGTGACCAGAGAGAGTAAGTTGTTGGAGTTCATTCACAATTCAGTTGTCGCGGAGAAGCCGTGCGTGCTACAAACAAACGGACAGTACGACGGCGACGTCTGTGATCACAATGGCGTCGAGGGCGTCTGTAGGAACATAAGGAAATGTCCGTCAGCTATAAATGAAATAAGAAATAAACGTCCTCCTGTCCTCTGCTCGTTTTCTAACACGGACCCCATCGTGTGCTGTGTGGAGAACACACCGACGACACAGAGACCACAAATTCTGACAACAAAGAGGTTCACGACTACAACAGAATATATACCACCAGAGGAAGATTACGTTTTAGCGGATACGAATTCTAAAGGAACCGATCAGTGCGAGCCGATTTTGGCTAATCAGACGGCGCCGAAAACAGGACAAAAAGCCTGGGACAAATGTATTGAATACCAACAAAAGCTGATATTCCCCTGCGAGAGAGGCGTGACCTTGACGGGAGCCATGAGCCGAGGTAACAAGTGCCATCACGACGCCAACCAGCTCATCGTTGGTGGAGTGGAAGCCACTGAGAACGAATTCCCACACATGGCGCTGCTCGGCTACGGGAAAAACATTAATAGTATCCAATGGCTCTGCGGAGGTTCACTGATAAGCGAGAGATTCGTTTTGACAGCGGGACATTGCACTTCAAGCAGAGACGCAGGTATAGTGAAGTACATCCGTCTCGGTGCCCTCCGTCGCACCGACCCCCTGGGCCCGGACCAGCTGTTCACTGTCAGTGACGTCATCAAGCACCGGGACTTCCATCCGCCAAACAGATACAACGACATCGCGCTATTGAAGATGGACAGAGATGCTATTCTGACGGAGTTCTTAGTGCCGGCTTGTTTGGACGTGAGAGCTCCAGGGGACTATAGCCGAGTGCTGGCTTCAGGCTGGGGAGCCACGAAGAACCGGGGGGCGAACGCTGATCATTTACAAAAGGTCATACTCCAGGAGTTCACAACCGAAGAGTGCTCCCAAATGTTCACAGCAAGTCGTCTCATGAAGCAGGGCTTTAACGGAAACACACAGATCTGCTACGGAGATAAAGAAATGTCCAAGGACACTTGTCAGGTTGCAGAGAAGCTCGTTTTGATTGGACATAATGAAAAAATAATGGCAAACTACGCGGAAATTAAGAAACTAGACGAGGAATCTTTGCAAAAGTTGACGGAAGGCGAAGGACCCGCCGACGAATGTCTCTGTCCGCTGCTCTCGCCGCCAACTGAGAAGAAGAATTTGGCAAGATCAGTCAAAAGGGCTCTCTCAGCGTTCGGAAAATCGCGAAAACCGAATAGAGTATCAAATATGATGGAGAACATTAAAGAATCAGGCTTCTTTTCGACGAGAGAGGTTCTAAAAAGACAGAAGTCTTGTGGCAAATGCGGCTGCGACGACGAAAATATTGTGTTGAAACATTCATACGCGAACATACGCATAACAAGTCCTGATCTATCGTCTGTCTGTCCGTGCCCGTCGAACTGTTTGCCTGGTAAGATTGCGTCTGAACACAAAGTGTCTCATAACCTCTGA

Protein sequence:

>DPOGS215098-PA
MSHRMTSHPVTVQRSSEEPSIKRPARPQLPAHSTLQGLARGLAAGLRGLRGLSSDLALSSYNTYTASLLLLCRLYECLLLSLVVPLADGLRVSLAAAVELFLQDIREQDTFRNAKLDVIDISAPPTTVTGSLASTDSNKDLEQHISLEDVVENILQFARTDDDFCTSLITQFIRQCGKISPGFDFRSAICSDVMNCIKISPVTGSVIEIARQIDILQDIDLYKLTDLVNSSLGSEGEGVCRLLYEDVLLRKGEEIFVLGQNKPWDEEMSVVNNKFSIEDAPFWTRCLVSKYREQDIHTYQVLKELVRWPKRQFDVAAVLGSQQWEVTQDDMDNDCLSKWAFRAMMGSLCSNSSKGPSRSQVSWMTRANHMKMYSQVLRCCEGIDTNDDKISLDMRHQELLALRGMALQSNDRQALENILERTESMIQSGSEHSYGDMLRVYELTLRLRRDLNSVERVNVEAIVKHVMGDVRNVENDVTRNLDTLCLLGITCLETMFENSTDIDERSSLMLSICETISYQSSPSTDVVLDKLDSFDRILDENVSQRILNTLQPLLSTMDQYSVDQLLSKRELFSPHPALLELHQNQLSSDKYKQCLSLVSDPVYLLKQYVGMMLAAVQTDDVTKYKQIYKNMRQRIFENPYVGADYIVLNKYSKQLEGCDDFETDVHTLQRLLKDIHADLQSSKSRLSLTDICPTLLEEKQSRALDRLLALKDGVHFIKFMENVSVYRDAVTRPVLLSYLSSDGFTRRCIVKTEGKGHAAAVRVRGALEKAWEIPRGYKVTPLTSDCLLIEYVENNTRLRDMVDTGGGDAGVTRTADENLILNVPQAISQLESLAKSVPATSLRSSIESGCLTLEEFIRKKTAFTESLGHMTAFSFICGLSDRHLQNILYDPVRGTVCAVDCGALQPQEIPPARLTRNLLAVCRTDVLEARLQRMLSRLREYQGIILPAVNISLKRSGHLDKLPAIRGKIQGRLLQHQVTKEWIQRSEVKYKEKYLELLDEIFGTDDKCSYTVEEQVSNLLLQSTDPRILSVTRESKLLEFIHNSVVAEKPCVLQTNGQYDGDVCDHNGVEGVCRNIRKCPSAINEIRNKRPPVLCSFSNTDPIVCCVENTPTTQRPQILTTKRFTTTTEYIPPEEDYVLADTNSKGTDQCEPILANQTAPKTGQKAWDKCIEYQQKLIFPCERGVTLTGAMSRGNKCHHDANQLIVGGVEATENEFPHMALLGYGKNINSIQWLCGGSLISERFVLTAGHCTSSRDAGIVKYIRLGALRRTDPLGPDQLFTVSDVIKHRDFHPPNRYNDIALLKMDRDAILTEFLVPACLDVRAPGDYSRVLASGWGATKNRGANADHLQKVILQEFTTEECSQMFTASRLMKQGFNGNTQICYGDKEMSKDTCQVAEKLVLIGHNEKIMANYAEIKKLDEESLQKLTEGEGPADECLCPLLSPPTEKKNLARSVKRALSAFGKSRKPNRVSNMMENIKESGFFSTREVLKRQKSCGKCGCDDENIVLKHSYANIRITSPDLSSVCPCPSNCLPGKIASEHKVSHNL-