Monarch geneset OGS2.0

DPOGS215181
Transcript: DPOGS215181-TA (2454 bp)
Protein: DPOGS215181-PA (817 aa)
Genomic position: DPSCF300143 - 375332-386210
RNAseq coverage: 665x (Rank: top 19%)
Annotation
Heliconius:       HMEL003876        2e-55   34.44%
Bombyx:           BGIBMGA008668-TA  1e-144  45.49%
Drosophila:       CG1299-PA         1e-58   37.78%
EBI UniRef50:     UniRef50_Q8I925   7e-127  58.29%  Coagulation factor-like protein 3 n=1 Tax=Hyphantria cunea RepID=Q8I925_HYPCU
NCBI RefSeq:      NP_001036891.1    1e-142  65.14%  clip domain serine protease 4 [Bombyx mori]
NCBI nr blastp:   gi|112982842      2e-141  65.14%  clip domain serine protease 4 precursor [Bombyx mori]
NCBI nr blastx:   gi|112982842      1e-144  65.14%  clip domain serine protease 4 precursor [Bombyx mori]
Group:
Gene Ontology:
  GO:0003824  4e-90    catalytic activity
  GO:0004252  1.2e-83  serine-type endopeptidase activity
  GO:0006508  1.2e-83  proteolysis
KEGG pathway:
InterPro domain:
  [552-812]  IPR009003  4e-90    Peptidase cysteine/serine, trypsin-like
  [561-808]  IPR001254  1.2e-83  Peptidase S1/S6, chymotrypsin/Hap
  [594-609]  IPR001314  4.6e-13  Peptidase S1A, chymotrypsin-type
  [451-503]  IPR022700  1.8e-11  Proteinase, regulatory CLIP domain
  [451-504]  IPR006604  6.8e-11  Disulphide knot CLIP
Orthology group: MCL16809, Insect specific

Nucleotide sequence:

>DPOGS215181-TA
ATGACAATAGACGGTAGTACGATATGCAGCATCATCCAATCATTTCCAGTGCTCTTAATCTTGGCGCAAAATTCAAAAAAATGTCTCCATACCCCTCTTGTATATAAACCTCCACACCCGACCCACTCCCCGGAATATTACGCGGCCTACGACGCCCTGATCCAAACATTGCTTCATGACGACAAATGCGACAACCAATACGGGATCCCGAGTTATGGCCTCGGCTACAAGAAGTGCTATAACCACTACCCCGTATATGACTTTCCAGTGAGCGGCCATTTTCCTGTTCATACTGATTTCGGATACCACTACCCCAGCTTGGAAGGCCATTATTTCCCTAAAGCCAGTATAAATAGCACCGGTTACGCGTACAACAGGAACTACGCCCCAAATCAAGTACACAACGTGAAAAGAAAATATAAGGTGGTGGTGAAGAATGGCAAGCGGATGCAGAAGCTGAGATCGAGAGGGCTCTTCCCGCAGCGAGAGCAACGACGCTACCAATCACCGTTTGGATTTACCCAAAACTTTTTCCCGAATCCAAGCAGAATTTCCACCACTATTTCCAAACTTCTCCAGTTGGAACGAGAAAACCCGTACAGCAACCCTGCATACATCTATCCCTTTCAGAACAACCCGATAACAAATCCCAATCACAATCCTTACAATGTTAGATTCCCAAATCAGGGACAGCAAAACAATTTCGGAAATCCAGAATATAATAACGAACAAGTCACAAATGCTCCAAGTTTCAATGGTAATCAAAATGTCAACGGGGATTTAACACCAGGTCAACAGGTTGAACCTGTTTTTGGACCAACAAATGAAGGTTCGAGTGGTTTAGTCTCAGGACAGCAATTGGGACCGGTGTTTGGGCAGGATAATGGTGATAAAAGTGGTAGCACACTGGTATCTGGACATCAAGTAAGCCCCGTCTTTTCGACGGAAGCACCTAATTATGAAAACCCTGGAAATTTGATTTCTGGACAACAAAATGGTTCACCTTTTAGTAAAGAACCAGCTAATGATGGCAATTTTGGAGGTTTGATATCTGGTCAACAAAACGGTCCCACTTTTAGCACAGAAGCACCTAACTTTACAAATTCTGGTAACTTGGTTACTGGTCAACAAAATAGTAATCCTTTCACCACCGAAGCACCCGATCAGGGCGGCACTGGTTTAGTTTCTGGAGTGCAAAGCGGCCCTGTATTCGGTACAGATGAAAGTAAAAAAGGAAACGGTGGCACTTTGATTTCTGGGACACACTCCGGACCTGTTTACAGCCCAGACGCAAATGTACCAGCTTTCGAATCCCGAAACAACTTTAACGAAGGAATATTCAAAGAGACATGCAACACCGTGGGTGGCGGTCTTGGACGTTGTATAACAATAGTCTCGTGTCCTGTTTATGTGAAGCTACTGCAGCAAGCCAGAACGAGTCCTTCCGCTGTCCAGGAGTTGAGAGCGGCACAATGCGGCTTTGAAGGAAATTATCCCAAGGTTTGTTGTCCTCTTCCTCCACCTCCTCCACCTCCTATACCCGACACACCGCCAGCACCTCCGACACCTCCAACACCCCCTACACCCACGCCATCCGGGAAGTCCATACCTTCGGAGTCAGACTTCATCACTGCCTTCCCTGAACCACCTGAGTGTGGAGTCTCCAATGCCTCATTCAGCAGAGTCGTGGGTGGAATACCCTGTACCTGGGGTGATTTCCCTTGGATGGCACTCCTGGGCTACAAGGGTCGTAGTGGAGCTGGTACGCGCTGGCTATGTGGAGGTTCGCTCGTCTCCCACCACCATGTCTTAACCGCAGCCCATTGCATCCACAACCACGAGCACGACTTATACGTCGTCCGTCTCGGCGAGCTGGACCTGGAGCGAGATGATGAAGGTGCTACTCCCATCGATGTCCTCATCAAGCAAAAGATCAAACATGAGAAATACAACGCGACCTCGTACACTAACGACATCGGACTGTTGGTGTTGCAGAACGA
CGTGGACTTCACCAATCTGATAAGACCGATTTGCATCCCGACGCGTCAGGATCTGCGCGCCAACTCATTTGTTGACTACAACCCACTCATTGCTGGGTGGGGAGACACCGAGTTCCGCGGGCCATCAGCATCACACCTGCAAGTCCTTCAGCTGCCAGTGCTGGACAACTCGTTCTGTCAGAAGGCCTACTCGCGGTACAAGGCTCAAGTGATCGACGACCGCGTCATGTGCGCCGGCTTCAAGAAAGGCGGCAAGGACGCCTGCCAGGGCGACAGCGGCGGACCGCTCATGCAGCCGGATTACAATCCAACGACGCTGGCAACATACTTCTACCAGACCGGAGTGGTTTCGTTCGGTCGGAAGTGTGCCGAGGCTGGGTATCCAGGAATCTACACGAGGGTGACGCACTTCGTACCGTGGCTGCAAAAAAACATGCTCGGCATTGACGGATGA

Protein sequence:

>DPOGS215181-PA
MTIDGSTICSIIQSFPVLLILAQNSKKCLHTPLVYKPPHPTHSPEYYAAYDALIQTLLHDDKCDNQYGIPSYGLGYKKCYNHYPVYDFPVSGHFPVHTDFGYHYPSLEGHYFPKASINSTGYAYNRNYAPNQVHNVKRKYKVVVKNGKRMQKLRSRGLFPQREQRRYQSPFGFTQNFFPNPSRISTTISKLLQLERENPYSNPAYIYPFQNNPITNPNHNPYNVRFPNQGQQNNFGNPEYNNEQVTNAPSFNGNQNVNGDLTPGQQVEPVFGPTNEGSSGLVSGQQLGPVFGQDNGDKSGSTLVSGHQVSPVFSTEAPNYENPGNLISGQQNGSPFSKEPANDGNFGGLISGQQNGPTFSTEAPNFTNSGNLVTGQQNSNPFTTEAPDQGGTGLVSGVQSGPVFGTDESKKGNGGTLISGTHSGPVYSPDANVPAFESRNNFNEGIFKETCNTVGGGLGRCITIVSCPVYVKLLQQARTSPSAVQELRAAQCGFEGNYPKVCCPLPPPPPPPIPDTPPAPPTPPTPPTPTPSGKSIPSESDFITAFPEPPECGVSNASFSRVVGGIPCTWGDFPWMALLGYKGRSGAGTRWLCGGSLVSHHHVLTAAHCIHNHEHDLYVVRLGELDLERDDEGATPIDVLIKQKIKHEKYNATSYTNDIGLLVLQNDVDFTNLIRPICIPTRQDLRANSFVDYNPLIAGWGDTEFRGPSASHLQVLQLPVLDNSFCQKAYSRYKAQVIDDRVMCAGFKKGGKDACQGDSGGPLMQPDYNPTTLATYFYQTGVVSFGRKCAEAGYPGIYTRVTHFVPWLQKNMLGIDG-
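
Consistency check (illustrative sketch, not part of the original record): the transcript length of 2454 bp and the protein length of 817 aa reported in this record agree if the transcript is a complete coding sequence whose final codon is a stop. That is an assumption here, suggested by the nucleotide sequence starting with ATG and ending with TGA. The helper names `read_fasta` and `expected_protein_length` are hypothetical and only serve this example.

```python
def read_fasta(text):
    """Parse FASTA text into {identifier: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the identifier.
            header = line[1:].split()[0]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def expected_protein_length(cds_length):
    """Residues encoded by a complete CDS whose last codon is a stop.

    Assumes the sequence is coding only (no UTRs) and in frame.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


# Lengths as reported in this record.
TRANSCRIPT_BP = 2454
PROTEIN_AA = 817

# 2454 / 3 = 818 codons, i.e. 817 residues plus one stop codon.
print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
```

`read_fasta` can be applied to the two FASTA blocks above to confirm the lengths directly. The trailing `-` in the protein sequence marks the stop and should be stripped before counting residues.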