Monarch geneset OGS2.0

DPOGS215188
TranscriptDPOGS215188-TA1152 bp
ProteinDPOGS215188-PA383 aa
Genomic positionDPSCF300143 - 302713-307052
RNAseq coverage1x (Rank: top 95%)
Annotation
HeliconiusHMEL0038762e-4532.11% 
BombyxBGIBMGA008668-TA2e-9847.59% 
DrosophilaCG32260-PA7e-6234.95% 
EBI UniRef50UniRef50_Q8I9251e-9647.89%Coagulation factor-like protein 3 n=1 Tax=Hyphantria cunea RepID=Q8I925_HYPCU
NCBI RefSeqNP_001036891.13e-9947.37%clip domain serine protease 4 [Bombyx mori]
NCBI nr blastpgi|564184131e-9849.34%hemolymph proteinase 17 [Manduca sexta]
NCBI nr blastxgi|564184158e-10347.77%hemolymph proteinase 17 short form [Manduca sexta]
Group
Gene OntologyGO:00038248.4e-84catalytic activity
GO:00042525e-72serine-type endopeptidase activity
GO:00065085e-72proteolysis
KEGG pathway 
InterPro domain[118-380] IPR0090038.4e-84Peptidase cysteine/serine, trypsin-like
[127-376] IPR0012545e-72Peptidase S1/S6, chymotrypsin/Hap
[166-181] IPR0013144.9e-14Peptidase S1A, chymotrypsin-type
[27-78] IPR0227001.2e-12Proteinase, regulatory CLIP domain
[27-79] IPR0066047.4e-09Disulphide knot CLIP
Orthology groupMCL27844 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215188-TA
ATGGTTCTCCGAGTAGTATTTCTGTGTTTGTGTTTCCAGACAGCGCTCTCTCGTATCGAATATAGAAGAGAGGAGTCATGCAAGGAAGTGAATGGTGTTACTGGAAGATGTGTCTCCATAGAATCCTGTCCCCCGTTTGTCCTGATGATGCAGACGGAACTTATTACTCAATACAAAACACTTCTAAAGCAATCACACTGTGGGTTTGAGGGAAACGTCCCCATGGTTTGCTGTCCCGATACTTCTCCTGAATCTCCATCCTCTCTCGGTCCCTCATCTCCTGTCGACGTTCCATCTAAGGGGTCGGCTCGAATGATGAACCTTCAATCACCATTTCTGTCACCACCAACCTGTGGAGTGTCGAACGCTTCATCTGGCCGGGTTGTGGGAGGAGTTGACGCTAAGCTCGGAGACTTGCCCTGGATGTGCCTTTTGGGGTACTGGGAGGGTGGCTATGATAAAGGCGGATCAAACGGGGACACCAAGTGGAGATGCGGGGGATCGCTGGTGTCCGCACAACACGTGCTCACAGCCGCTCACTGTATTCATCACAGAGAGAAAGAACTATACGTGGTCCGTCTCGGAGAGTTGGATCTCGATCGTGATGATGAAGCGGCTCCAATCGACGTCCTCATTAGAAGAGCAATAAAACATGAAGCATATAACAGGGACACGTACACTAACGACATAGGACTCCTCGTACTCGAAAGAGGTGTCGAGTTCACAAACCTGATACGGCCTATTTGTCTTCCGATCCTTCCTGAATTACTGTCTAACACGTTTGTCAACTACAGTCCGTTCGTTGCTGGCTGGGGCAGAACGTCAGATCGAGGTCCCGGTTCGAGCCATCTCAAACTGACTCAATTGCAAGTAGTCGATAACCAAAAGTGTAAGAAAACGTACCTGGAGTACCCCGCCGTGATTGATGATAAGGTCTTGTGTGCTGAAGCGGGAGGACGCGACGCCTGCGAAGGGGACAGCGGGGGACCCCTTATACAACCATTTTATAATCAGGATAAGAAAGTGTATTACTTCTACCAGACAGGTGTTGTAGCGTACGGAAGACGTTGTGCTGAAGCCGGTTACCCCGGGGTATATTCCAGGGTAACTCACTACATACTCTGGATACAGAAGCACATCATGGAGAACTAA

Protein sequence:

>DPOGS215188-PA
MVLRVVFLCLCFQTALSRIEYRREESCKEVNGVTGRCVSIESCPPFVLMMQTELITQYKTLLKQSHCGFEGNVPMVCCPDTSPESPSSLGPSSPVDVPSKGSARMMNLQSPFLSPPTCGVSNASSGRVVGGVDAKLGDLPWMCLLGYWEGGYDKGGSNGDTKWRCGGSLVSAQHVLTAAHCIHHREKELYVVRLGELDLDRDDEAAPIDVLIRRAIKHEAYNRDTYTNDIGLLVLERGVEFTNLIRPICLPILPELLSNTFVNYSPFVAGWGRTSDRGPGSSHLKLTQLQVVDNQKCKKTYLEYPAVIDDKVLCAEAGGRDACEGDSGGPLIQPFYNQDKKVYYFYQTGVVAYGRRCAEAGYPGVYSRVTHYILWIQKHIMEN-