Monarch geneset OGS2.0

DPOGS215189
Transcript        DPOGS215189-TA  1152 bp
Protein           DPOGS215189-PA  383 aa
Genomic position  DPSCF300143 - 295288-299824
RNAseq coverage   0x (Rank: top 99%)
Annotation
Heliconius        HMEL003876        6e-46   32.11%
Bombyx            BGIBMGA008668-TA  3e-98   47.59%
Drosophila        CG32260-PA        7e-62   34.95%
EBI UniRef50      UniRef50_Q8I925   3e-96   47.89%  Coagulation factor-like protein 3 n=1 Tax=Hyphantria cunea RepID=Q8I925_HYPCU
NCBI RefSeq       NP_001036891.1    8e-99   47.37%  clip domain serine protease 4 [Bombyx mori]
NCBI nr blastp    gi|56418413       4e-98   49.34%  hemolymph proteinase 17 [Manduca sexta]
NCBI nr blastx    gi|56418415       8e-103  47.77%  hemolymph proteinase 17 short form [Manduca sexta]
Group
Gene Ontology     GO:0003824  1.8e-84  catalytic activity
                  GO:0004252  9.1e-73  serine-type endopeptidase activity
                  GO:0006508  9.1e-73  proteolysis
KEGG pathway 
InterPro domain   [118-380]  IPR009003  1.8e-84  Peptidase cysteine/serine, trypsin-like
                  [127-376]  IPR001254  9.1e-73  Peptidase S1/S6, chymotrypsin/Hap
                  [166-181]  IPR001314  3.2e-14  Peptidase S1A, chymotrypsin-type
                  [27-78]    IPR022700  1.2e-12  Proteinase, regulatory CLIP domain
                  [27-79]    IPR006604  7.4e-09  Disulphide knot CLIP
Orthology group   MCL27844  Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215189-TA
ATGGTTCTCCGAGTAGTATTTCTGTGTTTGTGTTTCCAGACAGCGCTCTCTCGTATCGAATATAGAAGAGAGGAGTCATGCAAGGAAGTGAATGGTGTTACTGGAAGATGTGTCTCCATAGAATCCTGTCCCCCGTTTGTCCTGATGATGCAGACGGAACTTATTACTCAATACAAAACACTTCTAAAGCAATCACACTGTGGGTTTGAGGGAAACGTCCCCATGGTTTGCTGTCCCGATACTTCTCCTGAATCTCCATCCTCTCTCGTTCCCTCATCTCCTGTCGACGTTCCATCTAAGGGGTCGGCTCGAATGATGAACCTTCAATCACCATTTCTGTCACCACCAACCTGCGGAGTGTCGAACGCTTCATCTGGCCGGGTTGTGGGAGGAGTTGACGCTAAGCTCGGAGACTTGCCCTGGATGTGCCTTTTGGGGTACTGGGAGGGTGGTTATGATAAAGGCGGATCAAACGGGGACACTAAGTGGAGATGCGGGGGATCGCTGGTGTCCCCACAACACGTGCTCACAGCCGCTCACTGTATACATCACAGAGAGAAAGAACTATACGTGGTCCGTCTCGGAGAGTTGGATCTCGATCGTGATGATGAAGCGGCTCCAATCGACGTCCTCATTAGAAGAGCAATAAAACATGAAGCATATAACAGGGACACGTACACTAACGACATAGGACTCCTCGTACTCGAAAGAGGTGTCGAGTTCACAAACCTGATACGGCCTATTTGTCTTCCGATCCTTCCTGAATTACTGTCTAACACGTTTGTCAACTACAGTCCGTTCGTTGCTGGCTGGGGCAGAACGTCAGATCGAGGTCCCGGTTCGAGCCATCTCAAACTGACTCAATTGCAAGTAGTCGATAACCAAAAGTGTAAGAAAACGTACCTGGAGTACCCCGCCGTGATTGATGATAAGGTCTTGTGTGCTGAAGCGGGAGGACGCGACGCCTGCGAAGGGGACAGCGGGGGACCCCTTATACAACCATTTTATAATCAGGATAAGAAAGTGTATTACTTCTACCAGACAGGTGTTGTAGCGTACGGAAGACGTTGTGCTGAAGCCGGTTACCCCGGGGTATATTCCAGGGTAACTCACTACATACTCTGGATACAGAAGCACATCATGGAGAACTAA

Protein sequence:

>DPOGS215189-PA
MVLRVVFLCLCFQTALSRIEYRREESCKEVNGVTGRCVSIESCPPFVLMMQTELITQYKTLLKQSHCGFEGNVPMVCCPDTSPESPSSLVPSSPVDVPSKGSARMMNLQSPFLSPPTCGVSNASSGRVVGGVDAKLGDLPWMCLLGYWEGGYDKGGSNGDTKWRCGGSLVSPQHVLTAAHCIHHREKELYVVRLGELDLDRDDEAAPIDVLIRRAIKHEAYNRDTYTNDIGLLVLERGVEFTNLIRPICLPILPELLSNTFVNYSPFVAGWGRTSDRGPGSSHLKLTQLQVVDNQKCKKTYLEYPAVIDDKVLCAEAGGRDACEGDSGGPLIQPFYNQDKKVYYFYQTGVVAYGRRCAEAGYPGVYSRVTHYILWIQKHIMEN-
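
The transcript and protein lengths in this record are consistent with each other: 1152 bp is 384 codons, that is 383 amino acids plus one stop codon, which the protein sequence shows as a trailing "-". The sketch below illustrates that check and the translation itself. It assumes the standard genetic code and a coding sequence that starts in frame at the first base; the function names translate and expected_protein_length are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the record uses the standard code; "*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a CDS of transcript_bp bases ending in one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the final codon is the stop, not a residue


if __name__ == "__main__":
    # First eight codons of DPOGS215189-TA
    print(translate("ATGGTTCTCCGAGTAGTATTTCTG"))  # MVLRVVFL
    # Last five codons of DPOGS215189-TA, including the TAA stop
    print(translate("ATCATGGAGAACTAA"))           # IMEN-
    print(expected_protein_length(1152))          # 383
```

Passing the full DPOGS215189-TA sequence to translate should yield a string that can be compared directly with the DPOGS215189-PA entry above.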