Monarch geneset OGS2.0

DPOGS215182
Transcript: DPOGS215182-TA (1341 bp)
Protein: DPOGS215182-PA (446 aa)
Genomic position: DPSCF300143 - 364767-374771
RNAseq coverage: 154x (Rank: top 53%)
Annotation
Heliconius       HMEL003876        9e-44  31.57%
Bombyx           BGIBMGA008668-TA  2e-90  48.63%
Drosophila       CG32260-PA        3e-55  34.85%
EBI UniRef50     UniRef50_Q8I925   7e-93  43.47%  Coagulation factor-like protein 3 n=1 Tax=Hyphantria cunea RepID=Q8I925_HYPCU
NCBI RefSeq      NP_001036891.1    3e-91  48.25%  clip domain serine protease 4 [Bombyx mori]
NCBI nr blastp   gi|25989209       2e-92  43.47%  coagulation factor-like protein 3 [Hyphantria cunea]
NCBI nr blastx   gi|25989209       6e-95  43.47%  coagulation factor-like protein 3 [Hyphantria cunea]
Group
Gene Ontology    GO:0003824  2.6e-79  catalytic activity
                 GO:0004252  1.5e-70  serine-type endopeptidase activity
                 GO:0006508  1.5e-70  proteolysis
KEGG pathway
InterPro domain  [200-441]  IPR009003  2.6e-79  Peptidase cysteine/serine, trypsin-like
                 [209-437]  IPR001254  1.5e-70  Peptidase S1/S6, chymotrypsin/Hap
                 [242-257]  IPR001314  4.5e-13  Peptidase S1A, chymotrypsin-type
                 [115-166]  IPR022700  1.6e-08  Proteinase, regulatory CLIP domain
Orthology group  MCL27843 Specific divergent

Nucleotide sequence:

>DPOGS215182-TA
ATGGTTAGAAATACCGTTTACTTGACTGCAGATCTCGACCGATCCGTCCGGATACTGACTCAGGAAGTCCACAAGGACACTTGCACCACATTGGAGGGTGGCATCGGAACGTGTACGCCAGCTGTCTCATGCGCTGCGTACTTCGATCTGCAGCGACAAGCCAAAAGCTTAACTGCTTCTTTCCAGCTTAGAGACTCGCAATGTAAACCAAATGGATATAATGATATGATCTGCTGCCCAGTTGCAAATGAGATACCAAGAGAAACGACGCTGCCATTTCGGGACTTAGACAAAGAATATAACTGTGGTGAAGACCAAAGGAATGCTCAACTTGACGAAACGTGCACCACAATTGAAGGTGGTGTTGGCAAGTGTGAGTCGTTAGCAGCCTGCGAGCCGTACCTTCATCTGACGAGACAAGCCAAAAACATTCCCTTGGCTATTCAACTTAGAGATGCTCAATGCGGTTCAGACGGAAACGACCAAAAGGTCTGCTGTCCTACTTCAGGTACCTCCACTTCATCGCCAACTGGCGAACCTTCCTTCAGGTCACTATCAGAGTCAGACTACATAACTGCCTTCCCTGAACCACCAGATTGTGGATTCAGTTTAGCACACTTTAACAGAGTTGTGGGAGGTGTGAACGCTAAACTCGGAGGCTTCCCATGGATGGCACTTCTTGGTACCAAACAAGAAAACTGGGACACAGCACGTTGGATATGTGGGGGAAGTCTGATCTCTCACCGCCACGTCCTGACCGCTGCTCACTGTATAAAGAATGAATTGAACGTGGTCCGACTTGGAGAACTGGACTTCGAAAGAGACAACGATGGCGCTTCTCCCATAGACTTATCCATTAAAAGAAAAATCAAACATGAAAACTTCGACTACGCTTCCTTCACTAATGACATCGGCCTCTTGATATTGGGAAAGGATGTGGAGTTCTCAAAATTTATCCGACCGATCTGTCTACCGACGAGTGCAAATACGAGCTGGAATCCTCTTGTAGGCTACAACCAGTTCCTCGCTGGCTGGGGAAACATTGACAACCGCGGGGCTTCTTCATCTCACCTGCTATATGTGGAGCTGCCTGTCGTGAACAACTCGGTATGCGAGACAGCTTATGAGTCGCGGGTCATCGATGAGAGAGTTATGTGTGTTGGCAGCATCTTTAAAGACTCCTGCTCCGGGGACAGCGGTGGACCGCTCATGGACAATATAACCGGCGTTGTGTCGTATGGTCACACTAAATGCGGTGAAGCAAATTTTCCAGGCGTCTACAGTTCACTGGCGTACTTCTTGCCCTGGATACGGGAAAATGTGCTGGGATTTGTAGAATAG

Protein sequence:

>DPOGS215182-PA
MVRNTVYLTADLDRSVRILTQEVHKDTCTTLEGGIGTCTPAVSCAAYFDLQRQAKSLTASFQLRDSQCKPNGYNDMICCPVANEIPRETTLPFRDLDKEYNCGEDQRNAQLDETCTTIEGGVGKCESLAACEPYLHLTRQAKNIPLAIQLRDAQCGSDGNDQKVCCPTSGTSTSSPTGEPSFRSLSESDYITAFPEPPDCGFSLAHFNRVVGGVNAKLGGFPWMALLGTKQENWDTARWICGGSLISHRHVLTAAHCIKNELNVVRLGELDFERDNDGASPIDLSIKRKIKHENFDYASFTNDIGLLILGKDVEFSKFIRPICLPTSANTSWNPLVGYNQFLAGWGNIDNRGASSSHLLYVELPVVNNSVCETAYESRVIDERVMCVGSIFKDSCSGDSGGPLMDNITGVVSYGHTKCGEANFPGVYSSLAYFLPWIRENVLGFVE-
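
Consistency check:

The lengths in this record agree with each other if the 1341 bp transcript is read as coding sequence only: 1341 / 3 = 447 codons, which is 446 residues plus one stop codon (the trailing "-" in the protein sequence). The Python sketch below shows one way to run that check and to translate a coding sequence for comparison with the protein. It is not part of the database record. The function names are hypothetical, and it assumes the standard genetic code (NCBI translation table 1) and a transcript with no untranslated regions.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# in T, C, A, G order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def expected_protein_length(transcript_bp):
    """Residue count implied by a CDS-only transcript ending in a stop codon.

    Assumption (not stated in the record): the transcript has no UTRs.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1  # subtract the terminal stop codon


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" matches this record);
    codons containing characters other than A, C, G, T become "X".
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    print(expected_protein_length(1341))   # residues implied by 1341 bp
    print(translate("ATGGTTAGAAATACC"))    # first five codons of DPOGS215182-TA
```

Passing the full DPOGS215182-TA sequence to translate() gives a string that can be compared directly against the DPOGS215182-PA entry above.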