Monarch geneset OGS2.0

DPOGS215183
TranscriptDPOGS215183-TA2304 bp
ProteinDPOGS215183-PA767 aa
Genomic positionDPSCF300143 - 346853-360250
RNAseq coverage104x (Rank: top 60%)
Annotation
HeliconiusHMEL0038761e-4833.18% 
BombyxBGIBMGA008668-TA9e-10553.55% 
DrosophilaCG1299-PA6e-6032.30% 
EBI UniRef50UniRef50_Q8I9253e-10345.88%Coagulation factor-like protein 3 n=1 Tax=Hyphantria cunea RepID=Q8I925_HYPCU
NCBI RefSeqNP_001036891.13e-10553.10%clip domain serine protease 4 [Bombyx mori]
NCBI nr blastpgi|1129828425e-10453.10%clip domain serine protease 4 precursor [Bombyx mori]
NCBI nr blastxgi|259892092e-10747.32%coagulation factor-like protein 3 [Hyphantria cunea]
Group
Gene OntologyGO:00038244.5e-81catalytic activity
GO:00042522.1e-74serine-type endopeptidase activity
GO:00065082.1e-74proteolysis
KEGG pathway 
InterPro domain[509-762] IPR0090034.5e-81Peptidase cysteine/serine, trypsin-like
[518-758] IPR0012542.1e-74Peptidase S1/S6, chymotrypsin/Hap
[551-566] IPR0013141.9e-12Peptidase S1A, chymotrypsin-type
[32-83] IPR0227003.2e-08Proteinase, regulatory CLIP domain
Orthology groupMCL27843 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215183-TA
ATGTTACCTCCTCGCGTAATGAGTAGTGCTCTGTTGACAGTGCTGTGTGTCTGTTTGTGTCCGCAAGCTGTGTTTTGTCAATACCACGAAACGTGCACCACAATTGAAGGTGGTGTTGGCAAGTGTGAGTCGTTAGCAGCCTGCGAGCCGTACCTTCATCTGACGAGACAAGCCAAAAACATTCCCTTGGCTATTCAACTTAGAGATGCTCAATGCGGTTCAGACGGAAACGACCAAAAGGTCTGCTGTCCTACTTCAGGTACCTCCACTTCATCGCCAACTGGCGAACCTTCCTTCAGGTCACTATCAGAGTCAGACTACATAACTGCCTTCCCTGAACCACCAGATTGTGGATTCAGTTTAGCACACTTTAACAGAGTTGTGGGAGGTGTGAACGCTAAACTCGGTGACTTCCCCTGGATGGCACTTCTTGGTACCAAACAAGGAAACTGGGACGCAGCACGTTGGATATGTGGGGGAACTCTGATCTCTCACCGCCACGTCCTGACCGCTGCTCACTGTATAAAGAATGAATTGAACGTGGTCCGACTTGGAGAACTGGACTTCGAAAGAGACAACGATGGCGCTTCTCCCATAGACTTTTCCATTAAAAGAAAAATCAAACATGAAAACTTCGACTACGCTTCCTTCACTAATGACATCGGCCTTTTGATATTGGGAAAGGATGTGGAGTTCTCAAGTAGTTTTTCTTCAACTCAGCTACAGTATGTGACGCTGCCTGTTGTGAACAACTCGGTTTGCGAGACAACTTTTGAGTCGCGGGTCATCGATGAGAGAGATATGTGTGTTGGCAGAGTCTTTAAAGACTCCTGCTCCGGGGACAGCGGTGGACCGCTCATGGACAATATATCAAGAGCCAGCCCCAGTCGCTGGCAGTGCTTGCTCGTACACGCCATGTCACCTCCTCACGTAATGAGTAGTGCTCTGTTGACAGTGCTGTGTGTCTGTTTGTGTCCACAAGCTGTGTTTTGTCAATACCAGGAAACTTGCACCACATTAGAAGGTGGCATCGGAACATGTACGCCAACTATCTTATGCGCTCCGTACTTCAATCTGCTGGGAGTAGCCAAAAACCTCACTATTTCTTTTCAACTTAGAGATGCTCAATGTGGGTCACATGGTATCAACATCATGGTCTGCTGCCCAAATCAAAATGAGCCACCTCCAGAAACAAGCAAATTATTATTGACCTTAAATACCTCCGATGACTCTAATAAGGATCAAGGGATCACTCAACTTGACGAAACGTGCACCACAATTGAAGGTGGTGTTGGCAAGTGTGAGTCGTTAGCAGCCTGCGAGCCGTACCTTCATCTGACGAGACAAGCCAAAAACATTCCCTTGGCTATTCAACTTAGAGATGCTCAATGCGGTTCAGACGGAAACGACCAAAAGGTCTGCTGTCCTACTTCAGGTACCTCCACTTCATCGCCAACTGGCGAACCTTCCTTCAGGTCACTATCAGAGTCAGACTACATAACTGCCTTCCCTGAACCACCAGATTGTGGATTCAGTTTAGCACACTTTAACAGAGTTGTGGGAGGTGTGAACGCTAAACTCGGTGACTTCCCCTGGATGGCACTTCTTGGTACCAAACAAGGAAACTGGGACGCAGCACGTTGGATATGTGGGGGAACTCTGATCTCTCACCGCCACGTCCTGACCGCTGCTCACTGTATAAAGAATGAATTGAACGTGGTCCGACTTGGAGAACTGGACTTCGAAAGAGACGACGATGGCGCTTCTCCCATAGACTTTTCCATTAAAAGAAAAATCAAACATGAAAACTTCGACTACGCTTCCTTCACTAATGACATCGGCCTTTTGATATTGGGAAAGGATGTGGAGTTCACTCGTCTGATGCGGCCGATCTGTCTGCCGACTCGTGAAGACCTACGTTCAAAATCTTTTGTTGGCTACCATCCTTTCATCGCCGGTTGGGGAAACGTCGACAACCGTGGTGCTGCTAAATCTCACATGCAAGTTGCGCAGCTGCCTGTCCTGGAAAACTCCAAATGCAGGAGGGTTTACGAATTGCGGGTCATCGACGAAAGGGTCATGTGTGCTGGCGTCACAGGCAAAGACTCCTGCAATGGTGACAGTGGCGGACCGCTCATGCAACCGAATACGAACCGGACAACGGGTAAAATATATTTCTATCAGACCGGCGTGGTGTCGTATGGTCACACTAGATGTGGTGAAGCGAGTTTCCCAGGCGTGTACAGCTCAGTGCAGCACTTCCTGCCCTGGATACAGAAACACGTGCTGGGATCGGACGAATGA

Protein sequence:

>DPOGS215183-PA
MLPPRVMSSALLTVLCVCLCPQAVFCQYHETCTTIEGGVGKCESLAACEPYLHLTRQAKNIPLAIQLRDAQCGSDGNDQKVCCPTSGTSTSSPTGEPSFRSLSESDYITAFPEPPDCGFSLAHFNRVVGGVNAKLGDFPWMALLGTKQGNWDAARWICGGTLISHRHVLTAAHCIKNELNVVRLGELDFERDNDGASPIDFSIKRKIKHENFDYASFTNDIGLLILGKDVEFSSSFSSTQLQYVTLPVVNNSVCETTFESRVIDERDMCVGRVFKDSCSGDSGGPLMDNISRASPSRWQCLLVHAMSPPHVMSSALLTVLCVCLCPQAVFCQYQETCTTLEGGIGTCTPTILCAPYFNLLGVAKNLTISFQLRDAQCGSHGINIMVCCPNQNEPPPETSKLLLTLNTSDDSNKDQGITQLDETCTTIEGGVGKCESLAACEPYLHLTRQAKNIPLAIQLRDAQCGSDGNDQKVCCPTSGTSTSSPTGEPSFRSLSESDYITAFPEPPDCGFSLAHFNRVVGGVNAKLGDFPWMALLGTKQGNWDAARWICGGTLISHRHVLTAAHCIKNELNVVRLGELDFERDDDGASPIDFSIKRKIKHENFDYASFTNDIGLLILGKDVEFTRLMRPICLPTREDLRSKSFVGYHPFIAGWGNVDNRGAAKSHMQVAQLPVLENSKCRRVYELRVIDERVMCAGVTGKDSCNGDSGGPLMQPNTNRTTGKIYFYQTGVVSYGHTRCGEASFPGVYSSVQHFLPWIQKHVLGSDE-