Monarch geneset OGS2.0

DPOGS215220
TranscriptDPOGS215220-TA966 bp
ProteinDPOGS215220-PA321 aa
Genomic positionDPSCF300143 + 289980-293798
RNAseq coverage79x (Rank: top 65%)
Annotation
HeliconiusHMEL0038762e-4136.36% 
BombyxBGIBMGA008668-TA2e-8653.63% 
DrosophilaCG1299-PA1e-4639.05% 
EBI UniRef50UniRef50_Q8I9258e-8554.24%Coagulation factor-like protein 3 n=1 Tax=Hyphantria cunea RepID=Q8I925_HYPCU
NCBI RefSeqNP_001036891.11e-8453.63%clip domain serine protease 4 [Bombyx mori]
NCBI nr blastpgi|259892093e-8454.24%coagulation factor-like protein 3 [Hyphantria cunea]
NCBI nr blastxgi|259892095e-8454.24%coagulation factor-like protein 3 [Hyphantria cunea]
Group
Gene OntologyGO:00038246.6e-80catalytic activity
GO:00042521.6e-65serine-type endopeptidase activity
GO:00065081.6e-65proteolysis
KEGG pathway 
InterPro domain[46-315] IPR0090036.6e-80Peptidase cysteine/serine, trypsin-like
[56-311] IPR0012541.6e-65Peptidase S1/S6, chymotrypsin/Hap
[89-104] IPR0013144.1e-13Peptidase S1A, chymotrypsin-type
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215220-TA
ATGATTAATTTAAGTGTAACAGTTTTGTGTGTTTGTGTGTGTTTTGAACATGTTTTGTGTAACGACGATGATGTAGACACTACAGACAATTTAGAAAATTCTGACTACATCACCGACTTCCCCGAGCTTCCGGTCTGCGGGATATCCAAAAATGGTTTTTATACAAAAGTTGTTGGAGGAGTCGATGCCAATCTTGGTGAATTTCCCTGGATGGCCCTCCTAGGATACAACGATAGTGCAGGGAATGGCACGTTTTGGTCTTGCGGTGGCTCGCTGATCTCTCAACGCCACGTCCTTACCGCAGCTCACTGTATACACAACCACGACGAATTATACGTGGTCCGCTTAGGTGAGCTGGACGTTCTTCGTAATGATGACGGAGCGACTCCTCTCGATGTTCACATTAAACGCAAAATAGAACACGAAGCTTACAGCGCTAACTCGTTTCAACACGACATCGGGCTTTTAATATTGGACACTGACGTCGTATTCAGTGACCTCATCAGGCCAATTTGCATCCCATTGCTTCCGGAGCTGCGCAACAACTTGTTTGAAGACTACAACCCATTTATCGCCGGTTGGGGATACAATGCGTTCCCCGGCCACGCAAACGTTCAATTCAGATTTGGCGAACTTCGGAAATCACATCTGCAAAAGGTGTCTGTGCCAGTCACTAGGCTTTCCCAATGCCAGGAAGTTTACAAGAGTTATGGAAAAAGTCTGAAGATAGATGACAAGGTCATATGCGGTGGATATGAGGGCGGCAAGAACTCCTGCAAGGGAGACAGCGGCGGACCTCTCATGTTGCCCAATACCAACTCGGAACAACAAGTATATTTCTACCAAATCGGGATTGTGTCGTTAGCTCCATTATGCGCATTGAAGAATTATCCTACCGTCTTCACTAGAGTCACCCACTACATACCATGGCTACAGACACAGGTTTTGGGAAGAGCTGACTTTTGA

Protein sequence:

>DPOGS215220-PA
MINLSVTVLCVCVCFEHVLCNDDDVDTTDNLENSDYITDFPELPVCGISKNGFYTKVVGGVDANLGEFPWMALLGYNDSAGNGTFWSCGGSLISQRHVLTAAHCIHNHDELYVVRLGELDVLRNDDGATPLDVHIKRKIEHEAYSANSFQHDIGLLILDTDVVFSDLIRPICIPLLPELRNNLFEDYNPFIAGWGYNAFPGHANVQFRFGELRKSHLQKVSVPVTRLSQCQEVYKSYGKSLKIDDKVICGGYEGGKNSCKGDSGGPLMLPNTNSEQQVYFYQIGIVSLAPLCALKNYPTVFTRVTHYIPWLQTQVLGRADF-