Monarch geneset OGS2.0

DPOGS215195
TranscriptDPOGS215195-TA1848 bp
ProteinDPOGS215195-PA615 aa
Genomic positionDPSCF300143 - 170444-173779
RNAseq coverage66x (Rank: top 67%)
Annotation
HeliconiusHMEL0038760.078.47% 
BombyxBGIBMGA008673-TA0.071.00% 
DrosophilaCG7432-PB3e-12956.09% 
EBI UniRef50UniRef50_E2A4884e-13954.03%Proclotting enzyme n=4 Tax=Endopterygota RepID=E2A488_CAMFO
NCBI RefSeqXP_001944076.11e-14253.54%PREDICTED: similar to proclotting enzyme [Acyrthosiphon pisum]
NCBI nr blastpgi|3838613928e-14556.24%PREDICTED: proclotting enzyme-like [Megachile rotundata]
NCBI nr blastxgi|3838613921e-14655.65%PREDICTED: proclotting enzyme-like [Megachile rotundata]
Group
Gene OntologyGO:00038242.9e-94catalytic activity
GO:00042521.6e-90serine-type endopeptidase activity
GO:00065081.6e-90proteolysis
KEGG pathway 
InterPro domain[362-615] IPR0090032.9e-94Peptidase cysteine/serine, trypsin-like
[371-610] IPR0012541.6e-90Peptidase S1/S6, chymotrypsin/Hap
[402-417] IPR0013144.4e-12Peptidase S1A, chymotrypsin-type
[220-263] IPR0066042.4e-08Disulphide knot CLIP
Orthology groupMCL15898 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215195-TA
ATGACTAACGTGGATGTTTTGGGACCTAAGCTCTTTTGCTGCAGGAATGAATACCTGCTGATCGGGTTAGCGTTAGAGACGAAATGCCAACAAGTCCACCCCGTTACATTCATAAGACCGGGACAATACTATCAGCACGGGCACCCGCCTTGGCATCGATTCACATCACCTCGCAAAAGATCACCTGAACCACAGGCCTATTACAACGGACCCCACGAGCTATCGTACAATCAAAATCCCTTCCAGAGTAATGAGTTCTCGGCCAACCAACAGACTCAAAGACAGGACATCCAGGGACATCAAAACTATGCAAGTAGATCATCAGCTGACCTCATCTATACCGAGACAAGAGACGTTCCCAATGACGGAAGACTACTCTCAGATTCAGCATTCACCAGGATATCGGAAACTCTCGGAGCGATCAACACCGTCGGCCACTACCTGGTGGACATCGTCAACGAGAATGATAGAAACGAATCCGATCCCAACCTGAAGCAGCTTCCGAACGCTATATACACCATCAGCAAGAATGTCTTAGGTAGAAATGTTACAGATACAATTGCTCCTATAGTCAAGAAAGCGTTACCCAGAGTTTTGCCCGACGCACCGATCACTAGAATAGCTACAGCCGATGTGAGCCAAAGTAATTCCAAATCCTGCACCACCCCCGACGGAGAGCAGGGGGTCTGCGAGGATTTGAGCAACTGCCCTCAATTGCTTCTGAATCTCATAAGTCTCAGAGAATCGTTATGTTTCAAAGATCTTTTCGTTCCCGGCGTGTGTTGTCCACGAAACGCTGTCGTATCATCGACGCCGGCTGTAGAGAAGCCAGTCCAGAGTACGACCAGCAAACCCACTTATTTAGTACCTATAACGACTCAGCGGCCGGTTCAAAAACCGACGACGACGAAAAAGCCTTCGGCCATCTTGGTTCTGACCACCAAAAAGCCAAAGACAACCAGCCCTAGACCTACGAAGACGCCTACCACTGTTACCACACCGAGAGCACCGTCCACCACCAGCTTCTACACTGTGACACCACCGTTCTTAGGAAATTATTCCAACATCGTCGACGTCAACGACTGTGGTCAGCGTGAAGACGAAGGAGGTCGCATAGTCGGCGGTACCGAGTCCAAGCCCGGCGCGTGGCCCTGGATGGCGGCCATATACCTGCACGGGAACAAGCGCAAGGAGTTCTGGTGCGGCGGGACGTTGGTTGGCAGCCGACACGTGCTCACCGCCGCGCACTGCACCAGGGACTCCAAACAGAGGCCGTTTCCTCCCCGTCAGTTCTCAGTGCGTCTGGGGGACGTGGACCTCTCCCGCACCGATGAGCCCTCGCGGCCGCTGACCGCTCGCGTCACGGCCGTGCGGGCACACGAGCAGTTCTCACGAGTCGGATACTACAACGACATCGCCGTGCTGGTGCTGGCTGAGAACGTCCCCAAATCAAAATACGTGATACCAATCTGTCTCCCCAAGGGTGAGGCGGGTCGCCAGCAGTTTGACGGCATGGTAGGGACCGTGGTGGGCTGGGGCACCACCAGATACGGCGGCGGAGAGAGCTCCACACAGCTGGAGGCGCGACTTCCGGTCTGGAGGAACGAAGACTGCGACCGGGCTTACTTCCAACCAATCACGGACACTTTCCTTTGCGCCGGATACCCCAGAGGAGGGGTCGATGCCTGTCAGGGAGACTCAGGAGGCCCCCTCATGCTGCAGATCCAAGGTCGGTGGACACAGATCGGGGTGGTGTCCTTCGGTAACAAGTGCGGAGAGCCGGGCTACCCCGGGGTATACACCAGGGTCACTCACTACCTCGGCTGGCTGAAGAACAATCTCACCTAA

Protein sequence:

>DPOGS215195-PA
MTNVDVLGPKLFCCRNEYLLIGLALETKCQQVHPVTFIRPGQYYQHGHPPWHRFTSPRKRSPEPQAYYNGPHELSYNQNPFQSNEFSANQQTQRQDIQGHQNYASRSSADLIYTETRDVPNDGRLLSDSAFTRISETLGAINTVGHYLVDIVNENDRNESDPNLKQLPNAIYTISKNVLGRNVTDTIAPIVKKALPRVLPDAPITRIATADVSQSNSKSCTTPDGEQGVCEDLSNCPQLLLNLISLRESLCFKDLFVPGVCCPRNAVVSSTPAVEKPVQSTTSKPTYLVPITTQRPVQKPTTTKKPSAILVLTTKKPKTTSPRPTKTPTTVTTPRAPSTTSFYTVTPPFLGNYSNIVDVNDCGQREDEGGRIVGGTESKPGAWPWMAAIYLHGNKRKEFWCGGTLVGSRHVLTAAHCTRDSKQRPFPPRQFSVRLGDVDLSRTDEPSRPLTARVTAVRAHEQFSRVGYYNDIAVLVLAENVPKSKYVIPICLPKGEAGRQQFDGMVGTVVGWGTTRYGGGESSTQLEARLPVWRNEDCDRAYFQPITDTFLCAGYPRGGVDACQGDSGGPLMLQIQGRWTQIGVVSFGNKCGEPGYPGVYTRVTHYLGWLKNNLT-