Monarch geneset OGS2.0

DPOGS207119
Transcript: DPOGS207119-TA (1371 bp)
Protein: DPOGS207119-PA (456 aa)
Genomic position: DPSCF300001 + 3381848-3385829
RNAseq coverage: 155x (Rank: top 53%)
Annotation
Heliconius: HMEL013269, 3e-57, 39.58%
Bombyx: BGIBMGA012777-TA, 2e-89, 55.51%
Drosophila: yip7-PA, 2e-27, 32.62%
EBI UniRef50: UniRef50_C9W8F4, 3e-91, 52.05%, Serine protease 24 n=1 Tax=Mamestra configurata RepID=C9W8F4_9NEOP
NCBI RefSeq: NP_001037037.1, 5e-65, 42.62%, 35kDa protease [Bombyx mori]
NCBI nr blastp: gi|304443595, 1e-90, 52.05%, serine protease 24 [Mamestra configurata]
NCBI nr blastx: gi|304443595, 2e-90, 50.00%, serine protease 24 [Mamestra configurata]
Group
Gene Ontology:
GO:0003824, 2.1e-61, catalytic activity
GO:0004252, 7.7e-57, serine-type endopeptidase activity
GO:0006508, 7.7e-57, proteolysis
KEGG pathway 
InterPro domain:
[35-284] IPR009003, 2.1e-61, Peptidase cysteine/serine, trypsin-like
[39-276] IPR001254, 7.7e-57, Peptidase S1/S6, chymotrypsin/Hap
[70-85] IPR001314, 1.9e-13, Peptidase S1A, chymotrypsin-type
Orthology group: MCL30114, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species
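
The transcript and protein lengths in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative and not part of the record. The function names are hypothetical, and it assumes that the transcript is a complete coding sequence with one stop codon and that the genomic coordinates are 1-based and inclusive.

```python
def expected_cds_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


# Values copied from the record: 456 aa, 1371 bp, 3381848-3385829.
cds_bp = expected_cds_length(456)
span_bp = genomic_span(3381848, 3385829)

print(cds_bp)            # 1371, equal to the listed transcript length
print(span_bp)           # 3982
print(span_bp - cds_bp)  # 2611 bp of the locus not present in the transcript
```

Under these assumptions the genomic span is longer than the transcript, which would be consistent with the gene containing one or more introns.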

Nucleotide sequence:

>DPOGS207119-TA
ATGTTGTGGCTAGCAGTAGCGTTTGTGTTGGCTCTGGCTACCAACCAGGTACTAGGAGAACAAGTAACTCCAGGATTTTTGGAAGACATTGACAATAAAGAGGCATGGGACGCTCGGATTGTATCAGGCTGGAAGGCCTATCCTGGTCAGCACCCACATATGGTTTCTTTGCGCATGGTGAATAATGTGGGAACCGTTCAAGCCTGCGGAGGAAGTATCGTTTCCCGACAGTGGATCATCACTGCAGCGCATTGTACAGCAGCACAAGTGTCGCTCCTCGTTCGGGCCGGTGTTACTAATCTAACTCGACCAGAAATTTTTACAGAGACCCAGGAATATTATATGTATCCAACTTATAACCCAACAAATCCAGGTCTTGTTCAGCCTAACGATATAGCTATGGTAAAATTGCAGATTTCATTAACGTATTCGCAATATTTAAAGCCTATTCGTATCCAGTCCTCAGTTGATGCACACAAAGACTATGACAACTTAATTGTATATGCAAGTGGATTTGGTAGAACCTGGACAAACGGTCCCACTACTGAACATCTCCTCTGGGTCTATCTCCGTGGAGTCTCAAATGCTGCATGCACAAATATATTTACCTCAAAATACGTTACTGAAAACACCGTTTGCGCCAAATTCTTCAATGTTACCTCTCAATCTATTTGCCAGGGTGACAGTGGCGGACCTTTAGTGCACGTTTCTTCAGAAAATGCACATACCCTCGTAGGAGTGAGTTCATTTGTAGCAGCGTCACCCATCGGCTGTCACTCAGGAGTACCCGGAGCTTTTATTCGTCCTGGAGCTTTCCATTCTTGGTTCACCCAAATTTCTGGTATTGATTTTGAGAATCCTGTAGATGATAAACCAACTACAACGTCTACTTCCACAACCACAGCCGCTCCAACGACAAAAGCTTCTACGACTACTGTAACTTTCGCTCCGTCTACAACTTCTACTTCCGCCCCTTCTACAACTAGTAGTAGCACTTCCGCTCCGTCTACTACTAGCTCTACCTCCGCCCCTCCTACTACTAATATGACTTACGCTCCGTCTACCACTAGCTCTACCTCCGCCCCTACTACCACTAGAAGCACTTACGCTCCGTCTACCACTACTCTGAGCCCTATCACTACAACTTCAGCACTATCTACTACAACTGCAGCACCTTCTAGTACTACTTTATCACCTTCCACCACTTCTTTGGCTCCCACTACACCTACCACACCCGCACCAGAAGAAGAAGACGAAGACGATGCAGAACTAGAGGACCTTCTCAAGAGATTAGAAGTGAAAGTAAAAGTTAGGGTCATATTAAGTAAATATTTAAAGAAGAAGAAGCAAATACAAGAGAAAACCCTTTAA

Protein sequence:

>DPOGS207119-PA
MLWLAVAFVLALATNQVLGEQVTPGFLEDIDNKEAWDARIVSGWKAYPGQHPHMVSLRMVNNVGTVQACGGSIVSRQWIITAAHCTAAQVSLLVRAGVTNLTRPEIFTETQEYYMYPTYNPTNPGLVQPNDIAMVKLQISLTYSQYLKPIRIQSSVDAHKDYDNLIVYASGFGRTWTNGPTTEHLLWVYLRGVSNAACTNIFTSKYVTENTVCAKFFNVTSQSICQGDSGGPLVHVSSENAHTLVGVSSFVAASPIGCHSGVPGAFIRPGAFHSWFTQISGIDFENPVDDKPTTTSTSTTTAAPTTKASTTTVTFAPSTTSTSAPSTTSSSTSAPSTTSSTSAPPTTNMTYAPSTTSSTSAPTTTRSTYAPSTTTLSPITTTSALSTTTAAPSSTTLSPSTTSLAPTTPTTPAPEEEDEDDAELEDLLKRLEVKVKVRVILSKYLKKKKQIQEKTL-
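
The protein sequence corresponds to a codon-by-codon translation of the transcript. The sketch below is illustrative and not part of the record. It assumes the standard genetic code, and `translate` is a hypothetical helper name. It writes the stop codon as "-" to match the convention used in the protein sequence above, and it translates only the first and last few codons of DPOGS207119-TA.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1, ignoring trailing partial codons."""
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 bases of the transcript shown above.
print(translate("ATGTTGTGGCTAGCAGTAGCGTTTGTGTTG"))  # MLWLAVAFVL
print(translate("AAAACCCTTTAA"))                    # KTL-
```

Passing the full nucleotide sequence to the same function should reproduce the full protein sequence, including the trailing "-".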