Monarch geneset OGS2.0

DPOGS207109
TranscriptDPOGS207109-TA1296 bp
ProteinDPOGS207109-PA431 aa
Genomic positionDPSCF300001 + 3197033-3202091
RNAseq coverage557x (Rank: top 23%)
Annotation
HeliconiusHMEL0132699e-8754.26% 
BombyxBGIBMGA012788-TA4e-9758.53% 
Drosophilayip7-PA2e-3035.51% 
EBI UniRef50UniRef50_B6D1Q95e-10163.93%Trypsin-like serine proteinase T26 n=2 Tax=Ostrinia nubilalis RepID=B6D1Q9_OSTNU
NCBI RefSeqNP_001037037.12e-7246.05%35kDa protease [Bombyx mori]
NCBI nr blastpgi|2093953804e-10263.35%trypsin-like serine proteinase T26 [Ostrinia nubilalis]
NCBI nr blastxgi|2093953807e-10354.20%trypsin-like serine proteinase T26 [Ostrinia nubilalis]
Group
Gene OntologyGO:00038242.8e-63catalytic activity
GO:00042523e-55serine-type endopeptidase activity
GO:00065083e-55proteolysis
KEGG pathway 
InterPro domain[35-279] IPR0090032.8e-63Peptidase cysteine/serine, trypsin-like
[37-271] IPR0012543e-55Peptidase S1/S6, chymotrypsin/Hap
[68-83] IPR0013146.9e-13Peptidase S1A, chymotrypsin-type
Orthology groupMCL18024 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207109-TA
ATGGCCGCGTTAGGTGCTCTTCTCGTTTTATTCGCTGTGTCGTGGGTTGGCGCTTATCCTAGTCCGCAACTCGCTGACTTCCCCGAAAATGCACGAGGAGGGAATACGAGGATAGTTTCTGGTTGGGAGGCAGTTAATGGCCAAATCCCATACCAACTATCTCTCCGTATGGTGGATGCCAGCGGCAGTGTGTTTGGATGTGGCGCTACCCTTATCCACAACGAATGGGCCATGACTGCTGCACATTGCACTGCTCGCCGGGTGACCATCGTCATCCGCGTGGGTGGAGTTGAGCTTTCCCGACCATCTCTGATATTGGAGTCTACGGAATACTACAACCATCCCTTGTACATCGAATCCTTAGCTGGTGTCGTTCAACCCAACGACATCGGTCTGATCAAACTCAACAAATACGTTCAATATAACGACCTCATTCAGCCCATCAGAATCCAACGTGACGCAGACAAAAATAAGGACTACAGCGAAGTGAGGCTTCAAGCTAGCGGCTGGGGCAGAACTTGGACTAACGGAGCCTCCCCTCAAATCTTGAACTGGGTCTATTTACTTGGAGTTAGCAACAATTTCTGCCGCTCTCTATTCACTAACATCGTCGTTGACTCTACAATCTGTGCAAACGCATACAATGTGTCATCTCAGTCCACTTGCCAGGGTGACAGCGGTGGCCCCTTGGTTGTCGTTGATGAAGATGGCCAACTTACTCAGGTCGGTGTATCTTCCTTCGTGTCCAGCTCCGGCTGCCACACACCTTTACCCGCTGGTTTCATCCGCCCTGGACATTATCATAGCTGGTTCAGTGAAATTACTGGCATCGACTTTGATTGGGATTTCGTCCAGCCTACTGTACCTGAACCCAGTACTACAGAAGAAGTAACAACCAGTGTCACTGAAGATTTAACAACTGAACCTGCTGAAGATGTAACAACTGGCGCTCCTGAAGATTTAACAACTGAACCTGCTGAAGATGTAACAACTGGCGCCCCTGAAGATTTAACAACTGAACCTTCCGAAGAAGTAACAACTGGTGCTCCTGAAGATTTAACAACTGAAGGTGCTGAAGAAATAACAACTGAGGCTTCTACTGCTGCCCCTGAAGAAGAAGAAGAACAGGAGGAGGACAAGGACGAGGACGAGGATGACGAAGACGAAGATGAGGACGAAGAAGATGAGTCTGGCAGTGAAGAAGACAGTGAAGATGATGAAGACGATGAAGATGACGATGAAGAAGACGAAGAAGACGACGAAGACGATGAAGACGATGAAGAAGACGAAGAATAG

Protein sequence:

>DPOGS207109-PA
MAALGALLVLFAVSWVGAYPSPQLADFPENARGGNTRIVSGWEAVNGQIPYQLSLRMVDASGSVFGCGATLIHNEWAMTAAHCTARRVTIVIRVGGVELSRPSLILESTEYYNHPLYIESLAGVVQPNDIGLIKLNKYVQYNDLIQPIRIQRDADKNKDYSEVRLQASGWGRTWTNGASPQILNWVYLLGVSNNFCRSLFTNIVVDSTICANAYNVSSQSTCQGDSGGPLVVVDEDGQLTQVGVSSFVSSSGCHTPLPAGFIRPGHYHSWFSEITGIDFDWDFVQPTVPEPSTTEEVTTSVTEDLTTEPAEDVTTGAPEDLTTEPAEDVTTGAPEDLTTEPSEEVTTGAPEDLTTEGAEEITTEASTAAPEEEEEQEEDKDEDEDDEDEDEDEEDESGSEEDSEDDEDDEDDDEEDEEDDEDDEDDEEDEE-