Monarch geneset OGS2.0

DPOGS205231
Transcript: DPOGS205231-TA  1557 bp
Protein: DPOGS205231-PA  518 aa
Genomic position: DPSCF300265 + 156742-184233
RNAseq coverage: 384x (Rank: top 31%)
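The figures in this summary can be cross-checked with a little arithmetic. The sketch below is only an illustration: `parse_position` is a hypothetical helper, the "scaffold strand start-end" layout is inferred from this one record, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into its parts.

    The layout is an assumption based on this record, not a documented format.
    """
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("DPSCF300265 + 156742-184233")

# Genomic span, assuming 1-based inclusive coordinates.
genomic_span = end - start + 1

# Values copied from the record above.
transcript_bp = 1557
protein_aa = 518

# A complete CDS should hold one codon per residue plus one stop codon.
cds_ok = transcript_bp == 3 * (protein_aa + 1)

print(genomic_span)  # 27492
print(cds_ok)        # True
```

Under these assumptions the locus spans far more sequence than the 1557 bp transcript, which would be consistent with the gene containing introns.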
Annotation
Heliconius:      HMEL013466        9e-83  43.94%
Bombyx:          BGIBMGA014404-TA  4e-71  37.13%
Drosophila:      CG5390-PA         9e-61  37.57%
EBI UniRef50:    UniRef50_B7SVM2   2e-73  35.88%  Serine proteinase-like protein 1 n=5 Tax=Obtectomera RepID=B7SVM2_HELAM
NCBI RefSeq:     NP_001037053.1    1e-73  38.77%  clip domain serine protease 11 [Bombyx mori]
NCBI nr blastp:  gi|270298184      5e-73  38.28%  masquerade-like serine proteinase [Pieris rapae]
NCBI nr blastx:  gi|208972549      1e-72  35.96%  serine proteinase-like protein 1 [Helicoverpa armigera]
Group
Gene Ontology:  GO:0003824  1.1e-63  catalytic activity
                GO:0004252  5e-36    serine-type endopeptidase activity
                GO:0006508  5e-36    proteolysis
KEGG pathway 
InterPro domain:  [238-507]  IPR009003  1.1e-63  Peptidase cysteine/serine, trypsin-like
                  [262-502]  IPR001254  5e-36    Peptidase S1/S6, chymotrypsin/Hap
                  [291-306]  IPR001314  7.5e-07  Peptidase S1A, chymotrypsin-type
Orthology group: MCL23320  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205231-TA
ATGAAGAGAAACGAACAGCATGAAATGCGAGAAGACGAAACAGAAGAAAATCAAGACGAGGAAAATGATGCGGGACCATCTCGCGATGAAGCACTTCATGCACTGGAAACAGCTCTCAAGTGGTTCGAAAAGCAAACAGAGTGTTACTGTGAGCTTACTACAGCTGAAACGAATTCTCGATATAGCAGCAACGAAGAAAAAGAGTTGCTTACATATATAACATTAGTTTTGCTGAGCATCGGGGTATTTTGTCAAAGAGAGCCAAAGAAACAGGCGAACGATGGTACGATAAATTTAGATCCAGTCAACTACCAAGATAACGACTCGGAGGAAGATTCAGTTGTTGATTGGGTAAACAAGATCATATCTGAATCAAAGATAAACGTAACGAACAGAGAAGTAACAAATCTAAATTCAGAAAGCGTTACGAATATAAATTGCACCGCGACTGACAACAGACCTGGGACTTGTGTGTTGTACTATCAATGCGACGAAGACAGTAACACTATTATAGATGACGGAGCGTCCATAGTTAATTTTAGAACCGAGGCGTCCTGTCCTCATTATCTCAAGGTCTGCTGTGCGATGGATAAAATTAAATCAGACGATAAAGCTAACACTATCCGGAGAGGAAGTAATAACGAGTCTGCCCAGGAGTTGGATTCGAAGGACGACTCCGACAGCAGCAGTGCTGTAGTTGACTTGGGGAAGTGTGGTTGGAACAATCCAGCGCTTTATGTGTTTCAACCCAAAAGAAACAATTCCGAGGCAGAGCCGTTCTACGCAAATTATGGGGAATTCCCCTGGATGATCGCCGTCATCAGGAGGTCCAATGATACGGATCTGTGGGCAAGAAAAAATTACGTCGGAGGTGGAACTCTCATTCATCCGGGGGTAGTTGTCACTGCGGCTCACATAGTTCGGAATAAAAAGCCCGATGACCTGAAATGTCGCGCTGGTGAATGGGACACTGAAGTGACCTTCGAGATATTTCCACACCAAGAGAGGAATGTGAAGAATATTATCATCCACCCAGATTACTACAGGCCATCTCTATACAACGACATGGGACTCCTGCTGTTAGAGGAACCGTTTGAACTGCTCCTCGCGCCACACATAGGTCTGGCTTGCGTTGGGAACAGCCTGCCGGCTCCCGGCACCGTCTGCTATGGAATGGGCTGGGGCAGGAAAATCGACAAGAAGTACGCAATTATTCTTAAAAAAATGCGGCTTCCGTTAGTGGAAAGAGAGGAGTGCCAGGCCCTCCTGCGGAGTATACGTTTGGGGCCATTTTTCCAACTGCACGAGTCCCTGACGTGTGCTGGCGGGGAAGATCGCATGGACATGTGCAAAGGAGACGGCGGGTCCTCATTAGTATGCCCTATTCAGACTAATGGTAGAAATGTCAAATACGCCATGTTCGGGATGGTGGCGTACGGCCTGGGATGTCACTCGAGGAAAGTGCCCGGCGTGTTCGTCAATGTGCCAAACCTGAAGTCGTGGCTGGACAGCACCATGGAGGCCGAAGGCTATTCTAAAGACACATACACTTACTAA

Protein sequence:

>DPOGS205231-PA
MKRNEQHEMREDETEENQDEENDAGPSRDEALHALETALKWFEKQTECYCELTTAETNSRYSSNEEKELLTYITLVLLSIGVFCQREPKKQANDGTINLDPVNYQDNDSEEDSVVDWVNKIISESKINVTNREVTNLNSESVTNINCTATDNRPGTCVLYYQCDEDSNTIIDDGASIVNFRTEASCPHYLKVCCAMDKIKSDDKANTIRRGSNNESAQELDSKDDSDSSSAVVDLGKCGWNNPALYVFQPKRNNSEAEPFYANYGEFPWMIAVIRRSNDTDLWARKNYVGGGTLIHPGVVVTAAHIVRNKKPDDLKCRAGEWDTEVTFEIFPHQERNVKNIIIHPDYYRPSLYNDMGLLLLEEPFELLLAPHIGLACVGNSLPAPGTVCYGMGWGRKIDKKYAIILKKMRLPLVEREECQALLRSIRLGPFFQLHESLTCAGGEDRMDMCKGDGGSSLVCPIQTNGRNVKYAMFGMVAYGLGCHSRKVPGVFVNVPNLKSWLDSTMEAEGYSKDTYTY-
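
The protein record ends in "-", which appears to stand for the stop codon at the end of the transcript. A minimal sketch of checking that the two sequences agree is given below. It assumes the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any tool used to build this gene set. Only the first and last few codons of DPOGS205231-TA are used here; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stops as stop_symbol.

    The default '-' follows the convention seen in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)

# First eight codons of the transcript.
print(translate("ATGAAGAGAAACGAACAGCATGAA"))  # MKRNEQHE
# Last five codons, ending in the TAA stop.
print(translate("ACATACACTTACTAA"))           # TYTY-
```

Both fragments match the start and end of DPOGS205231-PA as listed.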