Monarch geneset OGS2.0

DPOGS213841
Transcript: DPOGS213841-TA, 1599 bp
Protein: DPOGS213841-PA, 532 aa
Genomic position: DPSCF300183 + 245550-249159
RNAseq coverage: 119x (Rank: top 58%)
Annotation
Heliconius: HMEL005059; E-value 0.0; 79.67%
Bombyx: BGIBMGA011594-TA; E-value 0.0; 75.89%
Drosophila: Sb-PA; E-value 1e-127; 76.36%
EBI UniRef50: UniRef50_F4WUW0; E-value 4e-136; 80.43%; Serine proteinase stubble n=6 Tax=Neoptera RepID=F4WUW0_ACREC
NCBI RefSeq: XP_394101.2; E-value 4e-138; 82.16%; PREDICTED: similar to Stubble CG4316-PA [Apis mellifera]
NCBI nr blastp: gi|183979380; E-value 4e-161; 88.99%; hypothetical protein [Papilio xuthus]
NCBI nr blastx: gi|183979380; E-value 3e-172; 89.94%; hypothetical protein [Papilio xuthus]
Group
Gene Ontology: GO:0003824; E-value 9.2e-98; catalytic activity
               GO:0004252; E-value 7e-91; serine-type endopeptidase activity
               GO:0006508; E-value 7e-91; proteolysis
KEGG pathway 
InterPro domain: [275-531] IPR009003; E-value 9.2e-98; Peptidase cysteine/serine, trypsin-like
                 [287-526] IPR001254; E-value 7e-91; Peptidase S1/S6, chymotrypsin/Hap
                 [320-335] IPR001314; E-value 1.2e-14; Peptidase S1A, chymotrypsin-type
Orthology group: MCL14740, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213841-TA
ATGTCCCGCAAGTCGTGTACGGTCGGCGGCTCGAGAGGGGCTTGTATGTGGGTGCAGGAATGCAACAGAGTTGGAGGAATACACGCGGGAGTCTGTGTAGATGGCTTCATGTTCGGTTCATGCTGCCGAATGCCCGACCGACCTATAACAGAGACGCCGATTCCAACTACAGTTACCGATCGTCCGTCAACAACGCCATCATACACCACGACCAGCGTAACAACTATTACTCCGAGTACTACTCAAACTATACGGCCCTCTTCACAAACGGCACGGCCGTCGTTTATGACCAAACCCATAGATGGCGCTCCCACATCATATAGCTACCGCCCGCCTGAAATAAATCTGCCCTCTGTTAGCAGTTTAGATGCAAATAGCGAGCAAAACAGTGATATAGTTCATAAACTAACTTATAGCAGTGTAAATAAGTACCAAAATGTACATAGGCCAAGCAATGAAATGGAAGCTAGTCCTCACAACAAAATTTCGTCTAGTCTTAGTTTAATGGGCGCGCGCCCTATGGCTGTTTCCGAACAACACTCAGAGAATAGCATTGCTTCAGCTCATATGATGTCGCGCCCAAACAACTTGAACACAATACATTGGCAGGCTACAACTGAGCCAATATTTGTAACAAAACCGAGACCGAATTGGGAGAAACCAGTAGGGAAACCAAAACCCACAAAGAAGTTTACAACGACAACCAGCAAGCCGCATAAGAATTACATAAAACCAAAGGATCCAGCTTTAAACATGATTAACAAAACCGACGAGTCGACGCCCGCTTCCATACAAACAACAGCCGCAACGAATAGTGTCGAATGTGGCACGAGAGCGATGTGGCCACGTCCAGAAACGAGGATAATGGGTGGCAAAGACTCCAGTTTCGGTCGCTGGCCATGGCAGGTGTCTGTTAGACGGAATTCCTTCTTCGGCTTCTCATCGACTCATAGATGTGGAGGTGCTATCATCAACGAGGGGTGGATAGCGACCGCTGGTCATTGTGTAGACGATCTTCTTACTTCGCAAATACGGATAAGAGTCGGCGAATACGATTTCTCAACAGTGTCTGAACAATATCCGTATTCCGAGAGAGGTGTGGCTAGAAAGGCGGTCCATCCGAAATACAATTTTTACACTTACGAATATGATTTGGCGTTGGTGAAGCTGGATTCGCCGGTCCAGTTCGCGCCTCACATATCCCCGATATGTCTTCCAGCGAGCGATGACCTTCTGGTCGGTGAAAATGCCACCGTCACGGGTTGGGGAAGATTATCTGAGGGTGGAGTTTTGCCTTCCGTTTTGCAAGAGGTGCAAGTACCAATAGTGTCGAATGATAGATGTAAGTCAATGTTTCTACAAGCCGGAAGACATGAGTTCATTCCGGACATTTTCCTTTGTGCGGGGCACGAGCGAGGGGGCCACGACTCTTGTCAGGGGGACTCGGGGGGACCTTTACAGGTCAAAGGAAAAGATCAAAAATATTTCCTAGCGGGCATCATAAGCTGGGGTATCGGTTGTGGGGAGGCGAACTTACCCGGCGTTTGCACAAGAATATCTAAGTTCGTCCCGTGGATATTGCAAACTGTTAACTCATAA

Protein sequence:

>DPOGS213841-PA
MSRKSCTVGGSRGACMWVQECNRVGGIHAGVCVDGFMFGSCCRMPDRPITETPIPTTVTDRPSTTPSYTTTSVTTITPSTTQTIRPSSQTARPSFMTKPIDGAPTSYSYRPPEINLPSVSSLDANSEQNSDIVHKLTYSSVNKYQNVHRPSNEMEASPHNKISSSLSLMGARPMAVSEQHSENSIASAHMMSRPNNLNTIHWQATTEPIFVTKPRPNWEKPVGKPKPTKKFTTTTSKPHKNYIKPKDPALNMINKTDESTPASIQTTAATNSVECGTRAMWPRPETRIMGGKDSSFGRWPWQVSVRRNSFFGFSSTHRCGGAIINEGWIATAGHCVDDLLTSQIRIRVGEYDFSTVSEQYPYSERGVARKAVHPKYNFYTYEYDLALVKLDSPVQFAPHISPICLPASDDLLVGENATVTGWGRLSEGGVLPSVLQEVQVPIVSNDRCKSMFLQAGRHEFIPDIFLCAGHERGGHDSCQGDSGGPLQVKGKDQKYFLAGIISWGIGCGEANLPGVCTRISKFVPWILQTVNS-