Monarch geneset OGS2.0

DPOGS214043
Transcript: DPOGS214043-TA (846 bp)
Protein: DPOGS214043-PA (281 aa)
Genomic position: DPSCF300238 + 289427-294004
RNAseq coverage: 251x (Rank: top 42%)
Annotation
Heliconius       HMEL005074        1e-90  68.40%
Bombyx           BGIBMGA014022-TA  3e-30  37.85%
Drosophila       CG6592-PA         5e-34  35.29%
EBI UniRef50     UniRef50_C9W8I3   1e-52  56.21%  Serine protease 46 (Fragment) n=1 Tax=Mamestra configurata RepID=C9W8I3_9NEOP
NCBI RefSeq      XP_002046826.1    1e-35  41.28%  GJ12275 [Drosophila virilis]
NCBI nr blastp   gi|237700825      5e-52  56.21%  serine protease 46 [Mamestra configurata]
NCBI nr blastx   gi|237700825      8e-52  56.97%  serine protease 46 [Mamestra configurata]
Group
Gene Ontology
GO:0003824  9.4e-63  catalytic activity
GO:0004252  5.8e-46  serine-type endopeptidase activity
GO:0006508  5.8e-46  proteolysis
KEGG pathway 
InterPro domain
[64-280]  IPR009003  9.4e-63  Peptidase cysteine/serine, trypsin-like
[45-272]  IPR001254  5.8e-46  Peptidase S1/S6, chymotrypsin/Hap
[66-81]   IPR001314  4.4e-15  Peptidase S1A, chymotrypsin-type
Orthology group: MCL24893 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214043-TA
ATGCTCTTAACCGACTCGGACGTCGCTGTCGGCCATGACCGCTTATCATTAATGGTTCACACTGAATGTCATTCTACGTTCGTTACGCTGAGAGCCGACAGCGATCTCCGAGTACATAATATCCTTATGATAAAAATAATTCAAGATGCAGCACTTTTACTTCGGGTTGGTGAGGATGGCGAACTAGGCTTCTGCGGCGGTTCGCTGGTACACTTACAGTGGGTTTTAACGGCAGCCCATTGCTGTTATCACGGGCCCCAGGAGGTTACGAATGTAGAGGTGATATTGGGAGCACACTCTCTGTACGATCGTTACGAGAATGGTCGTCGGTTGATTACAGTTAATCAGATAGTGGTACATCCCGAGTGGGATCCAGATACATTCGCTAACGATCTCGCTCTACTAAAGTTGACTAACGCCGTACAACCATCAGATTCAATAGGGATTGTTCGTCTTCCGTATCTAAGGACAGTGTCTGCAAATTTCGCCGGTCAGGCGGCCACAGCTTCTGGCTGGGGTATAGCTGCCAATGGTGTCACCTTCGTCTCTCCAACTCTTCGCATGAAGATGTCCACGGTGACGACGAACACCTACTGCAGGTCACTCTTCAGAACCAACTTGCCAGATAACATTATATGCACTTTCAGTACTCAGGCTAGCACTTGCAAGGGCGATAATGGTGGACCGTTGACTGTATTTACAAACGACACTGAAGAAATCGTACTGATCGGCGTAACTTCGTTTATAGAAGCTCACGGCTGTAATACCAACTTACCGAACGTGTTCACACGAGTCCAACGTTATCTGAGTTGGATCAGTGAAGTCACTGGAATCACGCTAGATTGA

Protein sequence:

>DPOGS214043-PA
MLLTDSDVAVGHDRLSLMVHTECHSTFVTLRADSDLRVHNILMIKIIQDAALLLRVGEDGELGFCGGSLVHLQWVLTAAHCCYHGPQEVTNVEVILGAHSLYDRYENGRRLITVNQIVVHPEWDPDTFANDLALLKLTNAVQPSDSIGIVRLPYLRTVSANFAGQAATASGWGIAANGVTFVSPTLRMKMSTVTTNTYCRSLFRTNLPDNIICTFSTQASTCKGDNGGPLTVFTNDTEEIVLIGVTSFIEAHGCNTNLPNVFTRVQRYLSWISEVTGITLD-
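
The transcript and protein lengths in this record can be cross-checked with simple arithmetic: a coding sequence that ends in a stop codon should be three nucleotides per residue plus three for the stop. The following is a minimal Python sketch of that check, assuming the trailing "-" in the protein sequence marks the stop codon. The function names are hypothetical and are not part of any geneset tooling.

```python
def expected_cds_length(protein_length: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_length residues, plus an optional stop codon."""
    return 3 * (protein_length + (1 if has_stop else 0))


def read_fasta_sequence(text: str) -> str:
    """Return the sequence of a single-record FASTA string, dropping the header and whitespace."""
    lines = [line.strip() for line in text.strip().splitlines()]
    return "".join(line for line in lines if line and not line.startswith(">"))


def residue_count(protein: str) -> int:
    """Count residues, ignoring a trailing '-' or '*' stop marker (an assumed convention)."""
    return len(protein.rstrip("-*"))


# The record lists a 281 aa protein and an 846 bp transcript.
print(expected_cds_length(281))  # 3 * (281 + 1) = 846
```

To apply it to the sequences above, pass each FASTA record through `read_fasta_sequence` and compare `len(nucleotides)` with `expected_cds_length(residue_count(protein))`.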