Monarch geneset OGS2.0

DPOGS210395
TranscriptDPOGS210395-TA996 bp
ProteinDPOGS210395-PA331 aa
Genomic positionDPSCF300291 + 812-7897
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0130033e-6649.26% 
BombyxBGIBMGA008281-TA9e-8149.82% 
DrosophilaCG5255-PA2e-2730.99% 
EBI UniRef50UniRef50_A5CG731e-8051.61%Chymotrypsinogen-like protein 3 n=9 Tax=Obtectomera RepID=A5CG73_MANSE
NCBI RefSeqNP_001166054.14e-3434.20%serine protease 120 [Nasonia vitripennis]
NCBI nr blastpgi|3044436153e-8856.63%serine protease 33 [Mamestra configurata]
NCBI nr blastxgi|1717408872e-8657.71%trypsin [Helicoverpa armigera]
Group
Gene OntologyGO:00038243e-54catalytic activity
GO:00042523.6e-42serine-type endopeptidase activity
GO:00065083.6e-42proteolysis
KEGG pathway 
InterPro domain[82-331] IPR0090033e-54Peptidase cysteine/serine, trypsin-like
[92-326] IPR0012543.6e-42Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL16203 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210395-TA
ATGCAGAGCACGCATCAGTATAACAGTATCGGCATGACTTCAGATGATGAAGCATCAATTCTTGAACCTGGCCCATCCACTCTACGAGACCTGCAACGGCATTCATATAAATGGAAACTTGAATGGGAAAATACAGTTGGTTTAAGAAAAATAGGAAAAATGGTGTTCAAAGCTGCGTTTGTATTTTTAGGGCTCCTCATAGGAAGTTTGGCTCTACCTCTCAATGAGGCGGAAGACATGTCCGTCTTCTTCGACCACCCGGCAATAACACCATACATCGTCGGAGGATTGACTGCTGGCAAAGTTCCTCATATGGTGGCTCTGACCACCGGTGTCTTCACTAGATCCTTCACCTGCGGAGGTTCTCTGGTGACCAAGAAACACGTCCTCACTGCAGCACATTGCATTGAAGCTGTGTATAGTCGAGGATCTCTTTTGAGTTCTCTCCGTGGAATTGTCGGCACCAATCGCTGGAATTTTGGGGGAGTCCAACAACAATTCGCCTCAAACATTACGCACCCTAACTACGTCGGTTCCATCATCAAAAACGACATCGGTTTTCTGGTAACAGACGCCGAAGTATCTCTGAACGACAACATACAATTGGTACCAATCTCCTACGATTTCATTGAAGGTGAAGTAGCTGCTGTTATCCATGGATGGGGCAGAATCCGGACTGGTGGATCATTGTCACCAAATCTGTTGGAGCTCAAAACAAAGGTCATCGACGGCGAGCGTTGCGTCTCTGACGTGGCTCGTAGAAGTTCGGAAATTGGTATGAGGGTTCCACCAGTTCAACCAGATCTCGAAGTCTGTACTTTCCTAGCACTCAACTTTGGAAACTGTCATGGTGACTCCGGCAGTGCCCTCCTTCGTCAAAGTGACGGCCAGCAAATCGGTGTCGTGTCTTGGGGTCTTCCTTGTGCTCGCGGCGCACCCGATATGTACGCCAGAGTTAGCGCCTACCGCGACTGGATCGAACAAAGCCTTCAATAA

Protein sequence:

>DPOGS210395-PA
MQSTHQYNSIGMTSDDEASILEPGPSTLRDLQRHSYKWKLEWENTVGLRKIGKMVFKAAFVFLGLLIGSLALPLNEAEDMSVFFDHPAITPYIVGGLTAGKVPHMVALTTGVFTRSFTCGGSLVTKKHVLTAAHCIEAVYSRGSLLSSLRGIVGTNRWNFGGVQQQFASNITHPNYVGSIIKNDIGFLVTDAEVSLNDNIQLVPISYDFIEGEVAAVIHGWGRIRTGGSLSPNLLELKTKVIDGERCVSDVARRSSEIGMRVPPVQPDLEVCTFLALNFGNCHGDSGSALLRQSDGQQIGVVSWGLPCARGAPDMYARVSAYRDWIEQSLQ-