Monarch geneset OGS2.0

DPOGS201125
TranscriptDPOGS201125-TA1455 bp
ProteinDPOGS201125-PA484 aa
Genomic positionDPSCF300137 + 350618-362303
RNAseq coverage47x (Rank: top 71%)
Annotation
HeliconiusHMEL0219732e-6163.82% 
BombyxBGIBMGA013383-TA4e-3636.29% 
DrosophilaTry29F-PC4e-3941.20% 
EBI UniRef50UniRef50_Q16ID27e-4341.81%Trypsin (Fragment) n=1 Tax=Aedes aegypti RepID=Q16ID2_AEDAE
NCBI RefSeqXP_001846713.16e-4742.47%trypsin 7 [Culex quinquefasciatus]
NCBI nr blastpgi|1700377391e-4542.47%trypsin 7 [Culex quinquefasciatus]
NCBI nr blastxgi|1700377392e-4542.47%trypsin 7 [Culex quinquefasciatus]
Group
Gene OntologyGO:00038241.3e-77catalytic activity
GO:00042524.2e-72serine-type endopeptidase activity
GO:00065084.2e-72proteolysis
KEGG pathway 
InterPro domain[100-346] IPR0090031.3e-77Peptidase cysteine/serine, trypsin-like
[110-338] IPR0012544.2e-72Peptidase S1/S6, chymotrypsin/Hap
[137-152] IPR0013141.6e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL34382 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201125-TA
ATGAAAAATAACATTCTCAATTTAATATCGTGGGTGGTGTTGACGAAGCTCGTTCTAAGTTTACACGTAGTCGAACTCGAAGACGATCGTTATCCTCGGAGGGCTCAACTGAGGACAGCAAACGACTATGATCCAGAACGTGGTGATGAAATTACCGATGATGATGAACATCACGAAATAGTCAGCATGTTGGAGGAAAGATATAAATTTGTTAACGACAACACGTCCACCAGATTCAAAATTGCAAATGATGAAAGGCATCCAGTGTTCCCAAAGTCCAGAACCGAGCGGAAACTTGCGTGGGCGTTTAGAGCATATGCCGTGAGAAGAATCGTCGGTGGGATGGAGACCAGCATCAGCATGTACCCTTACAACGTGGCCATCTCCAGAAACGGAAAACATTGGTGCGGGGGGAGCATCATCGATGAACAGTGGGTTCTGACAGCTGGCCACTGCTTTGAATCAGCTCATGATGGGGACAAGAAGAAACTGCTGCCGTTTATTGTAAGAGCTGGTAGTTCTTTTCACAACCGTGGAGGCTATCAAGCCAGAGTTAATAGGGTTTTCTTTCCCGAAAAGTATGCCCCGGGCAATGCAGACTTCGACTTTTCCCTTCTCCGTTTGGATAGGCCTATGCCCATCGGTAGAAATATAGCGGTATTGAACCTGCCTGCCAAGGAATACCTCGTGCAGACGGACGACCTGTTGATTGTAACTGGATGGGGCAGCACACACGAGTCCGGCTTCGACCACATTCCAGAACGCCTACGTTTTGTGCCAGTGCCGGTGATGCGTCTCGAGCAGTGCCAGACGGCATACCGTTTCTATATCACGCCGAGAATGATGTGTGCCGGATATGCCACTGGAGGAAAGGACGCCTGTAATCATGATTCTGGTGGCCCAGCTGTTAGAGACGGAGTCTTGATCGGCATCGTATCCTTCGGCGGAAAGAAGTGCGGGGACTCTCGATCTCCAGGTGTATACAGCAGGGTGACTGAGATCACGGACTGGGTTGAAGCGACAATCACTGACAACGAGGCCATCGATGTACCGGAACTGAAAGCGAAAATAGAAAAGGCGCGGCTTAGAGAGAAGGAGTTACAGAAGTTCAAGGCCAGAGTGGAAGAAAAGAAGAACAAGATAAAGGATTGGCTCCGTGAGACGCTGAAGTCACCAACTTTCCTTAAATTGGCTAAAAAGAAACTGATAGACGCTGGAGTTATCGGAAATAACATGAGAAGGTCTCATGCACAGCCGTACATAGACGATAACACTTTAGACGAAATAAACCTATCTCATCTCATCAATGAAAGAATATTGGAGGATACGGAGAGGAACGAGCCGGAAGTGCTGCTGAGGTCCCTGGCGATGCAGGAGATCATGAACGAGAGCAACGAACTACGGAGAACAGACGACGATGAAGTGGAAGAAATCATCGCCTATTTGAGTAAATAA

Protein sequence:

>DPOGS201125-PA
MKNNILNLISWVVLTKLVLSLHVVELEDDRYPRRAQLRTANDYDPERGDEITDDDEHHEIVSMLEERYKFVNDNTSTRFKIANDERHPVFPKSRTERKLAWAFRAYAVRRIVGGMETSISMYPYNVAISRNGKHWCGGSIIDEQWVLTAGHCFESAHDGDKKKLLPFIVRAGSSFHNRGGYQARVNRVFFPEKYAPGNADFDFSLLRLDRPMPIGRNIAVLNLPAKEYLVQTDDLLIVTGWGSTHESGFDHIPERLRFVPVPVMRLEQCQTAYRFYITPRMMCAGYATGGKDACNHDSGGPAVRDGVLIGIVSFGGKKCGDSRSPGVYSRVTEITDWVEATITDNEAIDVPELKAKIEKARLREKELQKFKARVEEKKNKIKDWLRETLKSPTFLKLAKKKLIDAGVIGNNMRRSHAQPYIDDNTLDEINLSHLINERILEDTERNEPEVLLRSLAMQEIMNESNELRRTDDDEVEEIIAYLSK-