Monarch geneset OGS2.0

DPOGS215673
TranscriptDPOGS215673-TA774 bp
ProteinDPOGS215673-PA257 aa
Genomic positionDPSCF300041 - 1163943-1165461
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0163573e-5340.62% 
BombyxBGIBMGA003580-TA9e-4841.41% 
DrosophilaepsilonTry-PA4e-2932.62% 
EBI UniRef50UniRef50_P350423e-4839.06%Trypsin CFT-1 n=68 Tax=Ditrysia RepID=TRYP_CHOFU
NCBI RefSeqNP_001040350.14e-3337.02%trypsin-like protease [Bombyx mori]
NCBI nr blastpgi|150725484e-4843.22%trypsin-like protein [Galleria mellonella]
NCBI nr blastxgi|4649592e-5341.80%trypsin [Manduca sexta]
Group
Gene OntologyGO:00038241.2e-65catalytic activity
GO:00042523.3e-47serine-type endopeptidase activity
GO:00065083.3e-47proteolysis
KEGG pathwaydpo:Dpse_GA115982e-28 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[13-257] IPR0090031.2e-65Peptidase cysteine/serine, trypsin-like
[23-253] IPR0012543.3e-47Peptidase S1/S6, chymotrypsin/Hap
[56-71] IPR0013144.1e-07Peptidase S1A, chymotrypsin-type
Orthology groupMCL27845 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215673-TA
ATGGCTGCGGCTCTAATACTCTTGGCTGGTGTCCTCTCTGCAGTAGCGGCGGTGCCAGCTCCGGAGAGAATCGCTGGTGGTGCAATTACGACGATTGCATCATATCCTTATGCTGCTAGTATCGCGTACAATAAACTTGGATTTGGTAACTTCATTTTCTCCTGTGGAGGATCTATAGTCAGCAGCAGAGCCATTCTCACGGCAGCTTTTTGTGTGTATAACGACCAGATCAACCGTTTCCGAGTGCGCGTGGGTTCGTTGACAACAATCAGTGCCGGAAGCGTTCGCGAAATCGATTATATCGCTTCTCATCCAAATTACAATCCAGTTACTAATGAGCATGACATCGCGCTAGTTCATGTATTCCCACACTTGCTATTCTCTACAAATATCGCATTGGGATCTTTTGCTGATGGTAGCTATATACCTCGTTACAATCAATCTGTTTGGGCTATTGGATGGGGACAGATGAATCATGGTGGAGCTCTATCCGACAGCCTTCGACGTGTTCAACTGTGGTTAGTTGATAATAATGACTGCAGAAATCGTTACTATGAGCTGGAAGGTCCCAGGGTCACTCCTAATATGATTTGTGCTGTTGGTTTTGATCCTTCTGGAAGGGGACAGTGTCTGGGCGATAATGGCAGCCCCATTATTGATGATGGACTCATTATTGGTATATATTCATGGAGTCATCAGTGCGCAACAGTTCGATACCCTGGTGTGAATACTTATATACCTAAATATTCAGATTGGATTAAATCTCAGTACTAA

Protein sequence:

>DPOGS215673-PA
MAAALILLAGVLSAVAAVPAPERIAGGAITTIASYPYAASIAYNKLGFGNFIFSCGGSIVSSRAILTAAFCVYNDQINRFRVRVGSLTTISAGSVREIDYIASHPNYNPVTNEHDIALVHVFPHLLFSTNIALGSFADGSYIPRYNQSVWAIGWGQMNHGGALSDSLRRVQLWLVDNNDCRNRYYELEGPRVTPNMICAVGFDPSGRGQCLGDNGSPIIDDGLIIGIYSWSHQCATVRYPGVNTYIPKYSDWIKSQY-