Monarch geneset OGS2.0

DPOGS208315
TranscriptDPOGS208315-TA729 bp
ProteinDPOGS208315-PA242 aa
Genomic positionDPSCF300293 - 51870-53559
RNAseq coverage341x (Rank: top 34%)
Annotation
HeliconiusHMEL0060001e-6652.08% 
BombyxBGIBMGA013383-TA2e-1729.31% 
DrosophilaJon74E-PA5e-1929.75% 
EBI UniRef50UniRef50_B4J3I37e-2031.85%GH15354 n=1 Tax=Drosophila grimshawi RepID=B4J3I3_DROGR
NCBI RefSeqXP_001663439.12e-2228.68%serine-type enodpeptidase, putative [Aedes aegypti]
NCBI nr blastpgi|1571352293e-2128.68%serine-type enodpeptidase, putative [Aedes aegypti]
NCBI nr blastxgi|99652622e-2029.10%late trypsin [Aedes albopictus]
Group
Gene OntologyGO:00038245.3e-48catalytic activity
GO:00042526.2e-32serine-type endopeptidase activity
GO:00065086.2e-32proteolysis
KEGG pathwaydpo:Dpse_GA150512e-17 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[18-241] IPR0090035.3e-48Peptidase cysteine/serine, trypsin-like
[31-236] IPR0012546.2e-32Peptidase S1/S6, chymotrypsin/Hap
[50-65] IPR0013146.8e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL34745 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208315-TA
ATGTCGGTTTTAATCGTTGTTAATCTTGTTTTGTCTGTAGTTTTTAAAGACCATTTCGCAGCGGCTCAAGACGTGGAGTCAGTTATTCTTTATGGTGATGATGCGAAGATAGAAGATTATCCTTTCTACGCCGGCCTTAGCAATTGTGGTGCTGCTGTACTTTCCCACATTTGGATATTGACGGCCGCTCATTGTGTAGAAAAAGTCAACGTGAGAGTACCATATGAAAGTATCCACATTCATCAGGGGTACCACAATCTAAAAGGAATGCCAGTTAATGATATCGCCTTAATACAACTTTCTCACCCTTTACAATTCACTAAAAAGATTCGACCTGTCAAATTGCCCAGTGGATTAAAATCGAACACAAGTCTTAGTCTTAGTTTCGTTGGTAGAGGAATTGATGAGACTGGAACTCTCTCAAAAAATATAAAGACAGTGGATCTTATCAGATTAAATACAAGAGATTGTATCAGATTAATACCGCCCGCGTTCTCAGAATACATTATGTTTTATAAAGTTTTGGAGTCCACAAACATTTGCATAAAAAGAGAAGGTGAAAGGCCAAGCATTTGCAAGGGTGACTCTGGTAGTCCTTTAGTATCTGGTGACACTATTATTGGTCTCGCATCGTTTATAGGTAACCTTGGTTGCAACAATGTTCGTTTGGGTTTCTTTGTAAATGTTGCAACTTTTGTGCCATGGATAAAATCAATCACTGGACTTTAA

Protein sequence:

>DPOGS208315-PA
MSVLIVVNLVLSVVFKDHFAAAQDVESVILYGDDAKIEDYPFYAGLSNCGAAVLSHIWILTAAHCVEKVNVRVPYESIHIHQGYHNLKGMPVNDIALIQLSHPLQFTKKIRPVKLPSGLKSNTSLSLSFVGRGIDETGTLSKNIKTVDLIRLNTRDCIRLIPPAFSEYIMFYKVLESTNICIKREGERPSICKGDSGSPLVSGDTIIGLASFIGNLGCNNVRLGFFVNVATFVPWIKSITGL-