Monarch geneset OGS2.0

DPOGS212003
Transcript        DPOGS212003-TA  345 bp
Protein           DPOGS212003-PA  114 aa
Genomic position  DPSCF300136 - 272154-272498
RNAseq coverage   22x (Rank: top 79%)
Annotation
Heliconius      HMEL004599        4e-54  79.46%
Bombyx          BGIBMGA004518-TA  2e-47  69.72%
Drosophila      snk-PB            1e-20  43.93%
EBI UniRef50    UniRef50_D9HQ79   1e-51  79.46%  Seminal fluid protein HACP027 n=14 Tax=Heliconiini RepID=D9HQ79_9NEOP
NCBI RefSeq     NP_001153674.1    8e-46  68.81%  male reproductive organ serine protease 1 [Bombyx mori]
NCBI nr blastp  gi|358442748      3e-52  81.08%  seminal fluid protein HACP027 [Eueides isabella]
NCBI nr blastx  gi|358442748      3e-53  81.08%  seminal fluid protein HACP027 [Eueides isabella]
Group
Gene Ontology    GO:0003824  1.5e-30  catalytic activity
                 GO:0004252  3.1e-22  serine-type endopeptidase activity
                 GO:0006508  3.1e-22  proteolysis
KEGG pathway     bba:Bd2630  9e-16
                 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3) maps-> Neuroactive ligand-receptor interaction
InterPro domain  [1-108] IPR009003  1.5e-30  Peptidase cysteine/serine, trypsin-like
                 [2-103] IPR001254  3.1e-22  Peptidase S1/S6, chymotrypsin/Hap
Orthology group  MCL25029  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212003-TA
ATGAAAGTGGACGTTGATGTCATAGATAGTAAAATTTGCAATCGTTCTATGAAATTTTTGGTAAAGAGAAAGATTTTGGAATACGGTATAACTGACTCTCAACTCTGCGCCGGGGACTATGAACATGGTGGCAAGGATACATGCCAGGGTGATTCCGGAGGACCCTTACAGGTTATGGATGAAAGGGTGGATTGCGTGAAAACCTTTCCTTTGCATAAAATAGTGGGCATTACTTCGTTTGGCAGGGACTGTGGTAGGAAGATGTCTCCGGGGGTGTACACGAGGACCTCTAAATATATAGACTGGATAGAAAACGTTGTCTGGCCGGATGATGTTAAAACGTAA

Protein sequence:

>DPOGS212003-PA
MKVDVDVIDSKICNRSMKFLVKRKILEYGITDSQLCAGDYEHGGKDTCQGDSGGPLQVMDERVDCVKTFPLHKIVGITSFGRDCGRKMSPGVYTRTSKYIDWIENVVWPDDVKT-
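
The lengths and coordinates in this record can be cross-checked against the two sequences. The following is a minimal sketch, not part of the geneset itself. It assumes the standard genetic code (NCBI translation table 1), inclusive genomic coordinates, and that the trailing "-" in the protein record marks the stop codon. The names TRANSCRIPT, RECORD_PROTEIN, build_codon_table, and translate are hypothetical helpers introduced only for illustration.

```python
# Sequences copied verbatim from the DPOGS212003 record.
TRANSCRIPT = "ATGAAAGTGGACGTTGATGTCATAGATAGTAAAATTTGCAATCGTTCTATGAAATTTTTGGTAAAGAGAAAGATTTTGGAATACGGTATAACTGACTCTCAACTCTGCGCCGGGGACTATGAACATGGTGGCAAGGATACATGCCAGGGTGATTCCGGAGGACCCTTACAGGTTATGGATGAAAGGGTGGATTGCGTGAAAACCTTTCCTTTGCATAAAATAGTGGGCATTACTTCGTTTGGCAGGGACTGTGGTAGGAAGATGTCTCCGGGGGTGTACACGAGGACCTCTAAATATATAGACTGGATAGAAAACGTTGTCTGGCCGGATGATGTTAAAACGTAA"
RECORD_PROTEIN = "MKVDVDVIDSKICNRSMKFLVKRKILEYGITDSQLCAGDYEHGGKDTCQGDSGGPLQVMDERVDCVKTFPLHKIVGITSFGRDCGRKMSPGVYTRTSKYIDWIENVVWPDDVKT-"

# Genomic coordinates from the record (assumed to be 1-based and inclusive).
GENOMIC_START, GENOMIC_END = 272154, 272498


def build_codon_table():
    """Return the standard genetic code as a codon -> amino acid dict."""
    bases = "TCAG"
    # Amino acids in the conventional TCAG ordering; "*" marks a stop codon.
    amino_acids = (
        "FFLLSSSSYY**CC*W"
        "LLLLPPPPHHQQRRRR"
        "IIIMTTTTNNKKSSRR"
        "VVVVAAAADDEEGGGG"
    )
    codons = [a + b + c for a in bases for b in bases for c in bases]
    return dict(zip(codons, amino_acids))


def translate(nucleotides):
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    table = build_codon_table()
    usable = len(nucleotides) - len(nucleotides) % 3  # ignore a trailing partial codon
    return "".join(table[nucleotides[i:i + 3]] for i in range(0, usable, 3))


translated = translate(TRANSCRIPT)

# Length of the genomic span, counting both end coordinates.
span_length = GENOMIC_END - GENOMIC_START + 1

print("transcript length (bp):", len(TRANSCRIPT))
print("genomic span (bp):", span_length)
print("protein length (aa):", len(translated.rstrip("*")))
print("matches record:", translated.rstrip("*") == RECORD_PROTEIN.rstrip("-"))
```

If the genomic span equals the transcript length, the gene model would be a single exon with no introns; a mismatch in any check would point to a copy error or a different coordinate convention.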