Monarch geneset OGS2.0

DPOGS214483
TranscriptDPOGS214483-TA828 bp
ProteinDPOGS214483-PA275 aa
Genomic positionDPSCF300122 - 176626-230243
RNAseq coverage255x (Rank: top 41%)
Annotation
HeliconiusHMEL0139252e-4949.46% 
BombyxBGIBMGA001320-TA8e-4946.56% 
DrosophilaCG32374-PA4e-2233.90% 
EBI UniRef50UniRef50_B6CME75e-4546.41%Trypsin n=3 Tax=Noctuidae RepID=B6CME7_HELAM
NCBI RefSeqXP_317829.42e-2231.98%AGAP011477-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2827212187e-4847.57%trypsin [Helicoverpa armigera]
NCBI nr blastxgi|2827212183e-4747.57%trypsin [Helicoverpa armigera]
Group
Gene OntologyGO:00038242.1e-41catalytic activity
GO:00042523.8e-27serine-type endopeptidase activity
GO:00065083.8e-27proteolysis
KEGG pathwaydpo:Dpse_GA218796e-19 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[70-270] IPR0090032.1e-41Peptidase cysteine/serine, trypsin-like
[101-266] IPR0012543.8e-27Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL26802 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214483-TA
ATGGTCCGTGCCGGAGTCTCCCTCGTCAAGCGTATGTCTGCATACAGGGTGTGGAGGTGGCAGAGGAAAAAGATTAGATATCTGAGCCTCAAACTGGATGGCAGATGGAGAATCGAGGGCCATTCCTCCCTGTTGGCCCCGAAACATGGAAATACTGCGGTTTCCTTCGGGAGACGTCTGCTCAATATCCCCGGGTCACAGAAGCGATGCCGGCGACTCCACGCTTGTATAGTGAGGCGTATGGCCCTCTACGGTGATTCCATATGGGGTGCCACTTTACCCAATTATCGTATTAGGGGCGGTTCCACTTATAGTATGAACGGCGGACAAGTGGTTTCCATTAGAGAAGTCGTTAAACACCCTGAGTTTGTTGAAACTCCTCGTAAAAACGACATCGCGGTTGTGATTCTTCAGGAAGTTTTCAGATTAAGTGGCAGTCTCAATATTATGTACCTCCCTCCTAAAAATATTGACATCCCAAATGGAATACCAGCTACTGTTGTTAGTTGGGGATTTGAATCAGAACAAGGCCCTATCCATAACTCTCTCATGGCAATCACCTTGACCACAGTTCCTTTGGAACAATGTCAGCAGATCTACGCTGACGATGCTGATATTAAAATTAATGAAGCCGTTATATGTGCTAACGCGGCGAACTCCGGCGTTTGTTCGGGGGACATGGGTGCTCCGCTTGTGTCCGGTGGTGTATTAATAGGAGTGGCCTCCAACCATAAGGGATGCGGTTCGCAGAACTATCCTGATGTCTTCACAAGAATCGACGCTTACGTAGATTGGATCATGGAAGTGGCTGTTGCGCCGTCAAGTTAA

Protein sequence:

>DPOGS214483-PA
MVRAGVSLVKRMSAYRVWRWQRKKIRYLSLKLDGRWRIEGHSSLLAPKHGNTAVSFGRRLLNIPGSQKRCRRLHACIVRRMALYGDSIWGATLPNYRIRGGSTYSMNGGQVVSIREVVKHPEFVETPRKNDIAVVILQEVFRLSGSLNIMYLPPKNIDIPNGIPATVVSWGFESEQGPIHNSLMAITLTTVPLEQCQQIYADDADIKINEAVICANAANSGVCSGDMGAPLVSGGVLIGVASNHKGCGSQNYPDVFTRIDAYVDWIMEVAVAPSS-