Monarch geneset OGS2.0

DPOGS213461
Transcript: DPOGS213461-TA (1251 bp)
Protein: DPOGS213461-PA (416 aa)
Genomic position: DPSCF300100 - 454490-458444
RNAseq coverage: 133x (Rank: top 56%)
Annotation
Heliconius: HMEL016827; 5e-126; 58.45%
Bombyx: BGIBMGA004487-TA; 2e-114; 58.15%
Drosophila: CG13430-PB; 2e-18; 32.67%
EBI UniRef50: UniRef50_Q5BAR4; 8e-20; 27.30%; Serine protease similarity, trypsin family (Eurofung) n=1 Tax=Aspergillus nidulans FGSC A4 RepID=Q5BAR4_EMENI
NCBI RefSeq: XP_002087645.1; 1e-22; 26.23%; GE15163 [Drosophila yakuba]
NCBI nr blastp: gi|195470701; 2e-21; 26.23%; GE15163 [Drosophila yakuba]
NCBI nr blastx: gi|170038379; 5e-18; 34.12%; trypsin 2 [Culex quinquefasciatus]
Group
Gene Ontology: GO:0003824; 6e-40; catalytic activity
    GO:0004252; 3.3e-36; serine-type endopeptidase activity
    GO:0006508; 3.3e-36; proteolysis
KEGG pathway: ani:AN2366.2; 4e-20
    K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3) maps to Neuroactive ligand-receptor interaction
InterPro domain: [69-266] IPR009003; 6e-40; Peptidase cysteine/serine, trypsin-like
    [71-364] IPR001254; 3.3e-36; Peptidase S1/S6, chymotrypsin/Hap
Orthology group: MCL25025; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213461-TA
ATGAACTGGGTTCTCGTTTTTATGACTCTATCAATGTTTTATAGCTATGTTCTGAGCTATGGAGACCAGGCCTCAGTAGTTAAATTTAAATTCAATCTGGACCCGTATGGAGAGGCTCCGCTCCAAGATGGCCGACGGAGACATCGAGATAACAAAAAATCATTAAGAGTCCGAAACGACTTCCTGTTCAGTTTAAACAAAGACGCTCTCAGGATCCGGGGGGGGAATGCCACGGATACGACCAACTATCCGTACATAGCGGCCATTATAATCAACGGCAGGTTATGGTGCGCCGGCACCATCGTCGACGTCAACTGGGTACTGACAGCGGCGCATTGTCTGAATTACGTGCTTCACGTAGCGCCAATGAAGACCCTGGGGCAGTACGTGAAGGTCAGGGTCGGCAGCGCCCAGGCTCACGAAGGAGGTTTGCTGGTAGACGTCGCGGGGGCCGTGCGACACCCGAAATTCGAAGAGGAACCCGTGCCTCATGCTGATGTAGCTTTATTGAAACTGACTGAAAACCTTGAATTCTCAACTCACATCAATCTGATTAAAATAAACGAAGATATGAGAGAGCCTTACGCGCAGAGTTTCGTGTCTGTAACCGGCTGGGGAGCGACCCGTGGCACAGACACAGCCTTCAGAGAACACACGCCCGACCTGATGACGGCTCGTCTCAAGGTTCGCACGGTCAACTACTGCAGAGACGCGTACCAACTGGTTAGCGGGTTTCAGTTCACCGCAGACTTCTTCTGCGCTTCGTTGAGAAACGGCACCAGAGACGCGTGTTTGGGCACAGACACAGCCTTCAGAGAACACACGCCCGACCTGATGACGGCTCGTCTGAAGGTTCGCACGGTCAACTACTGCAGAGACGCGTACCAACTGGTTAGCGGGTTTCAGTTCACCGCAGACTTCTTCTGCGCTTCGTTAAGAAACGGCACCAGAGACGCGTGTTTGTTCGACGCGGGCGCGCCAGCCACCCAACACAACAAATTAATGGGCGTCATGAGCTTCGGGCCCGAGCGTTGCGGACACGAATACCAACCAGCGGTGTTCATTAAGGCTTTTTATTTCAGGGATTTCGTGAAGCACACTATATCCTCATATAAGACTACAGCTGAACTTATAGAAGCCATGAAAGATATCGACAAAGTTATCAGACCACCCGTTCATGTGAAACAGGAACACGTGGTCGTCGAGAAAGATGAGCAAGAGGTCACGGAACCAGATTATAAACACGATTGA

Protein sequence:

>DPOGS213461-PA
MNWVLVFMTLSMFYSYVLSYGDQASVVKFKFNLDPYGEAPLQDGRRRHRDNKKSLRVRNDFLFSLNKDALRIRGGNATDTTNYPYIAAIIINGRLWCAGTIVDVNWVLTAAHCLNYVLHVAPMKTLGQYVKVRVGSAQAHEGGLLVDVAGAVRHPKFEEEPVPHADVALLKLTENLEFSTHINLIKINEDMREPYAQSFVSVTGWGATRGTDTAFREHTPDLMTARLKVRTVNYCRDAYQLVSGFQFTADFFCASLRNGTRDACLGTDTAFREHTPDLMTARLKVRTVNYCRDAYQLVSGFQFTADFFCASLRNGTRDACLFDAGAPATQHNKLMGVMSFGPERCGHEYQPAVFIKAFYFRDFVKHTISSYKTTAELIEAMKDIDKVIRPPVHVKQEHVVVEKDEQEVTEPDYKHD-
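
Consistency check (illustrative):

The transcript and protein lengths listed above agree with each other: 416 codons plus one stop codon give 3 x 417 = 1251 bp. The relationship between the two sequences can also be checked by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code and the record's convention of writing the stop codon as "-". The function name translate and its stop_symbol parameter are hypothetical helpers introduced here and are not part of the database.

```python
"""Sketch: translate a coding sequence with the standard genetic code."""
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame DNA coding sequence into a protein string.

    Stop codons are written as stop_symbol (assumed "-" to match the
    protein record above). Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First five and last three codons of DPOGS213461-TA.
    print(translate("ATGAACTGGGTTCTC"))  # MNWVL
    print(translate("CACGATTGA"))        # HD-
```

The two fragments translate to MNWVL and HD-, which match the start and end of DPOGS213461-PA. Passing the full nucleotide sequence to the same function would be the way to check the whole record.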