Monarch geneset OGS2.0

DPOGS213412
TranscriptDPOGS213412-TA1182 bp
ProteinDPOGS213412-PA393 aa
Genomic positionDPSCF300271 - 297119-306174
RNAseq coverage218x (Rank: top 45%)
Annotation
HeliconiusHMEL0168152e-12357.61% 
BombyxBGIBMGA004388-TA1e-10367.92% 
DrosophilaCG32374-PA5e-1626.75% 
EBI UniRef50UniRef50_Q9XYX93e-1626.96%Trypsinogen RdoT1 n=1 Tax=Rhyzopertha dominica RepID=Q9XYX9_RHYDO
NCBI RefSeqXP_317174.28e-1726.19%AGAP008291-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|12724312e-1625.76%Astryp1 [Anopheles stephensi]
NCBI nr blastxgi|4103258e-1626.07%trypsin-related protease [Anopheles gambiae]
Group
Gene OntologyGO:00038244e-31catalytic activity
GO:00042524.1e-20serine-type endopeptidase activity
GO:00065084.1e-20proteolysis
KEGG pathwayani:AN2366.22e-13 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[51-272] IPR0090034e-31Peptidase cysteine/serine, trypsin-like
[56-267] IPR0012544.1e-20Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL25015 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213412-TA
ATGTGGCGTATTGGCGTGTTATTAATATTGTCCCTTCAAGTTCAAGGAAAATTAGACACTGACGAGGATACAGGGGAGTCAGGAGATCGTGATTCAGGCGGAGACTTCAAAGTTACAAAAAATCGTACTGAAGATGAGAAAATCTACGACAGCGAGGAGAGAATCGGCGCCGCAGTGACACAGATATCCCGACATCCGTACACAGCGGCTTTATTGAAGAATGAGACTTACGTTTGCAGCGCCATTATACTGAACACATACTGGCTACTCACGCTCTCTAAATGCTTTGAGTCTGACGTCATATCCTCGTACGTGACGCACAGGTACCTTGGCAACTACACCATCAGGACCGGAAGCTCATACAACAATAAAGGCGGAACAATGTCTACGATAAAAATGCTCATAAATAATTTCGATTCGAAAGTATCGGCGGTTAAATTGACAGCGCCACTTGAGTTTGGATCGAAGATTCACGGCGTACGACTACCGAGACCTGACGAAGAAGTTACTTTAGGGTACCTGACTGCAATACTGGCCTGGACTCCTTCCGGGCACATGAGAGTCGTGAACGCTCCAATAATCGATTCCTCCATATGTGAACCATCAGCTAAGCTGATACCTGGGAGGTACATTTGTGTCGGAGGTGTTCAAGATCCAAATCGACATTTCTGCAGGAAAGACAACGGTGGAGCGGTTATCCAGAACAACGTACTAATAGCCATAACTTCATTTTTGCACACCTGCGCCCTCTACACAAAAACGCATGCCTTCCCAAAAGTTTCAAGTTTTTCAAGATGGCTAGATACTGTAATCTGGGATGAAAATAATCGACCAACCACAACCAGTAGCACAACCACAGTAACAACCACAGAAGCAATTAAAAACGTAACCGAACCAAGAGAACAAAACCTTTACATAGACCCAAGGAAATTCATGCTAACGCTTCCATTTGATCCAATTAACGTACCTCTGGAACCAGCTGAAGATAACTCCGTCTTGCCAAGAATGAGTTTGTATGAATCATACCTTCAGAACATAGCGAGAGCTAAAACATCGACAACAGCGGATCCAAACGCCGTGGAAGAAGAAAAAAAGGAATGGCTGAGAAAGTTCGGAAACTCTATACTGAAGATGGACCCGAAGGCTTTATCTAAGAAATATGACCAATACGACTATAAGTGA

Protein sequence:

>DPOGS213412-PA
MWRIGVLLILSLQVQGKLDTDEDTGESGDRDSGGDFKVTKNRTEDEKIYDSEERIGAAVTQISRHPYTAALLKNETYVCSAIILNTYWLLTLSKCFESDVISSYVTHRYLGNYTIRTGSSYNNKGGTMSTIKMLINNFDSKVSAVKLTAPLEFGSKIHGVRLPRPDEEVTLGYLTAILAWTPSGHMRVVNAPIIDSSICEPSAKLIPGRYICVGGVQDPNRHFCRKDNGGAVIQNNVLIAITSFLHTCALYTKTHAFPKVSSFSRWLDTVIWDENNRPTTTSSTTTVTTTEAIKNVTEPREQNLYIDPRKFMLTLPFDPINVPLEPAEDNSVLPRMSLYESYLQNIARAKTSTTADPNAVEEEKKEWLRKFGNSILKMDPKALSKKYDQYDYK-