Monarch geneset OGS2.0

DPOGS205776
TranscriptDPOGS205776-TA1146 bp
ProteinDPOGS205776-PA381 aa
Genomic positionDPSCF300144 - 436817-443841
RNAseq coverage392x (Rank: top 31%)
Annotation
HeliconiusHMEL0051202e-7252.21% 
BombyxBGIBMGA010590-TA2e-7453.78% 
DrosophilaCG31954-PA3e-3734.78% 
EBI UniRef50UniRef50_Q9NB923e-6652.40%Trypsin AiT6 n=14 Tax=Obtectomera RepID=Q9NB92_AGRIP
NCBI RefSeqNP_001037317.14e-4039.02%vitellin-degrading protease precursor [Bombyx mori]
NCBI nr blastpgi|83476381e-7154.12%trypsin precursor AiT9 [Agrotis ipsilon]
NCBI nr blastxgi|1569682951e-7252.80%protease [Helicoverpa armigera]
Group
Gene OntologyGO:00038245.6e-74catalytic activity
GO:00042522.2e-73serine-type endopeptidase activity
GO:00065082.2e-73proteolysis
KEGG pathwayani:AN2366.22e-36 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[14-245] IPR0090035.6e-74Peptidase cysteine/serine, trypsin-like
[22-256] IPR0012542.2e-73Peptidase S1/S6, chymotrypsin/Hap
[53-68] IPR0013149.2e-13Peptidase S1A, chymotrypsin-type
Orthology groupMCL18546 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205776-TA
ATGCGCTTGTTTATACTGTTAGCTGCTCTTGGAGCCGTTTGTGCGGCACCAAGGAAGCCGGAACGCATTGTCGGTGGTGAAAATACTAGTATTGAAAGATATCCTTTCATGTCTGGTATCTTACACAATTCCTTCTGGGGAACGGAACAGATGTGTGGTGGCACTCTCATAACCAACAGAGCTGTAGTCACAGCAGCTCACTGTTTTGCAGGCTTTAGGCCATCCGAGCTGAGCGTACGATTGGGATCAACTTATGCTTCGTCTGGCGGACAAGTCCACCTACTTTCTAGAATTATCATGCATCCACACTATAATTTTTACTTAATTTTAAATGACATTGCAATAACAAGACTTCAAAACGCTGTTACATTATCTAATCAGATTCAAATTGCACGAATTGCTGGACCCCAGTACAACTTACCAGATAATACATCTCTCAATGTTATTGGTTGGGGGGCTACCAGTTACAATGGAAAACCTTCCGAAATACTTCAGCACGCATCTGTTAACATCATCAACCAAAGTGTTTGTGCAGAACGTTACGCATATTTCCAATCTTTACCCGGATCTTGGCCAAGCATAACTCCTGAGATGATGTGCACCGGTATCCTGGATGTCGGTGGCAAGGACGCCTGTCAGGGTGACTCCGGCGGACCTGTGGTTCATGGCGGAAATATTCTAGTAGGAATTACTTCGTGGGGATACCAGTGCGCTCATCCAAATTATCCGGGAGTTAACATTTGTAAAGAACACCAATATCCATCGATCGATAACGCGGACGTGCCGTCTGAAACAAGCAAAGTGGCTCTAGACACGTGGTCTGAGATACCCACCAGCGAGCCATCGGATAGCGTGACCAGTTCGAGTGTTGTCGGTGTTAGCGGAAGCGGATGTGTCTGTGGTCTGGCGGCGAGGGTGGTGGCCTTAGAGGCGGACGCAGCCTCCGCACAGAGACACAGACATCATCTCGAGGAGGAGACCAGCGAGCCATCGGATAGCGTGACCAGTTCGAGTGTTGTCGGTGTTAGCGGAAGCGGATGTGTTTGTGGTCTGGCGGCGAGGGTGGTGGCCTTAGAGGCGGACGCAGCCTCCGCACAGAGACACAGACATCATCTCGAGGAGGAGGTATTCATTAAGCACAGTTGA

Protein sequence:

>DPOGS205776-PA
MRLFILLAALGAVCAAPRKPERIVGGENTSIERYPFMSGILHNSFWGTEQMCGGTLITNRAVVTAAHCFAGFRPSELSVRLGSTYASSGGQVHLLSRIIMHPHYNFYLILNDIAITRLQNAVTLSNQIQIARIAGPQYNLPDNTSLNVIGWGATSYNGKPSEILQHASVNIINQSVCAERYAYFQSLPGSWPSITPEMMCTGILDVGGKDACQGDSGGPVVHGGNILVGITSWGYQCAHPNYPGVNICKEHQYPSIDNADVPSETSKVALDTWSEIPTSEPSDSVTSSSVVGVSGSGCVCGLAARVVALEADAASAQRHRHHLEEETSEPSDSVTSSSVVGVSGSGCVCGLAARVVALEADAASAQRHRHHLEEEVFIKHS-