Monarch geneset OGS2.0

DPOGS205340
TranscriptDPOGS205340-TA1536 bp
ProteinDPOGS205340-PA511 aa
Genomic positionDPSCF300292 + 80677-83764
RNAseq coverage125x (Rank: top 57%)
Annotation
HeliconiusHMEL0107758e-16660.86% 
BombyxBGIBMGA013679-TA9e-7146.40% 
DrosophilaCG13430-PB1e-2732.54% 
EBI UniRef50UniRef50_Q16PS21e-2633.61%Trypsin n=2 Tax=Aedes aegypti RepID=Q16PS2_AEDAE
NCBI RefSeqXP_001599779.12e-2835.15%PREDICTED: similar to ENSANGP00000018316 [Nasonia vitripennis]
NCBI nr blastpgi|3454828005e-2835.15%PREDICTED: trypsin-7 [Nasonia vitripennis]
NCBI nr blastxgi|3454828003e-2735.15%PREDICTED: trypsin-7 [Nasonia vitripennis]
Group
Gene OntologyGO:00038242.4e-57catalytic activity
GO:00042525.9e-37serine-type endopeptidase activity
GO:00065085.9e-37proteolysis
KEGG pathwaydpo:Dpse_GA218791e-24 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[200-439] IPR0090032.4e-57Peptidase cysteine/serine, trypsin-like
[202-436] IPR0012545.9e-37Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL34594 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205340-TA
ATGTTGATGTGTATTTGTATTGTAATTTTAAATATATTTTTTATAAATTGTAACAATGACACTGAAAATGATAGAGGTGAATTAACAGTTAATGATATTGCGTCAACGTTAGTGTCTCTACTGGGGGTAGATGCAGAAAGCAAAAGAATCAAGGATCTGAACGAGAAATTAAAGAAAATAGACAGTTTTGTGGTTGACAAAGACAATTATGCTGAATTATTACTAAATGAAACGATGACCACCAATTTGAATGAAACTAATTTCTTCAATGCCTTAAATGAATCTGATCACTTTAGATTCCTGATTGAAAAGGAACCTAAAAATTTATATTTAAAATTGAAAGAGAAAATGAATCGAGATGACATACTGAGGATCGTTTCAAAAGATCATCTTAAATCTTCGAAATCTGGAAGAAAAATGATGGTCAATATAAATCCAGATTACAACGAAAAAGACGAAATCATTGAAGAAGTAGTCGATGAGATGATCGCAAAAATGCCTACAGACCAACACAAGTTTAAGTTTAGGAAAGATGATAATATTTATTGGGACCCTCAGGGAGAGTTGGAAGATTTAAACGATTACCATAAGCATAACGGGAGACGAATATATAAGGGAGAGAGAACGACAATAAGATATTATCCATTTATGGTATCTGTCCATGTGATGGGAAGATTTTGGTGTGGAGGTAGCATCTATTGGCACGACCTGGTTTTAACTTCAGCCGCTTGCTTACAACTAATGCACAACAATCGTTTCTTCCGTGAAAACCCGGGTGTTCTAAAAATACGATTGGGCAGCAACCATAGTCGAATCGGAGGAGAAAATGTTGAGGCTCTTGAAGTATATTTCCATCCGGGATACAATCCAAGAACTCTTCGACACAACATAGCGATCATTCGTCTTCGACGTCATCTGTTCTTCAGCTATCATCGCATACCTAAGCTTATTGACATCTCACACACAGAAATAGGAATCTCGCCGACTTCCGAGGTTCTGGTTTTAGGATGGGGAGTGACAAAGATGTCACAAAAACTTGCCTATGAGCCTGTTTACTTGAACCGAAAGTTTTTACCAATTTATCCAAACGTGTTTTGTAAGGATGTTTATGGAAAAAAGTTTATATCAGAAACAATGTTTTGTGCTGGAACATTAACAACTGGCGAAGGAGCCTGTGATCATGATGCAGGAGGTCCAGCAGTCATTGCGGGTAAACTAGTCGGTATTATATCGTTCGGCCCTTCCGTCTGTGGATATCCTAACGCTCCGACTGTATTTACACTTGTCGGTGCATATTCTGACTGGATTGAAAGTGTTAATGAATCTATGCCTGGATACTACAGGGGCAAGAAACGGACTACTACTTTAAAACCCATAACATTCGCAGAATATAAGATAAAGAAACTGTACAAAGATATAGCTGATCTTAACAGTGATAAACCTGCTACCGCGACGACAGAGGCCAAAATAGAGCTTCTGCGGCAGAAAAACAAGCTGATGCATGATGATTCTGACTACGGATTTAGTATTTTATAG

Protein sequence:

>DPOGS205340-PA
MLMCICIVILNIFFINCNNDTENDRGELTVNDIASTLVSLLGVDAESKRIKDLNEKLKKIDSFVVDKDNYAELLLNETMTTNLNETNFFNALNESDHFRFLIEKEPKNLYLKLKEKMNRDDILRIVSKDHLKSSKSGRKMMVNINPDYNEKDEIIEEVVDEMIAKMPTDQHKFKFRKDDNIYWDPQGELEDLNDYHKHNGRRIYKGERTTIRYYPFMVSVHVMGRFWCGGSIYWHDLVLTSAACLQLMHNNRFFRENPGVLKIRLGSNHSRIGGENVEALEVYFHPGYNPRTLRHNIAIIRLRRHLFFSYHRIPKLIDISHTEIGISPTSEVLVLGWGVTKMSQKLAYEPVYLNRKFLPIYPNVFCKDVYGKKFISETMFCAGTLTTGEGACDHDAGGPAVIAGKLVGIISFGPSVCGYPNAPTVFTLVGAYSDWIESVNESMPGYYRGKKRTTTLKPITFAEYKIKKLYKDIADLNSDKPATATTEAKIELLRQKNKLMHDDSDYGFSIL-