Monarch geneset OGS2.0

DPOGS201112
TranscriptDPOGS201112-TA1638 bp
ProteinDPOGS201112-PA545 aa
Genomic positionDPSCF300137 - 203071-205387
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0179842e-11847.88% 
BombyxBGIBMGA013679-TA2e-13650.30% 
DrosophilaCG7829-PA4e-2830.61% 
EBI UniRef50UniRef50_D6WAH75e-2835.80%Serine protease P25 n=2 Tax=Tribolium castaneum RepID=D6WAH7_TRICA
NCBI RefSeqXP_001814174.14e-2935.80%PREDICTED: similar to Trypsin alpha [Tribolium castaneum]
NCBI nr blastpgi|1892347367e-2835.80%PREDICTED: similar to Trypsin alpha [Tribolium castaneum]
NCBI nr blastxgi|45300584e-2733.60%trypsin-like serine protease [Ctenocephalides felis]
Group
Gene OntologyGO:00038243.4e-59catalytic activity
GO:00042524.2e-42serine-type endopeptidase activity
GO:00065084.2e-42proteolysis
KEGG pathwayani:AN2366.22e-22 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[56-300] IPR0090033.4e-59Peptidase cysteine/serine, trypsin-like
[58-295] IPR0012544.2e-42Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL26113 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201112-TA
ATGTACAGCGTTGGTAAGGTCTATGCTAATATTAATATGGATGTTCTGAAGGCCATGAAAGTTCCAACGAATAAAAACGATAAATCATCCAATGTTACATATGATTCGTTTAAGATTGTTGGAAACACTACTATTATAGAAGAAAACGAGTTCTGGGAGTCTAGCGGCCGCAGAATATTCAAAGGAGAAAGGACCAAAATCAAACATTTTCCATTTATGGCAACAATTCAAATATTTAATAACTTTCATTGCGCCGGATCCATTATTAAGTCGGATCTTATAATCACAGCCTCTTCTTGTTTACAATTGGCTTATAATAATCGTCTGTTCCGAGAAAATCCAGCGTTTCTGTCAGCTCGAGTCGGCAGTAGCTTTTATAACGGTGGAGGTGAAGTCATATCTGTGCAGGAGGTCTACTTCCATCCTTCCTACGATCCAAAGACTTTGAGGAACAATATCTGTCTCCTCCGACTAGCACGCCATCTGAAATTTAGGAGAAAAATCAGAAGCGTAAAAAAAATTGATTTTGATAGACACGAGTCCACTCTCTCTATGACTACATCTGGAATTACTATAGTGGGTTGGGGTGCCAAAGAGCACAGTCCGATAATTGGCAGTCCATGGAAAAACATATTGTCTTTCGCTGAATTACATGTGTATCCTTTAGAAGATTGTCAAGATGTTTATTCCAAAGCTTACGTTACGAAAAAAAACTTTTGTGCTGGTTTTATATCTAGAGGAGGAGGAGCCTGCAATCGTGACGTAGGTGGTCCTGGTATAGTTGAAAATAAGCTGATGGGAATCATAAGCTTCGGATCTCCGGTTTGTGGCTCTCCCGATATGCCAACAGTGTTCACGAAAGTGGGGTATTATACTGATTGGATTGAAGAAATTATGGAACAGCCGGTAATTATTTCGAAGAAAAGGACTACTCTAAAATCAGACTTCAACCCATTTTTAGCTCAACCAATTCATATTGAACCGGATCAAACCACATTTAAGATACCACCTTTGACTGGTGAAAAAATGAAGCCAATACCTATTACAGAAATAGATGGTCAGCTTAGAATATCTGATGAGAAACTGTTTAAAGAATTCCTAGCCACTATGTTCAATAGCCAGGAAATTGCTGAATATGAGGACATAATAAATCCAGACAATGGTGACATCGAGATTAATGATATGATACTCAATGATGAGGATACAGAAGTACAGGAAGAAGTAGAGAATCAAACACAAATAAATGAAGTTTCAAACAAAAGCTTAGAAGAAAGTGAAAAAGAAGGCAATAAATACGAAATGAATACACCTGCTATAGAAGATGAACCAGCGGGCGTTGATAATTTAAATAAGGATTTAGCCAACTTACTTGAAAATGTACAAGATGATGGCGGGTTAGGACCTAAGAAAGATGAAGATAACAACGTTCAGACTGACGATCAAAAGGTTTTGACACTTTTATATTTATCTGACGAGGACAAAAAGAGTAATGGAGGATTGAGCATACCAACGGAACATTTTGAGGATTTAACGAGAAGTAAACAAAATGTTCTGAATATTTTACCAGAGAATGAACTATATGCTCTTTTATCGGAAGTTATACAAGACGAGGTTGAGAAAATAAACGCTGGGACTTGA

Protein sequence:

>DPOGS201112-PA
MYSVGKVYANINMDVLKAMKVPTNKNDKSSNVTYDSFKIVGNTTIIEENEFWESSGRRIFKGERTKIKHFPFMATIQIFNNFHCAGSIIKSDLIITASSCLQLAYNNRLFRENPAFLSARVGSSFYNGGGEVISVQEVYFHPSYDPKTLRNNICLLRLARHLKFRRKIRSVKKIDFDRHESTLSMTTSGITIVGWGAKEHSPIIGSPWKNILSFAELHVYPLEDCQDVYSKAYVTKKNFCAGFISRGGGACNRDVGGPGIVENKLMGIISFGSPVCGSPDMPTVFTKVGYYTDWIEEIMEQPVIISKKRTTLKSDFNPFLAQPIHIEPDQTTFKIPPLTGEKMKPIPITEIDGQLRISDEKLFKEFLATMFNSQEIAEYEDIINPDNGDIEINDMILNDEDTEVQEEVENQTQINEVSNKSLEESEKEGNKYEMNTPAIEDEPAGVDNLNKDLANLLENVQDDGGLGPKKDEDNNVQTDDQKVLTLLYLSDEDKKSNGGLSIPTEHFEDLTRSKQNVLNILPENELYALLSEVIQDEVEKINAGT-