Monarch geneset OGS2.0

DPOGS206698
TranscriptDPOGS206698-TA1938 bp
ProteinDPOGS206698-PA645 aa
Genomic positionDPSCF300048 + 1655605-1660374
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0047492e-6339.38% 
BombyxBGIBMGA008530-TA7e-4030.75% 
DrosophilaCG8586-PA3e-1427.39% 
EBI UniRef50UniRef50_Q7K5M09e-1226.67%GH05918p n=6 Tax=Drosophila RepID=Q7K5M0_DROME
NCBI RefSeqXP_002089671.15e-1326.86%GE22829 [Drosophila yakuba]
NCBI nr blastpgi|3838559404e-1232.22%PREDICTED: serine proteinase stubble-like [Megachile rotundata]
NCBI nr blastxgi|3838559407e-1331.25%PREDICTED: serine proteinase stubble-like [Megachile rotundata]
Group
Gene OntologyGO:00038244.8e-32catalytic activity
GO:00042527.3e-23serine-type endopeptidase activity
GO:00065087.3e-23proteolysis
KEGG pathway 
InterPro domain[413-621] IPR0090034.8e-32Peptidase cysteine/serine, trypsin-like
[428-620] IPR0012547.3e-23Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL25499 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206698-TA
ATGACGTCCCCATTTTTAGTTTTGTTGATCGTTCTGAAAACGGTTTCATCGCAAATAAATGGCGAATTTTGGTGGCTTAATGAAAAATTCACGAAGTTGCAGCAGGTTGTGCCACCATCACCAACATTCGAAGACACGGGACATTTGGAAACCGATGAAAGTGTTAAAATTATTTTTAAAGATGTCACAGAGGATATTGATAAAAATATAAACTTTTCTCTCAACGAAGGAAAAATTGCCATAAATGATAAAATTGACTTCGATGAAGATAAACCGACAATAGAATCTGAAAGTATTTGCACGTTTATTACAAAACATGAATGCTTACGCAACAAGGGCACTGTTCACATGTCTGGGTATGCAATTTTCAGATTATGTCCCTTGAATTCTTTTCATAATTATCACAGAATTTGCTGTATACTTCCTCTTTTTCCATATCCCAAACAATTACACCCAAGTGACATCCTCAATGGAAGCCGGTACAAACGATCTAATGATGATGAAATCAGTCCAGCGCTCAAACAAAGAAACGCTTTGCTGCAGCGAAAAAATTTCTCCCAGGCGTTTAAAAATAACCACGATCCTACAACAGATCAATCTAGAAACCAAAAAATTGTAACTCCTAGTGATAACGTGGATCCGTATTGGAATGTTAAAAACTTTAAATTTCGTCAACAAAATAATTTTAATGAAAACAAGGATCGTGAAAATACAAAAATCGACTCAAGTAGTAAAGATTATTCAGATGATTATACAGCAGAGGTACCTAAACCTGGACTTTTAGGTGCTTATACAGAACGTGATGAACGTCTTACTACTTGGAAAATGAGAAATAAAGCCTACTCATACGACGGCTATGACGAAATAAGCGAGGAAGATAGCGGAGAAACTGACATGCCGTTCGGTTACTCAACGTTTGATCCAAGACAAGGTAATCGCAAGAAAAATTCTAGAAGCAAAAAACGTAAACCTCAAAGGCTAATGTTAACATCAACAGAAGAGAACAAAGGATCAGAGAGCCAGGCTATAAATTTTCATTCGAGGCCGGATTTTCATGTACTACATGGTTTTAAACTTGTTAATTTATCAGGGAATAAAAATAGATTTGTTAGAACTACCACAGAAACTTTATGTGACAGCCAGGATACTTCCAACGAAAATACATTTCCAGAGGTCGAAAATCCGGATGATATTTACGACGACACGATTGATTTAGATCAGCAAGTTTACAAAGATTGTGGAAATAGTGTTACCAACGCTTTCAATAGCGATTTTAGAAAATCACATGAAGAAAAAAACCCTTGGCTCGCTCTGGTCGTGTTAACAAAAAGTCCACAGACAATATTATGTTATGCCACAATAGTACATCCGCGCGCAGTCATTACAGCTGCCGAATGTGTCCAGGGTAAAATCCCTGGGGACGTAACTGTACTAACTGGTGTATGGCAACTAAGGAAAGACAAAGCAGTACCACAACACCGCATGGCCTCGGTCTACATTGTTTCTGACTATAAACCTGGAGAACTTGTTAATGATCTTGCTCTTCTGTATTGGAAACGACCATTACAACTGGCAGAGAACGTTCAGCCCGCATGCCTCGCGGATCCGCACGTCGGAGACGAGTGTTATTTCGTTGGGTGGGGTGGTTACGATCAAGGTTTAAGTCATCACCCTGACTCTCAACAAGCGACTATACTCACGCCTCGTGTGTGTAACGAGAAATTATCATCACCAGAGCTGCTTCTACCCCCAGGCGCGTTCTGTGCTTCAGTTGAATCACGTGGCACTGTAACCGGTATTGGAGGTGCTCTTCTATGTAAAGGCGCGGGTAGTCGAACATCTGTCGTAGGAGTGGCGGTGTATCGTGACAGTATAGTTGTCTTACTACCCACATTCGAATGGGTCGTCTCGGCGTTGCGACACCACCAAATAATTTAA

Protein sequence:

>DPOGS206698-PA
MTSPFLVLLIVLKTVSSQINGEFWWLNEKFTKLQQVVPPSPTFEDTGHLETDESVKIIFKDVTEDIDKNINFSLNEGKIAINDKIDFDEDKPTIESESICTFITKHECLRNKGTVHMSGYAIFRLCPLNSFHNYHRICCILPLFPYPKQLHPSDILNGSRYKRSNDDEISPALKQRNALLQRKNFSQAFKNNHDPTTDQSRNQKIVTPSDNVDPYWNVKNFKFRQQNNFNENKDRENTKIDSSSKDYSDDYTAEVPKPGLLGAYTERDERLTTWKMRNKAYSYDGYDEISEEDSGETDMPFGYSTFDPRQGNRKKNSRSKKRKPQRLMLTSTEENKGSESQAINFHSRPDFHVLHGFKLVNLSGNKNRFVRTTTETLCDSQDTSNENTFPEVENPDDIYDDTIDLDQQVYKDCGNSVTNAFNSDFRKSHEEKNPWLALVVLTKSPQTILCYATIVHPRAVITAAECVQGKIPGDVTVLTGVWQLRKDKAVPQHRMASVYIVSDYKPGELVNDLALLYWKRPLQLAENVQPACLADPHVGDECYFVGWGGYDQGLSHHPDSQQATILTPRVCNEKLSSPELLLPPGAFCASVESRGTVTGIGGALLCKGAGSRTSVVGVAVYRDSIVVLLPTFEWVVSALRHHQII-