Monarch geneset OGS2.0

DPOGS210981
Transcript: DPOGS210981-TA, 1929 bp
Protein: DPOGS210981-PA, 642 aa
Genomic position: DPSCF300004 - 138282-148059
RNAseq coverage: 441x (Rank: top 28%)
Annotation
Heliconius       HMEL025004        2e-115  77.60%
Bombyx           BGIBMGA006406-TA  2e-108  71.60%
Drosophila       CG4914-PA         3e-94   52.32%
EBI UniRef50     UniRef50_G6D2Z9   0.0     100.00%  Serine protease-like protein n=3 Tax=Obtectomera RepID=G6D2Z9_DANPL
NCBI RefSeq      NP_001037368.1    7e-119  66.01%   serine protease-like protein [Bombyx mori]
NCBI nr blastp   gi|112983618      1e-117  66.01%   serine protease-like protein precursor [Bombyx mori]
NCBI nr blastx   gi|112983618      5e-119  66.01%   serine protease-like protein precursor [Bombyx mori]
Group
Gene Ontology    GO:0003824  4e-90  catalytic activity
                 GO:0004252  1e-87  serine-type endopeptidase activity
                 GO:0006508  1e-87  proteolysis
KEGG pathway
InterPro domain  [390-635]  IPR009003  4e-90    Peptidase cysteine/serine, trypsin-like
                 [56-285]   IPR001254  1e-87    Peptidase S1/S6, chymotrypsin/Hap
                 [429-444]  IPR001314  9.3e-15  Peptidase S1A, chymotrypsin-type
Orthology group  MCL22670  Patchy

Nucleotide sequence:

>DPOGS210981-TA
ATGTTTTTGTATGTACTCTGTCTTATGCTATGCGTGTCTCTATGTTCTACGGATACATTGACTAAGAATTTACTGAGAGTAAGCGATGAATACTACGCTCATGGTCGAAACAACGATTTACCGCCGTGCCGTGATTGTAGTTGTGGAGAACGTAATGAAGAACCTAGAATCGTGGGTGGTTCTTCCACCGACGTGAACGCATATCCTTGGACGGCTCGTCTTATCTATTATAAGTCGTTCGGATGTGGTGCCTCGGTCATCAATGACAGATACGTTATAACGGCAGCCCATTGTGTGAAGGGATTCATGTGGTTCTTATTCAAAGTGAAATTCGGTGAGCATGATCGTTGCGATACTGGCCATGTGCCTGAAACTCGTACAGTAGTTAAGATGTATGTACACAACTTTACTTTGACGGAATTAACTAATGACATATCACTACTACAGCTCAATAGACCTTTGGAGTATACACATGCTATCCGACCCGTTTGTCTGCCTAAAACAGCGGATAATTTGTACGTTGGCAAAATAGCTACTGTCGCTGGCTGGGGCGCCGTCCAAGAAACTGGTAAATGGTCGTGTACGTTACTCGAGGCTCAGTTACCGATACTGAGCAACGAGAATTGTACCAAGACGAAATATGATGTAACAAAAATTAAGGAAGTTATGATGTGTGCTGGATATCCAGAAACCGCTCATAAAGACGCTTGCACTGGAGATAGCGGTGGACCGTTGTTTATGGAAAATAGTGAACACGCTTACGAATTAATTGGTATAGTATCTTGGGGCTACGGATGTGCTAGAAAAGGCTACCCCGGTGTTTACACGAGAGTAACCAAATATTTGGACTGGATACGTGATAATACACAAGACGCATATAGTTGTCTTTACAAGAGCTGGTCCAGCCTAATAATAGAACAGCGCATAATAATGTGGTTGAGGGAGCAACGTCCTCATTCCCGTTGCATCCTCTCTCTGCTACAGCCTGGGGCTAGTGCTGCCTTAGGTGAAAAGGCCTTTAATGAAACAAAAGAAACAACCACTGCGGCAAGTGGTAATATTGAGAGTTCCAGTAATACTAATAGTAGTACTACTTCTACTACTACTCCTGCTACTACATTCGATCAGGAGATGTTAGACGAACTATATCAAGATTCGCAAAACAGGTGTAACTGTCGTTGCGGTGAAAGAAACGAGGAATCTCGTATTGTGGGTGGAGTGGAAACATCAGTGAACGAGTTCCCTTGGGTCGCTCGTCTGACTTACTTTAACAAGTTCTACTGCGGGGGCATGCTGATAAATGATAGATATATCCTAACTGCGGCCCATTGTGTTAAAGGATTAATGTGGTTCATGATAAAGGTAACTTTGGGAGAGCACAACCGTTGTAACGACTCTCGTCCTGTAACACGTTATGTAGTACAAGTTGTTGCCCACAACTTTACCTATCTTACATTCAGGGATGATGTTGCCGTTTTGAGATTGAACGAGCCGATCGAAATATCAGATACAATTAAACCAGTATGTCTGCCCCAAATTACCGATAATGATTACGTGGGGGTAAAAGCAATTGCCGTTGGTTGGGGATCGATTGGTGAGCAGAAAAATCATTCGTGCACTCTATTAAACGTGGAATTGCCAGTGCTTAGTAATGACGTTTGTAGAAACACTATGTATGAGACGAGTATGATAGCGGATGGAATGCTCTGCGCCGGTTACCCAGACGAAGGACAAAGGGACACTTGCCAGGGTGACAGTGGTGGACCTCTGACTGCAGAGAGAAAGGATAAACGTTACGAACTGCTGGGTATAGTCTCTTGGGGTATTGGGTGTGGAAGACGTGGATATCCAGGGGTTTACACGAGGGTTACAAAATACCTGAATTGGATCAGAGACAACTCCCGCCACGGATGTTTCTGTTCAGACTAA

Protein sequence:

>DPOGS210981-PA
MFLYVLCLMLCVSLCSTDTLTKNLLRVSDEYYAHGRNNDLPPCRDCSCGERNEEPRIVGGSSTDVNAYPWTARLIYYKSFGCGASVINDRYVITAAHCVKGFMWFLFKVKFGEHDRCDTGHVPETRTVVKMYVHNFTLTELTNDISLLQLNRPLEYTHAIRPVCLPKTADNLYVGKIATVAGWGAVQETGKWSCTLLEAQLPILSNENCTKTKYDVTKIKEVMMCAGYPETAHKDACTGDSGGPLFMENSEHAYELIGIVSWGYGCARKGYPGVYTRVTKYLDWIRDNTQDAYSCLYKSWSSLIIEQRIIMWLREQRPHSRCILSLLQPGASAALGEKAFNETKETTTAASGNIESSSNTNSSTTSTTTPATTFDQEMLDELYQDSQNRCNCRCGERNEESRIVGGVETSVNEFPWVARLTYFNKFYCGGMLINDRYILTAAHCVKGLMWFMIKVTLGEHNRCNDSRPVTRYVVQVVAHNFTYLTFRDDVAVLRLNEPIEISDTIKPVCLPQITDNDYVGVKAIAVGWGSIGEQKNHSCTLLNVELPVLSNDVCRNTMYETSMIADGMLCAGYPDEGQRDTCQGDSGGPLTAERKDKRYELLGIVSWGIGCGRRGYPGVYTRVTKYLNWIRDNSRHGCFCSD-
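
Consistency check:

The lengths and domain coordinates reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the geneset; it only uses numbers copied from the record, and it assumes the transcript is coding sequence only (it starts with ATG and ends with a TAA stop codon, with no UTRs) and that InterPro coordinates are 1-based protein positions. The function and variable names are hypothetical, chosen for illustration.

```python
# Values copied from the record (DPOGS210981).
TRANSCRIPT_BP = 1929   # length of DPOGS210981-TA
PROTEIN_AA = 642       # length of DPOGS210981-PA

# InterPro domain spans as listed in the record (assumed 1-based, inclusive).
DOMAINS = {
    "IPR009003": (390, 635),
    "IPR001254": (56, 285),
    "IPR001314": (429, 444),
}


def codon_count(transcript_bp: int) -> int:
    """Return the number of codons, assuming a CDS-only transcript."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3


def domains_within(protein_aa: int, domains: dict) -> bool:
    """Check that every domain span lies inside the protein."""
    return all(1 <= start <= end <= protein_aa
               for start, end in domains.values())


codons = codon_count(TRANSCRIPT_BP)
# One codon is the stop codon, so codons should equal residues + 1.
lengths_consistent = codons == PROTEIN_AA + 1

print(codons, lengths_consistent, domains_within(PROTEIN_AA, DOMAINS))
# prints: 643 True True
```

Under these assumptions, 1929 bp gives 643 codons, which matches 642 residues plus the stop codon (shown as the trailing "-" in the protein sequence), and all three InterPro spans fall within the 642 aa protein.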