Monarch geneset OGS2.0

DPOGS214570
Transcript: DPOGS214570-TA, 1272 bp
Protein: DPOGS214570-PA, 423 aa
Genomic position: DPSCF300050 - 780900-785277
RNAseq coverage: 10346x (Rank: top 1%)
Annotation
  Source          Hit                     E-value  Identity  Description
  Heliconius      HMEL021956              2e-79    43.91%
  Bombyx          BGIBMGA010546-TA        2e-43    36.71%
  Drosophila      MP1-PC                  1e-48    36.99%
  EBI UniRef50    UniRef50_UPI00015B5CB3  2e-58    38.07%    UPI00015B5CB3 related cluster n=1 Tax=unknown RepID=UPI00015B5CB3
  NCBI RefSeq     XP_972720.2             6e-63    44.21%    PREDICTED: similar to hemolymph proteinase 5 [Tribolium castaneum]
  NCBI nr blastp  gi|189239177            1e-61    44.21%    PREDICTED: similar to hemolymph proteinase 5 [Tribolium castaneum]
  NCBI nr blastx  gi|189239177            8e-62    44.14%    PREDICTED: similar to hemolymph proteinase 5 [Tribolium castaneum]
Group
Gene Ontology
  GO:0003824  4.2e-80  catalytic activity
  GO:0004252  4.7e-78  serine-type endopeptidase activity
  GO:0006508  4.7e-78  proteolysis
KEGG pathway 
InterPro domain
  [156-422]  IPR009003  4.2e-80  Peptidase cysteine/serine, trypsin-like
  [163-417]  IPR001254  4.7e-78  Peptidase S1/S6, chymotrypsin/Hap
  [194-209]  IPR001314  7.2e-16  Peptidase S1A, chymotrypsin-type
Orthology group: MCL23662, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214570-TA
ATGAACAACCTCGATGTAACAAATGGGAAGGGAACAAGAGAATATTCGTTATATCAAGTAATTCGTCGAACTGAGAGCCCGCGGACCAAGCATAATCAGTCTACCGAAGGACTCCGTGAAGAGCACATCGGACAAAACAATATAATAAATAAAATGCTCACATATAAGTTATATTTGTGTGTATTAATACTCGGCTGTGCGGGAAGTATCAATGGAAAAGTGTGCAACGACTGTGTTGCCTTCACATCATGTCCGAGCGCCGCGGAATTAGCTGTGAACCATCGAGACGCCACCACCAAGACAAAATTCATAGAATCCTTTTGTGGATATGAATACGATCAATTAAAAAGGATACCAAAGGTGTGTTGTTCTGATTTCAACGTTGCCTCACGGACCGGGGAAGCGGGAACAGATGTAACAGGCACAGCCAATCCACATCCCAACATGAAACTTCTGCCTGACATTTGTGGTGACATCGATGGAAATCGCATCATCGGTGGCAGAGTTGCCAAAATTCACGAGTTCCCGTGGATGGCCCTCATTTCTTATAACACGCGTGAGGGACTCCAGTTTCTGTGCGGTGGAAGTATTATAAACTCACGATACATTCTAACTGCTGGTCATTGTGTTGCGGGCTCCCAGAAAATAGCTGGAGTCCGTATCGGGGAATTCGATATTCGTTACAAAACTGACTGTCAAGGAGAAGAACCGAATTTTGTATGCGAATCTCACCTGCAGGATATTCGAGTTGAAAAAGTTATTCTTCACGAATCCTATACAGGATTACCAGCTCCCTCTAATGACATAGCTCTTCTACGTCTAAGCAAACCAATCAACCTTAGCTACAAAAATGCATTACCAATTTGTCTACCTGTGACGGAAGATCTGCAAAATGTTGAGCTTGGGGGTAGAGTGGGCACTGTAGCTGGTTGGGGACTTACAGAAACCGAAAAGTACTCTCCTGTTCTTCTTAAGGTCAACGTTTCTATAAGAACTGGGGAGGAGTGCACTCATTATTATAACAGAAGTCCCGGGACGAAAGAAAATGATAGAACTACGAACTACCTTTGCGCTGGTGAGTATTTAAAAGATAGTTGCAACGGTGACTCCGGCGGTCCTCTGATGTTAGAGGGAGAATACAAGGGAATCAACAGAAATATACAATACGGCATTGTGTCCCACGGCCCTAAACAATGCGGTTCAGATTTCCCTGGTGTGTACACTGCAGTCACCAAATACATAGGTTGGATCTTAGACAATATAAGGGAGTAA

Protein sequence:

>DPOGS214570-PA
MNNLDVTNGKGTREYSLYQVIRRTESPRTKHNQSTEGLREEHIGQNNIINKMLTYKLYLCVLILGCAGSINGKVCNDCVAFTSCPSAAELAVNHRDATTKTKFIESFCGYEYDQLKRIPKVCCSDFNVASRTGEAGTDVTGTANPHPNMKLLPDICGDIDGNRIIGGRVAKIHEFPWMALISYNTREGLQFLCGGSIINSRYILTAGHCVAGSQKIAGVRIGEFDIRYKTDCQGEEPNFVCESHLQDIRVEKVILHESYTGLPAPSNDIALLRLSKPINLSYKNALPICLPVTEDLQNVELGGRVGTVAGWGLTETEKYSPVLLKVNVSIRTGEECTHYYNRSPGTKENDRTTNYLCAGEYLKDSCNGDSGGPLMLEGEYKGINRNIQYGIVSHGPKQCGSDFPGVYTAVTKYIGWILDNIRE-
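
Consistency check:

The lengths in this record agree with each other: 1272 bp is 424 codons, which gives 423 amino acids plus one stop codon, written here as the trailing "-" of the protein sequence. The sketch below shows one way to confirm that a transcript translates to its listed protein. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names CODON_TABLE and translate are illustrative only and are not part of the geneset release.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: this table applies to the transcript.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in this record. Raises ValueError if the length is not a
    multiple of three, and KeyError on ambiguous bases such as N.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last five codons of DPOGS214570-TA.
head = translate("ATGAACAACCTCGATGTA")
tail = translate("AATATAAGGGAGTAA")
```

Here head and tail correspond to the first six and last five characters of DPOGS214570-PA. Passing the full nucleotide sequence to translate and comparing the result with the listed protein extends the same check to the whole record.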