Monarch geneset OGS2.0

DPOGS202909
TranscriptDPOGS202909-TA1584 bp
ProteinDPOGS202909-PA527 aa
Genomic positionDPSCF300126 + 267124-271227
RNAseq coverage176x (Rank: top 50%)
Annotation
HeliconiusHMEL0053724e-12975.93% 
BombyxBGIBMGA004182-TA4e-10052.39% 
DrosophilaCG3589-PA2e-2528.50% 
EBI UniRef50UniRef50_UPI00022476CE8e-6230.73%UPI00022476CE related cluster n=1 Tax=unknown RepID=UPI00022476CE
NCBI RefSeqXP_002427295.11e-5428.57%heat shock protein 70 HSP70 interacting protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3454946773e-6130.73%PREDICTED: peroxisomal leader peptide-processing protease-like [Nasonia vitripennis]
NCBI nr blastxgi|3454946772e-6030.74%PREDICTED: peroxisomal leader peptide-processing protease-like [Nasonia vitripennis]
Group
Gene OntologyGO:00038241.2e-23catalytic activity
GO:00042523.2e-06serine-type endopeptidase activity
GO:00065083.2e-06proteolysis
KEGG pathway 
InterPro domain[296-501] IPR0090031.2e-23Peptidase cysteine/serine, trypsin-like
[323-479] IPR0012543.2e-06Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL16916 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202909-TA
ATGACGGTCGAGGGTGTCATGGTGTCCTACAACTATTCGGATGACGCGGAACACGCGAACATACTGACAGTGTCGGCATCAGGAATAAAGTTCTCCAAGCGATGGGTTCTCACGCACGGTTCAATATTGTCGCCACTAAAACAGGCCAACGTTATCAAGAACGCTCGAGGCAAACCCATTTTAAACGACGAGTTCTATGACAACCTTCCGGAAATATACGTCACCTGTGAGAAGGTTAAATCCAAGACACCAAACATGTACGAGAACCTCGAGATCCTGAGCAGAGAGAGATCGCTTAACAACGATGCTGACTTGGAACATAGCAGCTATCAAGTTAGAGTGCTCACTGGAAGGATATGCCACGTGTGGCAATGTCCTGTGCTGGATCGCTGCGTGGACAACATCCTGTACAGTTGGACCATCGGTCACCGGGACGGAGACGTGGAGAAACAGACGCAGCTGGGGAAGGCGCTGCTTTCCGTGTTCGTGCTGGTAGACCTGGAGACGGATAATAGAGATTTTAAGATCCTGAGGCCGCTGTCTGAGCTACTGGACATGTGTCAGCCGCCGCCCGACCGCGGCGCCACCGTCGACATACATTCCACGCCCTTTGGATGCGAGGTGTTCCTGAACGCGGTGACTCGTGGCTCCGTGTGTGGTGTCGTGGGCAAGCGACCGTCCCTCTTACTGACAGACGCTGCCACCGCCTTGGGCTCGGAGGGAGGGCCAGTCTTCACCGCGGGACCCGACAATCATCTGGTGGGCGCGGTGGTGTGTTCCGTGTCGTGGTGGCGCGGGGAGTGGGTGGGTCTCACCCTGGCCGCCCCTCTCAAGTCGGTGCTCGCGGCTAAGCTGAGAGTCCAACAGCCGCTCCCCGCTAGGACACCGCCGTCGCCGCTGTACGTTAGGATACTAGAGCTGGTGGACCGCAGTACGGTGCTGGTGAGGTGCGGGGCGGCCTGGGGCGCCGGACTCTATCTGGGGGGAGGACACGTGCTCACGTGCGCGCACGTCGTCAAACATCACGCGTCTCACAAAGTGTCGGTGTACTGTGACAACGTGAAGGAGACGGCCGCGGTCCGCTACAAGACCAAAGACGACCTAGCTTACGATCTCGCCTTGCTATACGTGTCACCGGCTCATTGGAGACACCTCCTGCCGGCGGTCTTCGCTGAGGAGTCAGCACAGAAAGGCGAGTCTGTGCTGGCGGCGGGGTTCCCGTACTTCAACGAGACCAACCTGGAGGAGCTGAAGCCGACCGTCACCAGCGGCCACGTCAACAACGTCTCCCCGTCACTCATACAGACCACCTGCTGCGTGCAATCAGGGTTCAGCGGAGGTCCGATATTCCGTATAACAAAGGAGCTCAAGGTGGAGGTGCTGGGTACGATCGTGTCCAACGCTAAGACGGAGACGGGCGCTAGCTACCCCTACATCAACATGGCCGTCCCCACCAAGGCGTTCATACGCCTCGTGCAACACTTCATACTGGAGAGGGATGAGAATGTCCTCTCCCAGATTGAAAACAAAAAAGATATGATCCAATCACAGTGGAGGTTGCTGCCTTATAGATCTAAGATATGA

Protein sequence:

>DPOGS202909-PA
MTVEGVMVSYNYSDDAEHANILTVSASGIKFSKRWVLTHGSILSPLKQANVIKNARGKPILNDEFYDNLPEIYVTCEKVKSKTPNMYENLEILSRERSLNNDADLEHSSYQVRVLTGRICHVWQCPVLDRCVDNILYSWTIGHRDGDVEKQTQLGKALLSVFVLVDLETDNRDFKILRPLSELLDMCQPPPDRGATVDIHSTPFGCEVFLNAVTRGSVCGVVGKRPSLLLTDAATALGSEGGPVFTAGPDNHLVGAVVCSVSWWRGEWVGLTLAAPLKSVLAAKLRVQQPLPARTPPSPLYVRILELVDRSTVLVRCGAAWGAGLYLGGGHVLTCAHVVKHHASHKVSVYCDNVKETAAVRYKTKDDLAYDLALLYVSPAHWRHLLPAVFAEESAQKGESVLAAGFPYFNETNLEELKPTVTSGHVNNVSPSLIQTTCCVQSGFSGGPIFRITKELKVEVLGTIVSNAKTETGASYPYINMAVPTKAFIRLVQHFILERDENVLSQIENKKDMIQSQWRLLPYRSKI-