Monarch geneset OGS2.0

DPOGS213495
TranscriptDPOGS213495-TA846 bp
ProteinDPOGS213495-PA281 aa
Genomic positionDPSCF300100 + 400782-403276
RNAseq coverage838x (Rank: top 15%)
Annotation
HeliconiusHMEL0168332e-14488.26% 
BombyxBGIBMGA004376-TA7e-15993.24% 
DrosophilaProsbeta5-PA4e-11370.76% 
EBI UniRef50UniRef50_Q8T8V82e-8957.97%Proteasome subunit beta type n=52 Tax=Arthropoda RepID=Q8T8V8_DROME
NCBI RefSeqXP_970194.12e-12175.36%PREDICTED: similar to proteasome subunit beta type 5,8 [Tribolium castaneum]
NCBI nr blastpgi|2101486403e-15292.78%proteasome subunit beta 5 [Helicoverpa armigera]
NCBI nr blastxgi|2101486405e-14892.78%proteasome subunit beta 5 [Helicoverpa armigera]
Group
Gene OntologyGO:00516032.6e-48proteolysis involved in cellular protein catabolic process
GO:00042982.6e-48threonine-type endopeptidase activity
GO:00058392.6e-48proteasome core complex
GO:00041758e-19endopeptidase activity
KEGG pathwaytca:6587396e-121 
 K02737 (PSMB5)maps-> Proteasome
InterPro domain[71-252] IPR0013532.6e-48Proteasome, subunit alpha/beta
[81-96] IPR0002438e-19Peptidase T1A, proteasome beta-subunit
Orthology groupMCL11209 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213495-TA
ATGGCTTTGATAGACTTTTGCCGTTTAGATGAGAAAATTAACTTAAAGCCCGTGGATTCTCTTGCTTCTGTTAACAACGATGTAATGGGTTACACCCAAAACTTTTTAAATAGTGCCCAACTCGCTCTTCCTCCGTTCGCTAATCCGGTAGAAACTCTAGCCCAGTTTAACACCAAAGACGAAACTGGCCGTCAAATTAAAATTGAATTTGATCACGGCACTACCACACTCGGATTTCGGTACAAAGGTGGAGTACTTCTAGCTGTTGATTCCAGAGCGACCGGCGGTCAGTTCATTGGTTCTCAGTCTATGAAAAAGATTGTGGAAATCAATGACTATTTACTGGGTACGTTGGCCGGTGGTGCAGCTGACTGTGTGTACTGGGATCGTGTCCTCGCCAAACAGTGCCGTCTGTATGAGCTCAGGAACAGGGAACGGATCTCAGTAGCGGCGGCCAGCAAACTGATGGCCAACATGGTCTATAACTACAAGGGGATGGGACTCAGTATGGGAATGATGCTTGCCGGTATTGACAAGAGGGGCGCTCAACTGTACTATGTAGACAGTGAAGGTACTCGGACCCCTGGCAAAGTATTCTCTGTTGGATCAGGATCCGTCTATGCATTTGGCGTCCTTGACTCCGGGTACCGCTGGGACCTTGAAGATGAAGAAGCTCAAGAGCTGGGTCGCCGGGCCATCTACCACGCCACCCACCGCGACGCCTACTCCGGCGGTATAATTCGTGTCTATCACATCAATGACAAAGGCTGGGTGAACATTTCCAACGAAGACTGCTCGGAGCTGCACTACAAGTATCAGGCTGAGAAGGGAATTAAGGATGAATAA

Protein sequence:

>DPOGS213495-PA
MALIDFCRLDEKINLKPVDSLASVNNDVMGYTQNFLNSAQLALPPFANPVETLAQFNTKDETGRQIKIEFDHGTTTLGFRYKGGVLLAVDSRATGGQFIGSQSMKKIVEINDYLLGTLAGGAADCVYWDRVLAKQCRLYELRNRERISVAAASKLMANMVYNYKGMGLSMGMMLAGIDKRGAQLYYVDSEGTRTPGKVFSVGSGSVYAFGVLDSGYRWDLEDEEAQELGRRAIYHATHRDAYSGGIIRVYHINDKGWVNISNEDCSELHYKYQAEKGIKDE-