Monarch geneset OGS2.0

DPOGS213746
TranscriptDPOGS213746-TA846 bp
ProteinDPOGS213746-PA281 aa
Genomic positionDPSCF300212 - 776926-779187
RNAseq coverage1092x (Rank: top 12%)
Annotation
HeliconiusHMEL0139106e-7592.96% 
BombyxBGIBMGA009234-TA3e-11480.49% 
DrosophilaPros35-PA2e-9062.92% 
EBI UniRef50UniRef50_P257864e-9966.28%Proteasome subunit alpha type-1 n=94 Tax=Opisthokonta RepID=PSA1_HUMAN
NCBI RefSeqXP_002422679.16e-11574.72%proteasome subunit alpha type, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420032831e-11374.72%proteasome subunit alpha type, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420032832e-10974.72%proteasome subunit alpha type, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00516039e-54proteolysis involved in cellular protein catabolic process
GO:00042989e-54threonine-type endopeptidase activity
GO:00058399e-54proteasome core complex
GO:00041751.2e-13endopeptidase activity
GO:00197731.2e-13proteasome core complex, alpha-subunit complex
GO:00065111.2e-13ubiquitin-dependent protein catabolic process
KEGG pathwayphu:Phum_PHUM0057202e-114 
 K02725 (PSMA1)maps-> Proteasome
InterPro domain[29-215] IPR0013539e-54Proteasome, subunit alpha/beta
[6-28] IPR0004261.2e-13Proteasome, alpha-subunit, conserved site
Orthology groupMCL11159 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213746-TA
ATGTTCCGCAACCAGTATGACAGTGATGTCACAGTGTGGAGCCCTCAGGGTCGACTCCATCAAGTGGAGTACGCTATGGAAGCTGTAAAACTGGGCTCAGCCACGGTTGGACTTAAGAATAAACAATTTGCCGTCCTAATAGCTCTCAAGCGGGCCGTGAGTGAACTGTCAGCTTATCAGAAGAAGATCATTCCTATTGATGATCATATTGGTATTTCAATATCTGGATTGACTGCTGATGCTAGAATGCTAAGCCGTTTCATGAGGACGGAATGCCTCAATCATCGTTATGCTCATGACGCACCAATGCCGGTTGGCAGGTTAATTTCTCTTGTCGGCAACAAGATGCAGATATGTACCCAGCGGTATGATAAGCGACCTTTAGGAGTTGGCCTGCTGGTTGCTGGGTATGATGATCAAGGTCCTCATATCTACCAGACATGTCCTTCGGCCAATTACTTTGACTGCCGTGCCATGGCTATCGGTGCACGATCGCAATCCGCAAGAACTTACTTGGAGAAACATCTTAACACGTTCCTCGATTGTGATCTGAATGAACTCGTAGCTCATGGCCTAAGAGCTTTAAGAGATACACTGCCAAATGAGGTTGATCTCAACAACAAGAATGTATCCATTGCGATTGTTGGGCCCAATACGCCTTTAAGGATTGCCGAAGAGCCCGACCTGACTCGTTACCTGTCTCTGGTGGAGGGAGAGGAACGTCGCGGCGGGGCCAGTGCTACGGGGGACGCGGGGGGCCCAGTGGAGGGGGAGGGGGAACCAGCACCTCAGAACAGTTTTGCATTTATCACGGACAAGACAGCTGGATATGATAATGTTCGTTAA

Protein sequence:

>DPOGS213746-PA
MFRNQYDSDVTVWSPQGRLHQVEYAMEAVKLGSATVGLKNKQFAVLIALKRAVSELSAYQKKIIPIDDHIGISISGLTADARMLSRFMRTECLNHRYAHDAPMPVGRLISLVGNKMQICTQRYDKRPLGVGLLVAGYDDQGPHIYQTCPSANYFDCRAMAIGARSQSARTYLEKHLNTFLDCDLNELVAHGLRALRDTLPNEVDLNNKNVSIAIVGPNTPLRIAEEPDLTRYLSLVEGEERRGGASATGDAGGPVEGEGEPAPQNSFAFITDKTAGYDNVR-