Monarch geneset OGS2.0

DPOGS200660
Transcript: DPOGS200660-TA  705 bp
Protein: DPOGS200660-PA  234 aa
Genomic position: DPSCF300076 + 886482-889660
RNAseq coverage: 1596x (Rank: top 8%)
Annotation
Heliconius: HMEL004571  1e-102  98.31%
Bombyx: BGIBMGA009562-TA  8e-138  97.86%
Drosophila: Pros25-PA  8e-117  83.33%
EBI UniRef50: UniRef50_P40301  9e-115  83.33%  Proteasome subunit alpha type-2 n=35 Tax=root RepID=PSA2_DROME
NCBI RefSeq: NP_001040344.1  3e-136  97.86%  proteasome 25 kDa subunit [Bombyx mori]
NCBI nr blastp: gi|114052504  4e-135  97.86%  proteasome 25 kDa subunit [Bombyx mori]
NCBI nr blastx: gi|114052504  9e-130  97.86%  proteasome 25 kDa subunit [Bombyx mori]
Group
Gene Ontology:
  GO:0051603  5.2e-61  proteolysis involved in cellular protein catabolic process
  GO:0004298  5.2e-61  threonine-type endopeptidase activity
  GO:0005839  5.2e-61  proteasome core complex
  GO:0004175  4.5e-11  endopeptidase activity
  GO:0019773  4.5e-11  proteasome core complex, alpha-subunit complex
  GO:0006511  4.5e-11  ubiquitin-dependent protein catabolic process
KEGG pathway: aag:AaeL_AAEL006061  5e-122
  K02726 (PSMA2) maps-> Proteasome
InterPro domain:
  [30-212]  IPR001353  5.2e-61  Proteasome, subunit alpha/beta
  [6-28]  IPR000426  4.5e-11  Proteasome, alpha-subunit, conserved site
Orthology group: MCL12743  Single-copy universal gene

Nucleotide sequence:

>DPOGS200660-TA
ATGGCGTCCGAACGTTATAGTTTCTCTCTCACAACTTTTAGTCCCTCTGGAAAATTGGTGCAAATCGAATATGCCTTAGCAGCTGTAGCAGCTGGTGGTACCTCCGTCGGAATTAAAGCATCGAATGGTGTCGTGATCGCAACTGAAAACAAACACAAAAGCATTTTATACGATGAACACAGCGTAAACAAAGTTGAAATGATCACAGGACATATCGGTATGGTATACTCCGGTATGGGCCCGGACTACCGTTTGCTGGTGACCCAGGCTCGTAAGATGGCACAACAGTACTACCTGATGTACCACGAGCCTATCCCCACAGCACAATTGGTGCAACGTGTCGCTACTGTCATGCAAGAGTACACACAGTCTGGAGGAGTTCGTCCGTTTGGCGTCTCTCTATTGATCTGTGGATGGGACAGCGGTAGACCTTACCTGTTCCAATGTGATCCTTCCGGTGCATACTTTGCTTGGAAGGCCACCGCAATGGGGAAGAATTTTAATAACGGGAAGACATTCTTGGAGAAGAGGTACACCGAGGAGTTGGAGCTAGATGACGCTGTCCACACAGCGATCCTGACTCTCAAGGAAGGTTTCGAAGGTCAAATGACGGCTGATAATATAGAGGTCGGGATCTGTGACGCCGCTGGCTTCAGACGACTGGAACCAGCCCACGTCAAGGACTACCTCGCCAATATACCATAA

Protein sequence:

>DPOGS200660-PA
MASERYSFSLTTFSPSGKLVQIEYALAAVAAGGTSVGIKASNGVVIATENKHKSILYDEHSVNKVEMITGHIGMVYSGMGPDYRLLVTQARKMAQQYYLMYHEPIPTAQLVQRVATVMQEYTQSGGVRPFGVSLLICGWDSGRPYLFQCDPSGAYFAWKATAMGKNFNNGKTFLEKRYTEELELDDAVHTAILTLKEGFEGQMTADNIEVGICDAAGFRRLEPAHVKDYLANIP-
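
The lengths reported in this record (705 bp transcript, 234 aa protein) are consistent with a coding sequence that runs from the ATG start codon through the terminal TAA stop codon: 3 x (234 + 1) = 705. The Python sketch below shows one way such a check could be made on FASTA text like the two records above. The function names `parse_fasta` and `lengths_consistent` are hypothetical helpers introduced here for illustration, and the sketch assumes the transcript contains only coding sequence (no UTRs) and that a trailing `-` or `*` in the protein marks the stop codon.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict mapping record name -> sequence string."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # Use the first whitespace-delimited token after '>' as the name.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def lengths_consistent(cds, protein):
    """Return True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumes `cds` is coding sequence only and that a trailing '-' or '*'
    in `protein` stands for the stop codon rather than a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


if __name__ == "__main__":
    # Toy input (not the monarch sequences): ATG AAA TAA -> M K stop
    toy = ">toy-TA\nATGAAATAA\n>toy-PA\nMK-\n"
    recs = parse_fasta(toy)
    print(lengths_consistent(recs["toy-TA"], recs["toy-PA"]))
```

To apply it to this entry, pass the full text of the two FASTA records to `parse_fasta` and compare `DPOGS200660-TA` against `DPOGS200660-PA`.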