Monarch geneset OGS2.0

DPOGS202926
TranscriptDPOGS202926-TA1491 bp
ProteinDPOGS202926-PA496 aa
Genomic positionDPSCF300220 - 53676-59300
RNAseq coverage870x (Rank: top 15%)
Annotation
HeliconiusHMEL0078740.081.47% 
BombyxBGIBMGA001915-TA0.074.95% 
DrosophilaPros29-PA3e-11077.96% 
EBI UniRef50UniRef50_E2A3M24e-12785.99%Proteasome subunit alpha type n=13 Tax=Eukaryota RepID=E2A3M2_CAMFO
NCBI RefSeqXP_968456.16e-13187.75%PREDICTED: similar to proteasome alpha 4 subunit [Tribolium castaneum]
NCBI nr blastpgi|531484613e-13796.11%proteasome alpha 4 subunit [Plutella xylostella]
NCBI nr blastxgi|531484613e-13796.48%proteasome alpha 4 subunit [Plutella xylostella]
Group
Gene OntologyGO:00516034.9e-55proteolysis involved in cellular protein catabolic process
GO:00042984.9e-55threonine-type endopeptidase activity
GO:00058394.9e-55proteasome core complex
GO:00041752.9e-14endopeptidase activity
GO:00197732.9e-14proteasome core complex, alpha-subunit complex
GO:00065112.9e-14ubiquitin-dependent protein catabolic process
GO:00160213.9e-14integral to membrane
KEGG pathwaytca:6568632e-130 
 K02728 (PSMA4)maps-> Proteasome
InterPro domain[267-454] IPR0013534.9e-55Proteasome, subunit alpha/beta
[244-266] IPR0004262.9e-14Proteasome, alpha-subunit, conserved site
[93-224] IPR0048773.9e-14Cytochrome b561, eukaryote
Orthology groupMCL14254 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202926-TA
ATGGATAAACCTAGCACTAGTCGTCAATCGATCGAGAAGGTGGAAATATTTGAAGTCCGAGAGCATGTTCCTTATCAAGCAGAAGTGACAAGGGCACCGGTAGCATATGATGAAGAAAGTGGTGACACTTATAACAGCTCGTCCTGGAGTTCCGCGTGGAGAGCCATCTGCCAGCTGGTCAATCTTCTTCATCATATGCAGATAGCCATTGTAGTGTTCTGCTTATGGCGGTTTGCCTTAACCTCCCAGCCAAATGGATCGATCACGAATTTGCAACTTCATATTGTTTTTGCGGGAACTGGTTACCAACTGTTTCTGGTCGAATCTGTGCTGACATTGCACAGACACAACTCGTGGTCATTTCAGTTACGTCAGGACAGCAAAAGGATAATTCACGGATGTCTCCAACTCGTTGGCTCGCTTTTCGTCATGGCGGGAACGTTTCTCGCATTGTCAGAAGTTAAAATGGTGATCAACACAGCCCATGGAATTTGCGGTGTCATCGCTCTCACATTTACCCTCATAAGCTTCGTGTCCGGAATTCTAGCTTTGTTTTCGTCAAAAATACGATTACTCGTAAAAAGTGGTCCTATCAAAATTCTTCACATAGCTGTGGGACTATTTGCAGTAACTATGGGTCTGGTCACCATGATTATGGGCTTCAATATGGATTATTTCAGCGCTACACAAGGAGAGCTATCCACGGCTCTGATGGTTTTCGCTCGTCGTTATGATACGAGAACTACAATCTTTTCGCCTGAAGGTAGATTATACCAAGTTGAATATGCCATGGAAGCTATCAGCCATGCTGGAACTTCCCTTGGTATTTTAGCCACTGACGGAATTCTGTTGGCAGCAGAAAGAAGAAACACAAATAAACTTTTAGATGAAGTGTTCTTTTCTGAAAAAATTTATAAATTGAATGATGACATGGTATGCTCTGTTGCCGGTATCACTTCTGATGCGAATGTCCTAACTAATGAACTGCGTCTGATTGCTCAGAGGTATCTTTTGCAATATGGTGAATCCATACCATGTGAACAATTGGTTTCATGGCTCTGTGATGTTAAACAGGCATACACACAGTATGGAGGTAAAAGACCATTTGGTGTATCAATCCTATACATGGGTTGGGACAAGCACTATGGTTACCAACTCTACCAGTCTGATCCCAGCGGTAACTATGGAGGGTGGAAGGCCACTTGCATTGGGAATAATAGTGCTGCCGCTGTTTCAAGTCTTAAGCAGGAGTACAAGGAAAATGAAACTACTTTAGCCGAAGCTCAGGCTCTAGCCATCAAAGTTCTCAGCAAGACATTGGATATGACAAAACTTTCGCCAGAAAAAGTTGAAATGGCTACACTTACGCGCAAAGACAATAAAACAATAATAAGAATTCTTACAAGTGCTGAAGTTGAAAAACTCATTCAAGACTTTGAAAAGAGTGAAGCTGAAGCCGAAGCTGCCAAGAAACAGCCTCCAAAGTCGTAA

Protein sequence:

>DPOGS202926-PA
MDKPSTSRQSIEKVEIFEVREHVPYQAEVTRAPVAYDEESGDTYNSSSWSSAWRAICQLVNLLHHMQIAIVVFCLWRFALTSQPNGSITNLQLHIVFAGTGYQLFLVESVLTLHRHNSWSFQLRQDSKRIIHGCLQLVGSLFVMAGTFLALSEVKMVINTAHGICGVIALTFTLISFVSGILALFSSKIRLLVKSGPIKILHIAVGLFAVTMGLVTMIMGFNMDYFSATQGELSTALMVFARRYDTRTTIFSPEGRLYQVEYAMEAISHAGTSLGILATDGILLAAERRNTNKLLDEVFFSEKIYKLNDDMVCSVAGITSDANVLTNELRLIAQRYLLQYGESIPCEQLVSWLCDVKQAYTQYGGKRPFGVSILYMGWDKHYGYQLYQSDPSGNYGGWKATCIGNNSAAAVSSLKQEYKENETTLAEAQALAIKVLSKTLDMTKLSPEKVEMATLTRKDNKTIIRILTSAEVEKLIQDFEKSEAEAEAAKKQPPKS-