Monarch geneset OGS2.0

DPOGS207864
Transcript: DPOGS207864-TA, 768 bp
Protein: DPOGS207864-PA, 255 aa
Genomic position: DPSCF300101 - 455886-458524
RNAseq coverage: 1045x (Rank: top 12%)
Annotation
Source           Best hit          E-value  Identity  Description
Heliconius       HMEL006406        4e-142   92.16%
Bombyx           BGIBMGA004657-TA  7e-31    33.49%
Drosophila       Prosalpha7-PA     2e-85    59.76%
EBI UniRef50     UniRef50_P25788   4e-101   66.27%    Proteasome subunit alpha type-3 n=128 Tax=Eukaryota RepID=PSA3_HUMAN
NCBI RefSeq      NP_001040387.1    5e-141   92.94%    proteasome alpha 3 subunit [Bombyx mori]
NCBI nr blastp   gi|114051245      9e-140   92.94%    proteasome alpha 3 subunit [Bombyx mori]
NCBI nr blastx   gi|114051245      1e-133   92.94%    proteasome alpha 3 subunit [Bombyx mori]
Group
Gene Ontology:
  GO:0051603  5.8e-54  proteolysis involved in cellular protein catabolic process
  GO:0004298  5.8e-54  threonine-type endopeptidase activity
  GO:0005839  5.8e-54  proteasome core complex
  GO:0004175  3.9e-12  endopeptidase activity
  GO:0019773  3.9e-12  proteasome core complex, alpha-subunit complex
  GO:0006511  3.9e-12  ubiquitin-dependent protein catabolic process
KEGG pathway:
  ame:408986  4e-112
  K02727 (PSMA3), maps to Proteasome
InterPro domain:
  [31-216]  IPR001353  5.8e-54  Proteasome, subunit alpha/beta
  [8-30]    IPR000426  3.9e-12  Proteasome, alpha-subunit, conserved site
Orthology group: MCL15354 (single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207864-TA
ATGAGCTCTATCGGAACCGGTTACGATCTATCGGCATCACAATTTTCACCTGACGGCAGAGTATTTCAAGTGGAATACGCTGCTAAGGCCGTAGAAAACTCCGGCACAGTCATAGGCTTACGAGGAAAGGATGGTGTAGTTTTTGCAGTCGAAAAACTTGTTTCTTCTAAACTTTATGAACCAGGTGCAAATAAGAGGATTTTCCATATTGATGAACATGTTGGCATGGCTGTTGCGGGTCTTATATCTGATGCACGACAAATTGTAGAGACAGCAAGATCTGAAGCCTCCAACTATAGATCTCAGTACGGCGTGCCAGTACCCTTGAAGTATCTTAATGAACGTGTATCAATGTATATGCATGCATACACTCTCTACAGTGCTGTACGTCCGTATGGATGCTCGGTTATAATGGGTACTTGGACAGATTACGAAGGACCCCAAATGTATATGCTAGAACCTAGTGGTGTTTCATTTTCTTACTTTGGATGTGCTGTTGGTAAAGCAAAGCAGGCTGCGAAGACGGAAATTGAAAAACTTAAACTTGCCGATTTAACTGTGAAGGAGTTGGTGAAGGAAGCGGCTAGAATAATATACATAGTCCACGACGAGTTAAAAGACAAACAGTTCGAGTTAGAACTATCCTGGGTGTGTAAAGACAGTAATGGCCGCCATCAGCTCGTACCCAAAGATATGGCTGTAGAAGCTGAGAACCTCGCCAAGCAAGCACTCGCTGACATTGAAGACTCCGATGAAGGGGATATGTAG

Protein sequence:

>DPOGS207864-PA
MSSIGTGYDLSASQFSPDGRVFQVEYAAKAVENSGTVIGLRGKDGVVFAVEKLVSSKLYEPGANKRIFHIDEHVGMAVAGLISDARQIVETARSEASNYRSQYGVPVPLKYLNERVSMYMHAYTLYSAVRPYGCSVIMGTWTDYEGPQMYMLEPSGVSFSYFGCAVGKAKQAAKTEIEKLKLADLTVKELVKEAARIIYIVHDELKDKQFELELSWVCKDSNGRHQLVPKDMAVEAENLAKQALADIEDSDEGDM-
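
The transcript and protein records above can be checked against each other: a 768 bp coding sequence is 256 codons, which is 255 amino acids plus one stop codon. The following is a minimal sketch of that check, assuming the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code. The names `translate` and `expected_protein_length` are illustrative and do not come from the database. The sketch writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: the standard code applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a complete CDS that ends in one stop codon."""
    return cds_length_bp // 3 - 1


# First ten and last four codons of DPOGS207864-TA, copied from the record.
print(translate("ATGAGCTCTATCGGAACCGGTTACGATCTA"))  # MSSIGTGYDL
print(translate("GGGGATATGTAG"))                    # GDM*
print(expected_protein_length(768))                 # 255
```

Both translated fragments match the start (MSSIGTGYDL) and end (GDM followed by the stop) of DPOGS207864-PA, and the expected length matches the 255 aa listed for the protein.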