Monarch geneset OGS2.0

DPOGS211071
TranscriptDPOGS211071-TA1488 bp
ProteinDPOGS211071-PA495 aa
Genomic positionDPSCF300007 - 1435654-1437141
RNAseq coverage967x (Rank: top 13%)
Annotation
HeliconiusHMEL0093810.091.31% 
BombyxBGIBMGA002953-TA0.091.31% 
DrosophilaRpn3-PA0.070.55% 
EBI UniRef50UniRef50_O432421e-16260.16%26S proteasome non-ATPase regulatory subunit 3 n=106 Tax=Eumetazoa RepID=PSMD3_HUMAN
NCBI RefSeqXP_001605959.10.076.49%PREDICTED: similar to 26S proteasome regulatory subunit S3 [Nasonia vitripennis]
NCBI nr blastpgi|1565427930.076.49%PREDICTED: probable 26S proteasome non-ATPase regulatory subunit 3-like [Nasonia vitripennis]
NCBI nr blastxgi|910828830.077.22%PREDICTED: similar to 26S proteasome regulatory subunit S3 [Tribolium castaneum]
Group
Gene OntologyGO:00005025.7e-28proteasome complex
GO:00421765.7e-28regulation of protein catabolic process
GO:00302345.7e-28enzyme regulator activity
GO:00055151.7e-21protein binding
KEGG pathwaynvi:1001140550.0 
 K03033 (PSMD3, RPN3)maps-> Proteasome
InterPro domain[182-354] IPR0131432.6e-70PCI/PINT associated module
[426-492] IPR0135865.7e-2826S proteasome regulatory subunit, C-terminal
[354-444] IPR0007171.7e-21Proteasome component (PCI) domain
[362-422] IPR0119911.5e-06Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL13253 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211071-TA
ATGGCTCCTGTGGATCAAGATGTGGAAATGAAGAGTGTAGACAGCCCAGCAGCCGCAGGTTCCGAAGAAACTAGTGATATTAAAAAAGATGTCGATGTTCTTGCTGTTCAGGATTTAAGAGAACATGTTCGACAAATTGACAAAGCTGTTTCATCTAAGGAACCGCGATTTGCTATGCGAGTTTTAAGATCTCTTCCAAGTACCAGAAGGAAGCTTAATGGTAATGTTCTTCGTGCTATTATAAACCAACTTTATCCTGCTGGCAATGAAAAAGAAGCTTTAATGTGCTTTGTTGAAAATCCCTTACCCACTGCAGTGGAAATTGAAACGCCTCGCTCTAGAAGTGCACCGAAGCAACCAGTGCCAGAGGTTGATGTTTATATTCACTTGCTGGTGTTACTTCGCTTACTTGACACTAAAAAATTAGAAGAAGCTGTAGAATGTTCGCAACAACTTATGAATAAAATTGTTGTCCATAATCGCAGAACTTTAGATTTAATTGCTGCTAAATGCTATTTTTACCATTCTATGGTGTTTGAATTGAACAACAAGTTGGACTTTATTAGAGGTTTGTTACATGCAAGGCTTCGCACATCTACCTTGAGAAGTGACTATGAGGGCCAGGCAGTGTTAATTAACTGTCTGTTAAGAAATTACTTGCATTACTCCTTGTATGAGCAAGCGGATAAGTTAGTGAGTAAGTCAGTTTTTCCGGAAAATGCAAGTAATAATGAATGGGCCAGGTTTTTATATTATCTTGGCAGAATTAAAGCTGCAAGACTTGAATATAGTGATGCACATAAACATTTAGTTCAAGCTTTACGTAAAGCTCCACAAACTGCTGCTGTAGGGTTCCGTCAGACTGTTCAGAAGTTAGCAATTGTGGTTGAACTATTGTTGGGTGATATTCCAGAACGTGCTATTTTCCGTCAAGCTCCGCTCCGGAAATCATTAGCACCATATTTCCAACTCACCCAAGCAGTAAGATTGGGAAATTTGCAAAGATTTGGGGAGGTCCTAGAAAATTATGGGCCACAATTTCGTAATGATCACACCTTCACACTAATCCTCCGTTTACGTCAAAATGTTATTAAGACAGCTATCCGTTCAATCGGACTGTCATATTCCCGGATATCTCCAAAGGATATTGCTCGCAAACTTGGTTTAGACTCTGCTGAAGACGCTGAGTTTATTGTAGCTAAAGCTATCCGTGACGGAGTTATTGAAGCTACACTTGACCCAGAAAAGGGATACATGAGTAACAAAGAGAGTTCTGATATATATTGTACTAGAGAACCGCAGTTGGCTTTCCACCAGCGTATTTATTTCTGCCTTGAATTACACAATCAGAGTGTTAAAGCGATGAGATATCCTCCAAAATCCTACGGAAAAGAGTTGGAGAGTGCAGAAGAAAGACGGGAACGAGAACAACAAGACCTGGAACTGGCCAAGGAAATGGCTGAAGAGGATGATGATGGTTTCCCTTAA

Protein sequence:

>DPOGS211071-PA
MAPVDQDVEMKSVDSPAAAGSEETSDIKKDVDVLAVQDLREHVRQIDKAVSSKEPRFAMRVLRSLPSTRRKLNGNVLRAIINQLYPAGNEKEALMCFVENPLPTAVEIETPRSRSAPKQPVPEVDVYIHLLVLLRLLDTKKLEEAVECSQQLMNKIVVHNRRTLDLIAAKCYFYHSMVFELNNKLDFIRGLLHARLRTSTLRSDYEGQAVLINCLLRNYLHYSLYEQADKLVSKSVFPENASNNEWARFLYYLGRIKAARLEYSDAHKHLVQALRKAPQTAAVGFRQTVQKLAIVVELLLGDIPERAIFRQAPLRKSLAPYFQLTQAVRLGNLQRFGEVLENYGPQFRNDHTFTLILRLRQNVIKTAIRSIGLSYSRISPKDIARKLGLDSAEDAEFIVAKAIRDGVIEATLDPEKGYMSNKESSDIYCTREPQLAFHQRIYFCLELHNQSVKAMRYPPKSYGKELESAEERREREQQDLELAKEMAEEDDDGFP-