Monarch geneset OGS2.0

DPOGS207763
TranscriptDPOGS207763-TA3216 bp
ProteinDPOGS207763-PA1071 aa
Genomic positionDPSCF300042 - 318860-327456
RNAseq coverage1080x (Rank: top 12%)
Annotation
HeliconiusHMEL0175520.093.38% 
BombyxBGIBMGA005315-TA0.088.68% 
DrosophilaRpn1-PA0.043.18% 
EBI UniRef50UniRef50_G6D9790.0100.00%Putative uncharacterized protein n=7 Tax=Ditrysia RepID=G6D979_DANPL
NCBI RefSeqXP_002085558.10.043.27%GD14837 [Drosophila simulans]
NCBI nr blastpgi|3287169360.060.71%PREDICTED: 26S proteasome non-ATPase regulatory subunit 2-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287169360.058.16%PREDICTED: 26S proteasome non-ATPase regulatory subunit 2-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00005020proteasome complex
GO:00421760regulation of protein catabolic process
GO:00302340enzyme regulator activity
GO:00054887.4e-29binding
KEGG pathwaydsi:Dsim_GD148370.0 
 K03028 (PSMD2, RPN1)maps-> Proteasome
InterPro domain[1-1069] IPR016643026S proteasome regulatory complex, non-ATPase subcomplex, Rpn1 subunit
[25-932] IPR0160247.4e-29Armadillo-type fold
[424-459] IPR0020157e-07Proteasome/cyclosome, regulatory subunit
Orthology groupMCL10602 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207763-TA
ATGACGGTCAAGAATAAGACAGAGGAACCGAAAAAGGAAAAAGTGGAACCAACACCCGCACCAAATGATGACTTGTCCGAAGAAGATAAACGTCTTCAGGAAGAGCTTAATATGCTGGTTGAAAAATTAGTGGGTAACGATGTTGATCTATATTTGCCTGCATTGCAAATGCTAAGCAATCTTATAAGAACCTCTACAACTTCAATGACATCTGTCCCAAAGCCTCTGAAATTTTTGAGAGAGCATTATCCTGCATTAAAACAAGTTTATGAGAAAATTCAAGATGAAAAGACAAAGAAATATTGTGCTGATGTTGTTTCTGTTCTTGCTATGGGTGTCAGCGGTGCTCCAGATGCTGCAGAGAAACGAGAGTGCCTCAAGTATTGTATGCTGGGAACACTTTCAAATGTTGGTGAATGGGGTCATGAATATGTAAGGCAACTTGAGGGTGAGATTGCTGAGGAGTGGAACTTAGACAATATGAACAGCCTCATGCCGCTAGTGTGTGATGTAGTAGCATTTGATATGAAGCACTCGGCCGAGATCCAGGCCTGTGATTTGTTGATGGAGATTGATCAGTTGGATATCCTCTCACAGCACATGGACCAGAGCAACTATCCTCGTGTTTGTCTATATTTGATTGGATGTGCAAGTTATGTAGTGGAGCCGGAATCGACACAAATATTGCAAGGTGTTCTCGACACTTACCTGCGATTTGGTGAATATCCTCGTGCCTTGTTGGTAGCGATGCAACTTCACAACAAGAGTAAATGCGAAGAAGTTTTTAATGCTTGTAATGATCCGTTAATTAAAAAACAACTCTGCTATATGTTGGCTCGTCAGTATGTACCACTGGAATTGGAGGATGAAGATCTCCGCACCATACTGTTGAATGCACATATAAATGATCATTTCCTCAGTTTAGCTAGAGAGCTCGATATAATGGAGCCTAAAACACCAGAAGAGGTTTACAAAACCTGGCTCGAGTCAGCTGGTTCGGCGCTTCGTCCTTCACTGCTTGCTGAACATCCTGTTGACTCAGCGAGACAGAACCTCTCAGCTACCTTCGTGAATGCTTTCGTGAACGCAGGATTCGGAAGAGATAAACTGGTTACAACTGAGGATGGAAATAAATGGATGTACAAGAATAAGGATCATGGTATGTTATCAGCAGCAGCTTCACTCGGTATGATCCATCTGTGGGATGTCGATGGTGGTCTAACCCCCATAGACAAGTATCTCTATACATCTGAAGAACACATCAAGGCGGGCGCGCTTTTGGCCTTGGGACTAGTGAATTGCGGGGTCCGTAATGAGTGCGACCCCGCACTAGCTCTGCTGTCCGATTATGTTCTGCATTCAAGTGCTAATCTCAGAATTGGCAGTATTTTAGGTCTTGGCATAGCCTACGCGGGTACACAGCGTGAGGACGTTCTATCCCACCTTCTGCCAGCGTTAGCTGATACCTCAGCACCTCCAGAGATATGCGCCCTGGCCGCAGTTGCCTGTGGACTCATCGCCGTCGGTTCATGCAATGGAGACGTCACATGTGCTATTATCCAGAGGCTGATAGATGACAACAAAGATTTGCATTCTTCAACATACGCCAGGTTCCTCCATCTGGGTCTAGGATTATGTTTCTTAGGTTGCAAGGAGCGAACTGAAGCCACTATGGCAGCCCTGGAGGTGCTTCCTGAGCCCCAACAGTCTCTATGCCAGACTACACTATCAATGTGTGCATACGCTGGCACTGGGGATGTTCTGGTTGTACAACAGATGCTGCACATATGCTCCAAGCACTATGACACAGATAATGAGCAATCTTCTACTGAAGACACAGCATTTAAGAAACAGGAAAAGAAGGAGTCTAAAGAAGGCGGCAGCGGCAGTGGAAGTTCGTCCAGTGGATCAAAGGATGATAAGAATAAAACTAGAGTTTGGTTCGATACTATTATAACCGTCACATGTGCTATTATCCAGAGGCTGATAGATGACAACAAAGATCTGCATTCTTCAACATACGCCAGGTTCCTCCACCTGGGTCTAGGATTATGTTTCTTAGGTTGCAAGGAGCGAACTGAAGCCACTATGGCAGCTCTGGAGGTGCTTCCCGAGCCCCAACAGTCTCTATGCCAGACTACACTATCAATGTGTGCATACGCTGGCACTGGGGACGTTCTGGTTGTACAACAGATGCTGCACATATGCTCCAAGCATTACGACACAGATAATGAGCAATCTTCTACTGAAGACACAGCATTTAAGAAACAAGAAAAAAAGGAGTCTAAAGAAGGCGGCAGCGGCAGTGGAAGTTCGTCCAGTGGATCAAAGGATGATAAGAACAAAAGCAAGTCAAAGGAGAGTAAGAGTAAGGACAAAGAAAAGGAGAAGGAGAAGGAGGCAAACAAGGAGTTGTCTTCAGTACAGGCTGTGGCTACCCTGGGAGTAGCGGTTATAGCGCTGGCTGAGGAAACTGGAGCTGAAATGTGTACACGAATCTTTGGACAGCTCGGTCGTTACGGCGAGCCGGCCGTCCGTCGCGCGGTGCCCTTGGCGATCGCCCTTTGTTCAGTCTCAAACCCTCAACTGTCAGTCATAGATGTACTGAACAAGTACTCACACGACGCCGACAATGACGTCGCTTACAACGCCATATTCGCCATGGGACTCGTTGGAGCTGGGACAAATAATGCCAGACTGGCGACTATGCTGCGTGCGTTGGCCTTATACCACGGGAAATCCCCGGTTCATTTGTTCATGGTCCGGCTGGCTCAAGGTCTCTGTCACGCTGGTAAGGGCACAGTCACACTGTGTCCGGCTCACTCGGACCGTCGACTTCTCAACCAGCCCGCACTCGCCGGACTGCTTGTTGTACTCACAGCCTTCCTAGACTGCAAGAATATAATCCTCGGTAAATCTCACTACCTCCTGTATGTGTTGGCAACTGCAATGCAACCTCGCTGGTTGGTCACTCTAGATGAAAACCTACAGCCCTTAAACGTTAGCGTCCGTGTTGGACAGGCTGTTGATGTTATTGGTAAGGCCGGTACACCGAAAACCATCGCTGGTTCACACACACATACGACACCTGTGCTGCTATCTTTCGGTGAACGTGCAGAGTTAGCTACTGACGAATATATACCACTATCACCGGTCATGGAAGGATTCGTCATTCTTAAGAAAAATGAAGACAGTGTCATGGCATCTGTCCAGTGA

Protein sequence:

>DPOGS207763-PA
MTVKNKTEEPKKEKVEPTPAPNDDLSEEDKRLQEELNMLVEKLVGNDVDLYLPALQMLSNLIRTSTTSMTSVPKPLKFLREHYPALKQVYEKIQDEKTKKYCADVVSVLAMGVSGAPDAAEKRECLKYCMLGTLSNVGEWGHEYVRQLEGEIAEEWNLDNMNSLMPLVCDVVAFDMKHSAEIQACDLLMEIDQLDILSQHMDQSNYPRVCLYLIGCASYVVEPESTQILQGVLDTYLRFGEYPRALLVAMQLHNKSKCEEVFNACNDPLIKKQLCYMLARQYVPLELEDEDLRTILLNAHINDHFLSLARELDIMEPKTPEEVYKTWLESAGSALRPSLLAEHPVDSARQNLSATFVNAFVNAGFGRDKLVTTEDGNKWMYKNKDHGMLSAAASLGMIHLWDVDGGLTPIDKYLYTSEEHIKAGALLALGLVNCGVRNECDPALALLSDYVLHSSANLRIGSILGLGIAYAGTQREDVLSHLLPALADTSAPPEICALAAVACGLIAVGSCNGDVTCAIIQRLIDDNKDLHSSTYARFLHLGLGLCFLGCKERTEATMAALEVLPEPQQSLCQTTLSMCAYAGTGDVLVVQQMLHICSKHYDTDNEQSSTEDTAFKKQEKKESKEGGSGSGSSSSGSKDDKNKTRVWFDTIITVTCAIIQRLIDDNKDLHSSTYARFLHLGLGLCFLGCKERTEATMAALEVLPEPQQSLCQTTLSMCAYAGTGDVLVVQQMLHICSKHYDTDNEQSSTEDTAFKKQEKKESKEGGSGSGSSSSGSKDDKNKSKSKESKSKDKEKEKEKEANKELSSVQAVATLGVAVIALAEETGAEMCTRIFGQLGRYGEPAVRRAVPLAIALCSVSNPQLSVIDVLNKYSHDADNDVAYNAIFAMGLVGAGTNNARLATMLRALALYHGKSPVHLFMVRLAQGLCHAGKGTVTLCPAHSDRRLLNQPALAGLLVVLTAFLDCKNIILGKSHYLLYVLATAMQPRWLVTLDENLQPLNVSVRVGQAVDVIGKAGTPKTIAGSHTHTTPVLLSFGERAELATDEYIPLSPVMEGFVILKKNEDSVMASVQ-