DPGLEAN10642 in OGS1.0

New model in OGS2.0DPOGS205492 
Genomic Positionscaffold494:+ 74317-77905
See gene structure
CDS Length3003
Paired RNAseq reads  5680
Single RNAseq reads  15690
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008426 (2e-170)
Best Drosophila hit  Rpn2 (0.0)
Best Human hit26S proteasome non-ATPase regulatory subunit 1 isoform 1 (0.0)
Best NR hit (blastp)  26S proteasome non-ATPase regulatory subunit 1 [Culex quinquefasciatus] (0.0)
Best NR hit (blastx)  PREDICTED: similar to 26S proteasome non-ATPase regulatory subunit 1 isoform 1 [Tribolium castaneum] (0.0)
GeneOntology terms






  
GO:0008540 proteasome regulatory particle, base subcomplex
GO:0006508 proteolysis
GO:0005838 proteasome regulatory particle
GO:0004175 endopeptidase activity
GO:0042176 regulation of protein catabolic process
GO:0030234 enzyme regulator activity
GO:0005488 binding
GO:0006974 response to DNA damage stimulus
InterPro families


  
IPR016024 Armadillo-type fold
IPR002015 Proteasome/cyclosome, regulatory subunit
IPR011989 Armadillo-like helical
IPR016642 26S proteasome regulatory complex, non-ATPase subcomplex, Rpn2/Psmd1 subunit
Orthology groupMCL13533

Nucleotide sequence:

ATGAATATCACATCCGCTGCGGGGATAATATCCCTGCTTGATGAGCCTATGCATGAAGTG
AAAAAGTTCGCTCTAAAGAGATTAGACAACATCGTAGACGAATTCTGGCCGGAAATTTCT
GAATCAATTGAAAAGATTGAAATTCTACACGAAGATAAAGTTTTTTCACAGCACCAACTG
GCTGCTTTAGTGGCAAGCAAAGTTTATTATCATTTGGGTGCATTCGAAGACTCTCTAACA
TACGCGCTAGGAGCTGGGGAGTTGTTCGATGTAAACGCAAGGAACGAATATGTCGACACC
ACGATTGCAAAGGCCATCGATTTCTATACACAGAAACGTAAAGCCCTGTTCATTGATAGT
GCCTGTGAGACCATTGACCCTCGACTGGAGGCTATCGTGAACCGAATGTTCCAACGGTGT
CTTGATGACGGTCAATACAGGCAGGCCCTCGGTCTGGCGTTAGAGACGCGACGAATGGAC
ATCTTTGAAGAATCTATTATGAAGTCTGATGATGTGTCAGGCATGCTGCAGTACGCATTT
ACTGTGGCAATGAGTCTTCTCCAGAACAGAGGTTTCCGAAGCACAGTCCTCCGATCTTTA
GTAGGTTTATACCGAGGTCTGAACATTCCAGACTATGTCAATATGTGTCAATGTCTCATC
TTCTTAGAGGATCCATTATCAGTCGCAGAAATCTTGGACAAATTAACTCACGGCCCACAA
GATTCAGTACTAATGGCTTACCAAATTGCATTTGATCTGTATGACTCAGCAACTCAACAA
TTTTTAGGGCGAGTTCTACAGGCCCTTAGAATTACAGCACCTATACCGAGTGCTTTGGGC
GGTAAACCCCAACCGCAAGGTGGCCCTTTCCCAGAATCTACTATGGAGGTAGATCAATCA
CCCTCTGAAGAACCTAAGAAACCAGAACGGGATATTGATAGTCTTAATGATGAGGAAAAG
GAACATCAGAGGAGAGTAGAAAAATTAATATCTATATTAGGAGGAGATGTATCTATAGGT
TTACAATTACAGTTCTTAATTAGGTCTAATCATGCGGATATGTTAATTCTAAAGAACACG
AAAGATGCCATCAGAGTTTCAATCTGCCACACAGCTACGGTTATAGCTAATGCATTCATG
CACGCTGGCACGACAAGCGATCAGTTCTTAAGGGATAACTTGGAATGGTTGGCGCGAGCG
ACGAATTGGGCAAAACTGACAGTGACGGCTTCGCTTGGCGTAATACACAGAGGCCACGAG
AACGAATCCTTGGCTCTCATGCAATCTTATCTGCCCAAAGAGGCTGGGCCATCATCTGGC
TATTCTGAAGGTGGCGGCTTATACGCATTAGGTTTGATTCATGCAAACCACGGCGCCAAT
ATCATTGATTATCTTTTAACTCAATTAAAAGACGCTCAGAATGAAATGGTTCGCCACGGA
GGCTGTCTGGGTCTCGGTTTAGCTGCAATGGGCACTCACCGGCAGGACGTTTACGAACAA
CTCAAGTTCAACCTATACCAAGACGACGCAGTTACCGGTGAAGCTGCTGGTATTGCTATG
GGAATGGTCATGTTGGGTTCCCGCAACGCTGCCGCCATCGAGGACATGGTCGCCTACGCC
CAGGAGACTCAACACGAAAAGATTTTGCGTGGTTTAGCCGTCGGCATATCCTTCACCATG
TACGGACGGCTGGAAGAGGCTGATGCTCTCGTCCAACAGCTATTGAGAGATAAGGATCCG
TTATTGCGTCGAGCTGGTTGTTACACCATAGCTACAGCCTACTGCGGCACTGGCAATAAC
GATTCAATTCGTACATTACTTCACGTGGCCGTTTCTGACGTGAACGACGACGTCCGCCGC
GCTGCTGTAACTGCTTTAGGATTCCTACTGTTCAGAACGCCCGAACAATGTCCGTCTGTG
GTGTCACTATTGGCGGAGTCCTACAATCCTCATGTACGGTACGGCGCTGCTATGGCCTTG
GGTATCGCATGCGCCGGTACTGGGAATCGTGAAGCTATCGGACTTCTAGAACCTATGGTC
AAATTTGACCCTGTTAATTTCGTCAGACAAGGAGCGCTTATAGCATCGGCGATGATTTTG
ATTCAGCAGACCGAGGCGCTATGTCCCAAAGTTACATACTTCCGTACGCTTTATTCACAA
GTAATTTCAAACAAACACGAAGATGTTATGGCCAAATTTGGGGCTATATTGGCCCAAGGT
ATCATAGATGCAGGTGGACGGAATGTAACAGTCTCCCTTCAGAACAGAACCGGTCACATG
AATATGTTGGCTGTTGTTGGCATGCTAGTATTCACTCAATACTGGTACTGGTTCCCGTTG
GCTCATTGCCTATCACTTGCTTTTACGCCCACATGCGTGATTGCCCTAAATTCCGATTTA
AAAATGCCACTACTGGAAATGAAATCCAACGCTAAACCATCGCTGTACGCCTACCCAGCA
CCGCTTGAAGAAAAGAAACGCGAAGAAAGAGAAAGAGTCACCACTGCCGTACTAAGTATT
GCCGCAGCCAGAGCGCGCAGACGAGCTCACGGAACAGAGGGTTCCGCTAGCAGTAGTGTG
ACGTCATCGACCACATCTAAGATGGATGTCGATGAAGAAGAGAAGAAGCCTTCCAAATCA
CCAAACCCAAATATAACAGTTCACGGTAAATCCGATAAAGATGCCGGATCGTCGAAAGAA
GGCAAGAAAGACGAAAAGGAAGCAGAAGAAAAAGATGTCAAGGAGAAGAAGGAACCGGAA
CCAAACTTTGAAATTCTCAGCAACCCAGCCAGGGTTATGCGTCAACAACTAAAAACTCTG
ACAGTTGTTGAGGGTTCCGGATACATGCCTTTGAAGGACGTCACTATTGGCGGTATCGTA
ATGTTGAATCATACGGGAGACAGTGAACAAGTGCTTGTGGAACCTGTCGCTGCTTTTGGT
CCGAAAGCTGAAGAAGAAAAAGAACCTGAACCTCCTGAACCATTTGAATACTTGGACGAA
TGA

Protein sequence:

MNITSAAGIISLLDEPMHEVKKFALKRLDNIVDEFWPEISESIEKIEILHEDKVFSQHQL
AALVASKVYYHLGAFEDSLTYALGAGELFDVNARNEYVDTTIAKAIDFYTQKRKALFIDS
ACETIDPRLEAIVNRMFQRCLDDGQYRQALGLALETRRMDIFEESIMKSDDVSGMLQYAF
TVAMSLLQNRGFRSTVLRSLVGLYRGLNIPDYVNMCQCLIFLEDPLSVAEILDKLTHGPQ
DSVLMAYQIAFDLYDSATQQFLGRVLQALRITAPIPSALGGKPQPQGGPFPESTMEVDQS
PSEEPKKPERDIDSLNDEEKEHQRRVEKLISILGGDVSIGLQLQFLIRSNHADMLILKNT
KDAIRVSICHTATVIANAFMHAGTTSDQFLRDNLEWLARATNWAKLTVTASLGVIHRGHE
NESLALMQSYLPKEAGPSSGYSEGGGLYALGLIHANHGANIIDYLLTQLKDAQNEMVRHG
GCLGLGLAAMGTHRQDVYEQLKFNLYQDDAVTGEAAGIAMGMVMLGSRNAAAIEDMVAYA
QETQHEKILRGLAVGISFTMYGRLEEADALVQQLLRDKDPLLRRAGCYTIATAYCGTGNN
DSIRTLLHVAVSDVNDDVRRAAVTALGFLLFRTPEQCPSVVSLLAESYNPHVRYGAAMAL
GIACAGTGNREAIGLLEPMVKFDPVNFVRQGALIASAMILIQQTEALCPKVTYFRTLYSQ
VISNKHEDVMAKFGAILAQGIIDAGGRNVTVSLQNRTGHMNMLAVVGMLVFTQYWYWFPL
AHCLSLAFTPTCVIALNSDLKMPLLEMKSNAKPSLYAYPAPLEEKKREERERVTTAVLSI
AAARARRRAHGTEGSASSSVTSSTTSKMDVDEEEKKPSKSPNPNITVHGKSDKDAGSSKE
GKKDEKEAEEKDVKEKKEPEPNFEILSNPARVMRQQLKTLTVVEGSGYMPLKDVTIGGIV
MLNHTGDSEQVLVEPVAAFGPKAEEEKEPEPPEPFEYLDE