Monarch geneset OGS2.0

DPOGS205492
TranscriptDPOGS205492-TA3003 bp
ProteinDPOGS205492-PA1000 aa
Genomic positionDPSCF300166 + 361889-365477
RNAseq coverage1498x (Rank: top 9%)
Annotation
HeliconiusHMEL0177430.095.90% 
BombyxBGIBMGA008426-TA0.082.90% 
DrosophilaRpn2-PA0.072.54% 
EBI UniRef50UniRef50_B0WHS60.074.47%26S proteasome non-ATPase regulatory subunit 1 n=12 Tax=Eumetazoa RepID=B0WHS6_CULQU
NCBI RefSeqXP_001848260.10.074.47%26S proteasome non-ATPase regulatory subunit 1 [Culex quinquefasciatus]
NCBI nr blastpgi|1700409840.074.47%26S proteasome non-ATPase regulatory subunit 1 [Culex quinquefasciatus]
NCBI nr blastxgi|910820730.077.71%PREDICTED: similar to 26S proteasome non-ATPase regulatory subunit 1 isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00005020proteasome complex
GO:00421760regulation of protein catabolic process
GO:00302340enzyme regulator activity
GO:00054883.1e-37binding
KEGG pathwaycqu:CpipJ_CPIJ0067020.0 
 K03032 (PSMD1, RPN2)maps-> Proteasome
InterPro domain[1-1001] IPR016642026S proteasome regulatory complex, non-ATPase subcomplex, Rpn2/Psmd1 subunit
[9-747] IPR0160243.1e-37Armadillo-type fold
[564-753] IPR0119893.3e-18Armadillo-like helical
[655-690] IPR0020152.8e-10Proteasome/cyclosome, regulatory subunit
Orthology groupMCL13806 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205492-TA
ATGAATATCACATCCGCTGCGGGGATAATATCCCTGCTTGATGAGCCTATGCATGAAGTGAAAAAGTTCGCTCTAAAGAGATTAGACAACATCGTAGACGAATTCTGGCCGGAAATTTCTGAATCAATTGAAAAGATTGAAATTCTACACGAAGATAAAGTTTTTTCACAGCACCAACTGGCTGCTTTAGTGGCAAGCAAAGTTTATTATCATTTGGGTGCATTCGAAGACTCTCTAACATACGCGCTAGGAGCTGGGGAGTTGTTCGATGTAAACGCAAGGAACGAATATGTCGACACCACGATTGCAAAGGCCATCGATTTCTATACACAGAAACGTAAAGCCCTGTTCATTGATAGTGCCTGTGAGACCATTGACCCTCGACTGGAGGCTATCGTGAACCGAATGTTCCAACGGTGTCTTGATGACGGTCAATACAGGCAGGCCCTCGGTCTGGCGTTAGAGACGCGACGAATGGACATCTTTGAAGAATCTATTATGAAGTCTGATGATGTGTCAGGCATGCTGCAGTACGCATTTACTGTGGCAATGAGTCTTCTCCAGAACAGAGGTTTCCGAAGCACAGTCCTCCGATCTTTAGTAGGTTTATACCGAGGTCTGAACATTCCAGACTATGTCAATATGTGTCAATGTCTCATCTTCTTAGAGGATCCATTATCAGTCGCAGAAATCTTGGACAAATTAACTCACGGCCCACAAGATTCAGTACTAATGGCTTACCAAATTGCATTTGATCTGTATGACTCAGCAACTCAACAATTTTTAGGGCGAGTTCTACAGGCCCTTAGAATTACAGCACCTATACCGAGTGCTTTGGGCGGTAAACCCCAACCGCAAGGTGGCCCTTTCCCAGAATCTACTATGGAGGTAGATCAATCACCCTCTGAAGAACCTAAGAAACCAGAACGGGATATTGATAGTCTTAATGATGAGGAAAAGGAACATCAGAGGAGAGTAGAAAAATTAATATCTATATTAGGAGGAGATGTATCTATAGGTTTACAATTACAGTTCTTAATTAGGTCTAATCATGCGGATATGTTAATTCTAAAGAACACGAAAGATGCCATCAGAGTTTCAATCTGCCACACAGCTACGGTTATAGCTAATGCATTCATGCACGCTGGCACGACAAGCGATCAGTTCTTAAGGGATAACTTGGAATGGTTGGCGCGAGCGACGAATTGGGCAAAACTGACAGTGACGGCTTCGCTTGGCGTAATACACAGAGGCCACGAGAACGAATCCTTGGCTCTCATGCAATCTTATCTGCCCAAAGAGGCTGGGCCATCATCTGGCTATTCTGAAGGTGGCGGCTTATACGCATTAGGTTTGATTCATGCAAACCACGGCGCCAATATCATTGATTATCTTTTAACTCAATTAAAAGACGCTCAGAATGAAATGGTTCGCCACGGAGGCTGTCTGGGTCTCGGTTTAGCTGCAATGGGCACTCACCGGCAGGACGTTTACGAACAACTCAAGTTCAACCTATACCAAGACGACGCAGTTACCGGTGAAGCTGCTGGTATTGCTATGGGAATGGTCATGTTGGGTTCCCGCAACGCTGCCGCCATCGAGGACATGGTCGCCTACGCCCAGGAGACTCAACACGAAAAGATTTTGCGTGGTTTAGCCGTCGGCATATCCTTCACCATGTACGGACGGCTGGAAGAGGCTGATGCTCTCGTCCAACAGCTATTGAGAGATAAGGATCCGTTATTGCGTCGAGCTGGTTGTTACACCATAGCTACAGCCTACTGCGGCACTGGCAATAACGATTCAATTCGTACATTACTTCACGTGGCCGTTTCTGACGTGAACGACGACGTCCGCCGCGCTGCTGTAACTGCTTTAGGATTCCTACTGTTCAGAACGCCCGAACAATGTCCGTCTGTGGTGTCACTATTGGCGGAGTCCTACAATCCTCATGTACGGTACGGCGCTGCTATGGCCTTGGGTATCGCATGCGCCGGTACTGGGAATCGTGAAGCTATCGGACTTCTAGAACCTATGGTCAAATTTGACCCTGTTAATTTCGTCAGACAAGGAGCGCTTATAGCATCGGCGATGATTTTGATTCAGCAGACCGAGGCGCTATGTCCCAAAGTTACATACTTCCGTACGCTTTATTCACAAGTAATTTCAAACAAACACGAAGATGTTATGGCCAAATTTGGGGCTATATTGGCCCAAGGTATCATAGATGCAGGTGGACGGAATGTAACAGTCTCCCTTCAGAACAGAACCGGTCACATGAATATGTTGGCTGTTGTTGGCATGCTAGTATTCACTCAATACTGGTACTGGTTCCCGTTGGCTCATTGCCTATCACTTGCTTTTACGCCCACATGCGTGATTGCCCTAAATTCCGATTTAAAAATGCCACTACTGGAAATGAAATCCAACGCTAAACCATCGCTGTACGCCTACCCAGCACCGCTTGAAGAAAAGAAACGCGAAGAAAGAGAAAGAGTCACCACTGCCGTACTAAGTATTGCCGCAGCCAGAGCGCGCAGACGAGCTCACGGAACAGAGGGTTCCGCTAGCAGTAGTGTGACGTCATCGACCACATCTAAGATGGATGTCGATGAAGAAGAGAAGAAGCCTTCCAAATCACCAAACCCAAATATAACAGTTCACGGTAAATCCGATAAAGATGCCGGATCGTCGAAAGAAGGCAAGAAAGACGAAAAGGAAGCAGAAGAAAAAGATGTCAAGGAGAAGAAGGAACCGGAACCAAACTTTGAAATTCTCAGCAACCCAGCCAGGGTTATGCGTCAACAACTAAAAACTCTGACAGTTGTTGAGGGTTCCGGATACATGCCTTTGAAGGACGTCACTATTGGCGGTATCGTAATGTTGAATCATACGGGAGACAGTGAACAAGTGCTTGTGGAACCTGTCGCTGCTTTTGGTCCGAAAGCTGAAGAAGAAAAAGAACCTGAACCTCCTGAACCATTTGAATACTTGGACGAATGA

Protein sequence:

>DPOGS205492-PA
MNITSAAGIISLLDEPMHEVKKFALKRLDNIVDEFWPEISESIEKIEILHEDKVFSQHQLAALVASKVYYHLGAFEDSLTYALGAGELFDVNARNEYVDTTIAKAIDFYTQKRKALFIDSACETIDPRLEAIVNRMFQRCLDDGQYRQALGLALETRRMDIFEESIMKSDDVSGMLQYAFTVAMSLLQNRGFRSTVLRSLVGLYRGLNIPDYVNMCQCLIFLEDPLSVAEILDKLTHGPQDSVLMAYQIAFDLYDSATQQFLGRVLQALRITAPIPSALGGKPQPQGGPFPESTMEVDQSPSEEPKKPERDIDSLNDEEKEHQRRVEKLISILGGDVSIGLQLQFLIRSNHADMLILKNTKDAIRVSICHTATVIANAFMHAGTTSDQFLRDNLEWLARATNWAKLTVTASLGVIHRGHENESLALMQSYLPKEAGPSSGYSEGGGLYALGLIHANHGANIIDYLLTQLKDAQNEMVRHGGCLGLGLAAMGTHRQDVYEQLKFNLYQDDAVTGEAAGIAMGMVMLGSRNAAAIEDMVAYAQETQHEKILRGLAVGISFTMYGRLEEADALVQQLLRDKDPLLRRAGCYTIATAYCGTGNNDSIRTLLHVAVSDVNDDVRRAAVTALGFLLFRTPEQCPSVVSLLAESYNPHVRYGAAMALGIACAGTGNREAIGLLEPMVKFDPVNFVRQGALIASAMILIQQTEALCPKVTYFRTLYSQVISNKHEDVMAKFGAILAQGIIDAGGRNVTVSLQNRTGHMNMLAVVGMLVFTQYWYWFPLAHCLSLAFTPTCVIALNSDLKMPLLEMKSNAKPSLYAYPAPLEEKKREERERVTTAVLSIAAARARRRAHGTEGSASSSVTSSTTSKMDVDEEEKKPSKSPNPNITVHGKSDKDAGSSKEGKKDEKEAEEKDVKEKKEPEPNFEILSNPARVMRQQLKTLTVVEGSGYMPLKDVTIGGIVMLNHTGDSEQVLVEPVAAFGPKAEEEKEPEPPEPFEYLDE-