Monarch geneset OGS2.0

DPOGS202257
Transcript: DPOGS202257-TA, 1005 bp
Protein: DPOGS202257-PA, 334 aa
Genomic position: DPSCF300032 - 558237-561209
RNAseq coverage: 1476x (Rank: top 9%)
Annotation
Heliconius: HMEL012314; 2e-107; 55.24%
Bombyx: BGIBMGA004908-TA; 0.0; 99.40%
Drosophila: Pros26.4-PA; 0.0; 98.20%
EBI UniRef50: UniRef50_P62191; 0.0; 94.61%; 26S protease regulatory subunit 4 n=395 Tax=root RepID=PRS4_HUMAN
NCBI RefSeq: XP_001655853.1; 0.0; 98.20%; 26S protease regulatory subunit [Aedes aegypti]
NCBI nr blastp: gi|157131453; 0.0; 98.20%; 26S protease regulatory subunit [Aedes aegypti]
NCBI nr blastx: gi|157131453; 0.0; 98.20%; 26S protease regulatory subunit [Aedes aegypti]
Group
Gene Ontology:
GO:0016787; 4.6e-137; hydrolase activity
GO:0030163; 4.6e-137; protein catabolic process
GO:0005737; 4.6e-137; cytoplasm
GO:0005524; 1.5e-42; ATP binding
GO:0000166; 1.4e-22; nucleotide binding
GO:0017111; 1.4e-22; nucleoside-triphosphatase activity
KEGG pathway: aag:AaeL_AAEL012095; 0.0
K03062 (PSMC1, RPT2) maps-> Proteasome
InterPro domain:
[3-320] IPR005937; 4.6e-137; 26S proteasome subunit P45
[116-249] IPR003959; 1.5e-42; ATPase, AAA-type, core
[112-251] IPR003593; 1.4e-22; ATPase, AAA+ type, core
Orthology group: MCL11539 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202257-TA
ATGTCAGTTGGTACTCTCGAAGAGATCATTGATGACAATCATGCCATAGTCTCCACATCCGTCGGCAGTGAACACTATGTCAGTATCCTATCATTTGTTGATAAAGACCAGCTAGAGCCAGGCTGTTCAGTTTTACTAAACCACAAGGTTCATGCTGTGGTGGGTGTGCTGGGTGATGACACCGATCCGATGGTGTCAGTCATGAAGCTCGAGAAGGCTCCGCAAGAGACATATGCAGATATTGGTGGTCTTGACACACAGATACAGGAAATTAAGGAATCAGTGGAACTTCCACTAACTCACCCGGAGTATTACGAAGAGATGGGCATTAAGCCGCCAAAGGGAGTCATTTTGTATGGACCACCAGGTACGGGGAAGACCTTACTGGCTAAGGCCGTAGCCAACCAGACATCAGCCACCTTCTTGAGGGTTGTCGGCTCTGAGCTGATTCAGAAGTATTTGGGTGATGGTCCGAAATTAGTTCGTGAACTATTCAGAGTAGCCGAAGAACATGCTCCATCAATTGTATTTATTGATGAAATAGATGCTGTCGGGACCAAACGTTATGACTCCAACTCTGGCGGTGAGAGGGAAATTCAAAGAACTATGTTGGAGCTCCTCAATCAGTTGGACGGTTTTGATTCAAGAGGAGATGTTAAGGTTATTATGGCAACTAACAGAATAGAGACCCTAGACCCGGCCCTGATCCGTCCAGGCCGGATCGATCGCAAGATAGAGTTCCCGCTGCCCGACGAGAAGACCAAACGACGCATCTTCACCATACATACCTCCAGGATGACCTTGGCCGATGATGTCAACTTGTCAGAGCTCATCATGTCCAAGGATGATCTGTCCGGGGCAGATATGAAGGCTATTTGTACCGAGGCTGGTTTGATGGCACTCAGAGAACGACGTATGAAGGTTACTAATGAAGACTTCAAGAAGTCTAAAGAGAGTGTCCTGTACCGCAAGAAGGAAGGCACTCCGGAAGGGCTTTACCTTTAA

Protein sequence:

>DPOGS202257-PA
MSVGTLEEIIDDNHAIVSTSVGSEHYVSILSFVDKDQLEPGCSVLLNHKVHAVVGVLGDDTDPMVSVMKLEKAPQETYADIGGLDTQIQEIKESVELPLTHPEYYEEMGIKPPKGVILYGPPGTGKTLLAKAVANQTSATFLRVVGSELIQKYLGDGPKLVRELFRVAEEHAPSIVFIDEIDAVGTKRYDSNSGGEREIQRTMLELLNQLDGFDSRGDVKVIMATNRIETLDPALIRPGRIDRKIEFPLPDEKTKRRIFTIHTSRMTLADDVNLSELIMSKDDLSGADMKAICTEAGLMALRERRMKVTNEDFKKSKESVLYRKKEGTPEGLYL-
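The transcript and protein records above can be checked against each other: 1005 bp is 335 codons, which is 334 amino acids plus one stop codon, shown as the trailing "-" in the protein sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first five and last three codons of DPOGS202257-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" by default, to match
    the notation used in the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First five codons of DPOGS202257-TA
print(translate("ATGTCAGTTGGTACT"))  # MSVGT
# Last three codons of DPOGS202257-TA, including the stop codon
print(translate("TACCTTTAA"))        # YL-
```

Passing the full nucleotide sequence to `translate` should give a 335-character string (334 residues plus the stop symbol) that can be compared directly with the DPOGS202257-PA record.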