Monarch geneset OGS2.0

DPOGS216160
TranscriptDPOGS216160-TA1194 bp
ProteinDPOGS216160-PA397 aa
Genomic positionDPSCF300155 - 321788-322981
RNAseq coverage1516x (Rank: top 8%)
Annotation
HeliconiusHMEL0165590.099.50% 
BombyxBGIBMGA014177-TA0.099.50% 
DrosophilaPros45-PA0.097.49% 
EBI UniRef50UniRef50_O184130.097.49%26S protease regulatory subunit 8 n=62 Tax=root RepID=PRS8_DROME
NCBI RefSeqXP_001663850.10.097.73%26S protease regulatory subunit [Aedes aegypti]
NCBI nr blastpgi|17097990.099.50%18-56 protein [Manduca sexta]
NCBI nr blastxgi|17097990.099.50%18-56 protein [Manduca sexta]
Group
Gene OntologyGO:00167871.6e-138hydrolase activity
GO:00301631.6e-138protein catabolic process
GO:00057371.6e-138cytoplasm
GO:00055245.1e-43ATP binding
GO:00001669.5e-24nucleotide binding
GO:00171119.5e-24nucleoside-triphosphatase activity
KEGG pathwayaag:AaeL_AAEL0147230.0 
 K03066 (PSMC5, RPT6)maps-> Proteasome
InterPro domain[25-382] IPR0059371.6e-13826S proteasome subunit P45
[177-309] IPR0039595.1e-43ATPase, AAA-type, core
[173-312] IPR0035939.5e-24ATPase, AAA+ type, core
Orthology groupMCL10874 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216160-TA
ATGGAAGTAGACTCCGTAAAAGGAGAAGGCTTCCGACCGTACTACATTACAAAGATCGAAGAACTGCAACTCGTAGTTGCCGAGAAGTCACAGAATCTTCGACGTCTTCAGGCACAACGGAATGAACTGAACGCTAAAGTTCGTATGCTGCGAGAAGAGCTGCAGCTTCTGCAGGAGCAAGGATCTTACGTCGGTGAAGTCGTCAAACCAATGGACAAGAAGAAGGTTCTGGTTAAGGTACACCCAGAAGGAAAGTTCGTTGTAGACCTGGACAAAAATGTCGACATCAATGATGTGACCGCTAATTGTCGCGTCGCTTTACGAAACGAAAGCTATACGCTACACAAGATTCTGCCTAATAAAGTTGATCCATTGGTGTCGCTTATGATGGTGGAAAAAGTCCCTGATTCCACATACGAGATGGTTGGAGGTTTGGACAAACAAATCAAGGAGATTAAAGAGGTAATTGAATTGCCCGTTAAGCATCCTGAGCTGTTTGATGCGCTCGGTATTGCTCAGCCAAAAGGCGTTCTTTTGTATGGTCCACCTGGAACTGGTAAGACACTCTTAGCTCGCGCCGTCGCCCACCACACCGAGTGCACCTTCATTCGTGTCTCTGGTTCAGAGCTTGTGCAGAAATTCATTGGAGAAGGTAGTCGTATGGTGCGAGAACTTTTCGTAATGGCTCGAGAGCATGCTCCTTCAATAATTTTCATGGACGAAATTGATTCTATTGGATCATCTCGAATTGAGTCTGGCAGCGGAGGAGATTCTGAAGTCCAGAGAACCATGTTAGAATTGCTCAATCAATTGGACGGATTCGAGGCCACCAAGAATATTAAGGTCATTATGGCAACCAACCGTATTGACATTTTGGATCCCGCCCTATTAAGACCTGGCCGTATCGACAGGAAAATCGAGTTCCCACCTCCGAATGAAGAAGCTAGGTTGGATATCCTTAAAATCCATTCACGTAAGATGAATTTGACCAGAGGTATCAATCTCCGTAAGATTGCCGAGTTAATGCCTGGCGCATCTGGAGCGGAGGTTAAAGGTGTGTGCACTGAAGCCGGTATGTATGCTTTGCGTGAACGCAGAGTCCACGTCACTCAGGAGGACTTTGAAATGGCCGTGGCAAAGGTCATGCAGAAGGATTCAGAGAAGAATATGTCCATCAAGAAGTTGTGGAAGTAA

Protein sequence:

>DPOGS216160-PA
MEVDSVKGEGFRPYYITKIEELQLVVAEKSQNLRRLQAQRNELNAKVRMLREELQLLQEQGSYVGEVVKPMDKKKVLVKVHPEGKFVVDLDKNVDINDVTANCRVALRNESYTLHKILPNKVDPLVSLMMVEKVPDSTYEMVGGLDKQIKEIKEVIELPVKHPELFDALGIAQPKGVLLYGPPGTGKTLLARAVAHHTECTFIRVSGSELVQKFIGEGSRMVRELFVMAREHAPSIIFMDEIDSIGSSRIESGSGGDSEVQRTMLELLNQLDGFEATKNIKVIMATNRIDILDPALLRPGRIDRKIEFPPPNEEARLDILKIHSRKMNLTRGINLRKIAELMPGASGAEVKGVCTEAGMYALRERRVHVTQEDFEMAVAKVMQKDSEKNMSIKKLWK-