Monarch geneset OGS2.0

DPOGS204033
Transcript: DPOGS204033-TA (1287 bp)
Protein: DPOGS204033-PA (428 aa)
Genomic position: DPSCF300138 + 25640-28315
RNAseq coverage: 1674x (Rank: top 8%)
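
The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive and that the transcript is a complete coding sequence ending in a stop codon. Neither assumption is stated by the record itself.

```python
# Values copied from the record: transcript length, protein length,
# and the genomic coordinates on scaffold DPSCF300138.
transcript_bp = 1287
protein_aa = 428
locus_start, locus_end = 25640, 28315

# Assumption: a complete CDS has one codon per residue plus one stop codon.
expected_cds_bp = 3 * (protein_aa + 1)
cds_matches = expected_cds_bp == transcript_bp

# Assumption: coordinates are 1-based and inclusive.
locus_span_bp = locus_end - locus_start + 1

# Bases inside the locus that are not part of the transcript
# (for example introns, if the transcript is spliced).
non_transcript_bp = locus_span_bp - transcript_bp

print(cds_matches, locus_span_bp, non_transcript_bp)
```

Under these assumptions the transcript length agrees with the protein length, and the locus is longer than the transcript.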
Annotation
Heliconius: HMEL008162, 0.0, 100.00%
Bombyx: BGIBMGA004782-TA, 0.0, 99.77%
Drosophila: Tbp-1-PA, 0.0, 94.39%
EBI UniRef50: UniRef50_P17980, 0.0, 87.41%, 26S protease regulatory subunit 6A n=378 Tax=root RepID=PRS6A_HUMAN
NCBI RefSeq: XP_001866517.1, 0.0, 95.56%, 26S protease regulatory subunit 6A [Culex quinquefasciatus]
NCBI nr blastp: gi|308512763, 0.0, 98.60%, proteasome 26S subunit 6A [Biston betularia]
NCBI nr blastx: gi|308512763, 0.0, 98.60%, proteasome 26S subunit 6A [Biston betularia]
Group
Gene Ontology:
  GO:0016787, 6.5e-127, hydrolase activity
  GO:0030163, 6.5e-127, protein catabolic process
  GO:0005737, 6.5e-127, cytoplasm
  GO:0005524, 6.3e-41, ATP binding
  GO:0000166, 1.1e-22, nucleotide binding
  GO:0017111, 1.1e-22, nucleoside-triphosphatase activity
KEGG pathway: cqu:CpipJ_CPIJ016407, 0.0
  K03065 (PSMC3, RPT5) maps-> Proteasome
InterPro domain:
  [58-415] IPR005937, 6.5e-127, 26S proteasome subunit P45
  [212-344] IPR003959, 6.3e-41, ATPase, AAA-type, core
  [208-347] IPR003593, 1.1e-22, ATPase, AAA+ type, core
Orthology group: MCL12286, Single-copy universal gene

Nucleotide sequence:

>DPOGS204033-TA
ATGGCTACAACGCTGGAAGACAAGTCTATTTGGGAAGATGGTGAAGAAGCTCTCAGTGAGGAAGTTTTGCGTATGCCAACAGATGAGATAATAAGCCGAACACGATTACTGGACAATGAAATTAAGATCATGAAAAGTGAAGTCATGAGAATTTCGCATGAATTACAAGCACAGAATGACAAAATCAAGGAAAACACAGAGAAAATTAAAGTAAATAAGACTCTGCCATATCTTGTTTCGAATGTAATCGAATTACTCGACGTAGACCCTCAGGAGGAGGAAGAAGACGGCGCTGTGGTAGACCTGGACTCGCAAAGGAAAGGGAAATGCGCTGTCATCAAAACTTCGACCAGACAAACATATTTCTTACCTGTAATTGGTCTCGTGGATGCTGAAAAACTGAAGCCCGGTGACCTGGTTGGAGTCAACAAGGATTCTTATTTGATCTTGGAAACTCTACCAGCTGAGTATGACGCCAGAGTTAAAGCTATGGAGGTGGACGAGAGACCTACGGAACAATACTCAGATATTGGTGGTTTGGATAAACAAATACAGGAACTGATAGAGGCAGTTGTGCTTCCTATGACCCACAAGGAGAAGTTTGTGAACCTCGGTATCCATCCTCCGAAGGGTGTGCTGTTGTATGGTCCTCCAGGAACTGGCAAGACTTTGTTAGCTCGGGCTTGTGCTGCTCAAACTAAGTCCACATTCTTGAAGCTTGCCGGCCCTCAGCTTGTACAAATGTTCATTGGCGATGGAGCTAAGCTTGTTAGAGATGCCTTCGCATTAGCTAAGGAGAAGGCACCAGCGATAATTTTCATTGATGAGTTGGATGCTATCGGTACCAAGAGATTTGACTCAGAGAAGGCAGGCGACCGTGAAGTGCAGAGAACTATGTTGGAACTACTTAACCAGTTGGATGGTTTTAGCTCCACTGCAGATATAAAGGTTATTGCTGCCACAAACAGAGTAGATATTTTAGATCCAGCCCTGCTCCGATCTGGTCGTCTGGACAGGAAGATTGAATTCCCTCACCCCAACGAGGAGGCTAGGGCTAGGATCATGCAAATTCACTCTCGAAAAATGAATGTGTCCCCGGATGTGAATTTTGAGGAGCTGTCGCGTTCTACTGATGATTTCAATGGTGCCCAATGTAAAGCTGTGTGTGTTGAGGCTGGTATGATAGCGCTCAGACGGTCCGCTACCGCCGTCACTCACGAGGACTTCATGGACGCCATCCTTGAAGTACAGGCCAAGAAAAAGGCCAATCTCAGCTATTATGCTTAA

Protein sequence:

>DPOGS204033-PA
MATTLEDKSIWEDGEEALSEEVLRMPTDEIISRTRLLDNEIKIMKSEVMRISHELQAQNDKIKENTEKIKVNKTLPYLVSNVIELLDVDPQEEEEDGAVVDLDSQRKGKCAVIKTSTRQTYFLPVIGLVDAEKLKPGDLVGVNKDSYLILETLPAEYDARVKAMEVDERPTEQYSDIGGLDKQIQELIEAVVLPMTHKEKFVNLGIHPPKGVLLYGPPGTGKTLLARACAAQTKSTFLKLAGPQLVQMFIGDGAKLVRDAFALAKEKAPAIIFIDELDAIGTKRFDSEKAGDREVQRTMLELLNQLDGFSSTADIKVIAATNRVDILDPALLRSGRLDRKIEFPHPNEEARARIMQIHSRKMNVSPDVNFEELSRSTDDFNGAQCKAVCVEAGMIALRRSATAVTHEDFMDAILEVQAKKKANLSYYA-
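
The relationship between the two sequences above can be checked by translating the transcript. The following is a minimal sketch, not the pipeline used to build this geneset: it assumes the standard genetic code and that the record writes the stop codon as `-`. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; codons containing
    characters other than A, C, G, T are written as 'X'.
    Any incomplete trailing codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last two codons of DPOGS204033-TA.
print(translate("ATGGCTACAACGCTGGAAGAC"))
print(translate("GCTTAA"))
```

Passing the full DPOGS204033-TA sequence to `translate` and comparing the result with DPOGS204033-PA is a quick way to confirm that the two entries belong together.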