Monarch geneset OGS2.0

DPOGS209519
TranscriptDPOGS209519-TA1305 bp
ProteinDPOGS209519-PA434 aa
Genomic positionDPSCF300127 + 440752-445489
RNAseq coverage2627x (Rank: top 5%)
Annotation
HeliconiusHMEL0162703e-14992.28% 
BombyxBGIBMGA007332-TA0.099.08% 
DrosophilaRpt1-PA0.093.32% 
EBI UniRef50UniRef50_Q7KMQ00.093.32%26S proteasome regulatory complex subunit p48B n=32 Tax=Bilateria RepID=Q7KMQ0_DROME
NCBI RefSeqXP_972389.10.096.54%PREDICTED: similar to Rpt1 CG1341-PA [Tribolium castaneum]
NCBI nr blastpgi|910888850.096.54%PREDICTED: similar to Rpt1 CG1341-PA [Tribolium castaneum]
NCBI nr blastxgi|910888850.096.54%PREDICTED: similar to Rpt1 CG1341-PA [Tribolium castaneum]
Group
Gene OntologyGO:00167871.2e-122hydrolase activity
GO:00301631.2e-122protein catabolic process
GO:00057371.2e-122cytoplasm
GO:00055246e-40ATP binding
GO:00001661.1e-22nucleotide binding
GO:00171111.1e-22nucleoside-triphosphatase activity
KEGG pathwaytca:6611140.0 
 K03061 (PSMC2, RPT1)maps-> Proteasome
InterPro domain[108-417] IPR0059371.2e-12226S proteasome subunit P45
[213-345] IPR0039596e-40ATPase, AAA-type, core
[209-348] IPR0035931.1e-22ATPase, AAA+ type, core
Orthology groupMCL13714 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209519-TA
ATGCCTGATCACCTAGGAGATGATATGCGAAAAGTAAAAGATGACAAGGAAGAGCCAGAAAAAGAGATTAAATCCCTTGATGAAGGAGACATAGCATTATTAAAGTCTTATGGCCAGGGACAGTATATGAAGATTATCAAAGAGGTGGAAGAGGGGATACAGACTGTAATGAAGAGAGTTAATGAGCTAACTGGAATAAAGGAATCTGACACTGGTCTAGCTCCACCTGCCCTGTGGGATCTTGCTGCAGATAAACAAACTCTTCAGAATGAACAACCCTTACAGGTTGCAAGATGTACAAAAATCATTAATGCAGATTCAAATGATCCCAAATACATAATAAATGTGAAGCAATTTGCTAAGTTTGTTGTGGACCTGGCTGACTCGGTGGCTCCTACTGATATTGAAGAAGGAATGAGAGTCGGAGTCGATCGTAACAAATATCAGATCCACATACCCTTACCACCCAAAATAGACCCAACTGTGACTATGATGCAAGTGGAAGAAAAGCCTGATGTTACATACAGTGATGTCGGAGGCTGTAAGGAGCAGATTGAAAAGTTAAGAGAGGTCGTTGAAACACCACTGTTACATCCAGAGAAATTCGTGAAGCTCGGTATTGAGCCACCCAAGGGAGTGCTGTTGTTCGGACCCCCCGGTACTGGCAAGACGTTGTGTGCGAGGGCCGTGGCCAACAGGACAGATGCTTGCTTCATAAGAGTCATCGGTTCGGAGCTAGTTCAGAAGTATGTTGGTGAAGGTGCTCGTATGGTGAGGGAGTTGTTTGAGATGGCGAGGAGTAAGAAAGCCTGTTTGATATTCTTTGATGAGATTGACGCCATCGGAGGTGCCAGGTTCGATGATGGAGCCGGTGGTGATAATGAAGTGCAGAGGACTATGCTAGAGCTGATCAACCAATTGGATGGGTTCGATCCACGAGGAAACATCAAGGTGTTAATGGCGACCAATCGTCCGGATACCCTGGACCCCGCCCTCATGCGTCCCGGACGGCTAGACCGGAAGGTGGAGTTCGGTCTGCCGGAGCTGGAGGGGCGAGCTCACATCTTCCGCATACACGCCAGATCCATGAGCGTCGAGAGGGACATACGATTCGACCTACTCGCCAGGCTCTGTCCCAACTCCACCGGCGCTGAGATCAGGTCTGTGTGTACCGAGGCCGGTATGTTCGCGATCCGCGCTCGTCGGAAGGTCGCCACCGAGAAGGACTTCCTGGAGGCTGTTAACAAGGTCATCAAGTCGTACGCGAAGTTCTCAGCGACGCCGCGCTACATGACCTACAATTAA

Protein sequence:

>DPOGS209519-PA
MPDHLGDDMRKVKDDKEEPEKEIKSLDEGDIALLKSYGQGQYMKIIKEVEEGIQTVMKRVNELTGIKESDTGLAPPALWDLAADKQTLQNEQPLQVARCTKIINADSNDPKYIINVKQFAKFVVDLADSVAPTDIEEGMRVGVDRNKYQIHIPLPPKIDPTVTMMQVEEKPDVTYSDVGGCKEQIEKLREVVETPLLHPEKFVKLGIEPPKGVLLFGPPGTGKTLCARAVANRTDACFIRVIGSELVQKYVGEGARMVRELFEMARSKKACLIFFDEIDAIGGARFDDGAGGDNEVQRTMLELINQLDGFDPRGNIKVLMATNRPDTLDPALMRPGRLDRKVEFGLPELEGRAHIFRIHARSMSVERDIRFDLLARLCPNSTGAEIRSVCTEAGMFAIRARRKVATEKDFLEAVNKVIKSYAKFSATPRYMTYN-