Monarch geneset OGS2.0

DPOGS202256
TranscriptDPOGS202256-TA1287 bp
ProteinDPOGS202256-PA428 aa
Genomic positionDPSCF300032 - 562676-566104
RNAseq coverage853x (Rank: top 15%)
Annotation
HeliconiusHMEL0050942e-12083.05% 
BombyxBGIBMGA004908-TA0.086.80% 
DrosophilaPros26.4-PA0.084.81% 
EBI UniRef50UniRef50_P621910.081.38%26S protease regulatory subunit 4 n=395 Tax=root RepID=PRS4_HUMAN
NCBI RefSeqXP_312923.40.085.94%AGAP003215-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123840640.086.39%hypothetical protein AND_02632 [Anopheles darlingi]
NCBI nr blastxgi|3123840640.086.39%hypothetical protein AND_02632 [Anopheles darlingi]
Group
Gene OntologyGO:00055241.3e-23ATP binding
KEGG pathwayaga:AgaP_AGAP0032160.0 
 K03062 (PSMC1, RPT2)maps-> Proteasome
InterPro domain[245-343] IPR0039591.3e-23ATPase, AAA-type, core
Orthology groupMCL11539 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202256-TA
ATGGGACAAAATCAATCTGGTGGTGGCAGCGGCGGAGACAAAAAGGATGACAAGGATAAGAAGAAGAAATATGAACCACCGATTCCTACAAGGGTTGGAAAAAAGAAGCGCAAGGCTAAGGGGCCAGACGCAGCTTTAAAGCTGCCTCAAGTAACGCCACATACGCGATGTAGGTTGAAACTACTTAAGTTGGAGAGAATTAAGGATTACTTACTTATGGAGGAGGAATTCATCCGCAATCAAGAAAGACTGAAGCCACAAGAAGAGAAAATTGAAGAGGAAAGATCAAAGGTAGATGATCTCCGTGGCACACCAATGTCAGTAGGTACTCTGGAAGAGATCATTGATGACAATCATGCCATAGTCTCCACATCCGTCGGCAGTGAACACTATGTCAGCATCCTGTCATTTGTTGACAAAGACCAGCTAGAGCCAGGCTGCTCAGTTTTACTAAACCATAAGGTTCATGCTGTGGTGGGTGTGCTGGGTGATGACACCGATCCGATGGTGTCAGTCATGAAGCTCGAGAAGGCTCCACAAGAGACATATGCAGATATTGGTGGTCTTGACACACAGATACAGGAAATTAAGGGAGCAAATGGCCTACCTATTGGATATACACAATATCCACAAAATAGTTTTGCAAATGCATTGCCACCCTTATATTGGGAAGAGGGAGGGGGAAAAGTAGGAAAAGTTAAAGGTCACATAAACGGTGGGAAAAGGAAAAGGGCAACCGGCTCTGGTGATGGTCCGAAATTAGTTCGTGAACTATTCAGAGTAGCCGAAGAACATGCTCCATCAATTGTATTTATTGATGAAATAGATGCTGTCGGGACCAAACGTTATGACTCCAACTCTGGCGGTGAGAGGGAAATTCAAAGAACTATGTTGGAGCTCCTCAATCAGTTGGACGGTTTTGATTCAAGAGGAGATGTTAAGGTTATTATGGCAACTAACAGAATAGAGACCCTAGACCCGGCCCTGATCCGTCCAGGCCGGATCGATCGCAAGATAGAGTTCCCGCTGCCCGACGAGAAGACCAAACGACGCATCTTCACCATACATACCTCCAGGATGACCTTGGCCGATGATGTCAACTTGTCAGAGCTCATCATGTCCAAGGATGATCTGTCCGGGGCAGATATGAAGGCTATTTGTACCGAGGCTGGTTTGATGGCACTCAGAGAACGGCGTATGAAGGTTACTAATGAAGACTTCAAGAAGTCTAAAGAGAGTGTCCTGTACCGCAAGAAGGAAGGCACTCCGGAAGGGCTTTACCTTTAA

Protein sequence:

>DPOGS202256-PA
MGQNQSGGGSGGDKKDDKDKKKKYEPPIPTRVGKKKRKAKGPDAALKLPQVTPHTRCRLKLLKLERIKDYLLMEEEFIRNQERLKPQEEKIEEERSKVDDLRGTPMSVGTLEEIIDDNHAIVSTSVGSEHYVSILSFVDKDQLEPGCSVLLNHKVHAVVGVLGDDTDPMVSVMKLEKAPQETYADIGGLDTQIQEIKGANGLPIGYTQYPQNSFANALPPLYWEEGGGKVGKVKGHINGGKRKRATGSGDGPKLVRELFRVAEEHAPSIVFIDEIDAVGTKRYDSNSGGEREIQRTMLELLNQLDGFDSRGDVKVIMATNRIETLDPALIRPGRIDRKIEFPLPDEKTKRRIFTIHTSRMTLADDVNLSELIMSKDDLSGADMKAICTEAGLMALRERRMKVTNEDFKKSKESVLYRKKEGTPEGLYL-