Monarch geneset OGS2.0

DPOGS203598
Transcript: DPOGS203598-TA (1260 bp)
Protein: DPOGS203598-PA (419 aa)
Genomic position: DPSCF300063 - 635936-638619
RNAseq coverage: 988x (Rank: top 13%)
Annotation
Heliconius       HMEL008883         0.0      85.56%
Bombyx           BGIBMGA007268-TA   0.0      83.11%
Drosophila       Rpn5-PA            9e-85    60.98%
EBI UniRef50     UniRef50_O00232    4e-132   54.27%   26S proteasome non-ATPase regulatory subunit 12 n=77 Tax=Eumetazoa RepID=PSD12_HUMAN
NCBI RefSeq      NP_001040208.1     0.0      82.67%   proteasome 26S non-ATPase subunit 12 [Bombyx mori]
NCBI nr blastp   gi|114052086       0.0      82.67%   proteasome 26S non-ATPase subunit 12 [Bombyx mori]
NCBI nr blastx   gi|114052086       0.0      82.67%   proteasome 26S non-ATPase subunit 12 [Bombyx mori]
Group
Gene Ontology    GO:0005515         1.2e-17  protein binding
KEGG pathway     tca:655144         1e-161
                 K03035 (PSMD12, RPN5) maps-> Proteasome
InterPro domain  [302-383] IPR011991  1.1e-26  Winged helix-turn-helix transcription repressor DNA-binding
                 [268-374] IPR000717  1.2e-17  Proteasome component (PCI) domain
Orthology group  MCL13711  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203598-TA
ATGGCTTCAAACGGTGATATTGAAAGTCTAGATGCTAGCGGCAAAATCATTAAAATGGAAGTGGACTACAGTGCCACTTGCGATGAGAAAATACCATTATGGAAATCCTGGGCTTCTAATGGAAAAGTGCAGGAGGCTATAGATCAGCTACTGGCTTTGGAAAAACAAACAAGGACTGCTGCTGATATGGCCTCCACTGCAAGGATTCTGGTCACAATAGTACAGATCTGTTTCGAGGCCAAGAACTGGACTGCTCTTAATGATCACATCATTCTTCTTTCAAAGAGAAGGTCACAGTTAAAACAAGCTGTTGTGAAAATGGTTCAAGAATGCTATACATATGTAAACAAGACCCCGGATAAAGAGACTAAAATAAAACTCATAGAAACCCTCAGAACTATAACGGAAGGAAAAATCTATGTAGAAGTGGAAAGGGCCCGTCTGACTCATATCTTGGCCAAAATGCGCGAAGAAGAAAACAATATAGCAGAAGCTGCAAAGATAATACAAGAATTACAAGTTGAAACTTACGGCTCTATGGATAAGAGAGAGAAAGTAGAGCTTATATTGGAGCAGATGAGGCTGTGTCTAGCCATCAAAGATTATATCCGTACACAGATTATCTCAAAGAAGATAAACACAAAATTCTTTGAGGAAGATGATACTCAGGAGCTAAAAGAGAAATTCTACCGACTCATGATAGCTGTTGATCAACAAAATGGTCAATATCTCTCAGTGTGCCGTCATTTCCGAGCTCTCGGACAAGCTGGCGGGGCAGATGCTCTGATTGGTAGTGTGGTACAACTTCTCGGTTTATTCATAACTCCGGAGATCATAAGATGGAATACATTATGCTCCACCTATGAGAAGATGCTGAGAACAACTCCCTTCTTCCAGAGCAATGATGAGAAAGGTCAAGAGCGTTGGAATGACCTCAAGAATAGAGTTGTCGAACATAATATCCGCATCATGTCAATGTACTACACTCGTATAACGATCCAACGTATGAGTGAGCTTCTCGGTCTTAGCGTCACTGAGACAGAGGACGCCTTGAGTCAATTAGTGGTGAGCGCGGTGGTGAAAGCCAAGATAGACCGACCAGCCGGCGTCGTGCACTTCAGATTAAATATGGACGCGTCCGATCGTTTGAACGAATGGTCCCGCAACTTGAACACGCTGATGCAGCTCGTCAACAAAACAACTCACTTGATCAACAAAGAGGAATGCGTTCACAAACATTTGTTAGCGACGTCCGAATAA

Protein sequence:

>DPOGS203598-PA
MASNGDIESLDASGKIIKMEVDYSATCDEKIPLWKSWASNGKVQEAIDQLLALEKQTRTAADMASTARILVTIVQICFEAKNWTALNDHIILLSKRRSQLKQAVVKMVQECYTYVNKTPDKETKIKLIETLRTITEGKIYVEVERARLTHILAKMREEENNIAEAAKIIQELQVETYGSMDKREKVELILEQMRLCLAIKDYIRTQIISKKINTKFFEEDDTQELKEKFYRLMIAVDQQNGQYLSVCRHFRALGQAGGADALIGSVVQLLGLFITPEIIRWNTLCSTYEKMLRTTPFFQSNDEKGQERWNDLKNRVVEHNIRIMSMYYTRITIQRMSELLGLSVTETEDALSQLVVSAVVKAKIDRPAGVVHFRLNMDASDRLNEWSRNLNTLMQLVNKTTHLINKEECVHKHLLATSE-
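
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code applies, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration. It translates the first and last codons of DPOGS203598-TA and compares the listed lengths (1260 bp for the transcript, 419 aa for the protein). The trailing `-` in the protein record is assumed to mark the stop codon, which the sketch prints as `*`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon ('*' = stop, 'X' = unknown)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First six and last five codons copied from the DPOGS203598-TA record.
first_codons = "ATGGCTTCAAACGGTGAT"
last_codons = "GCGACGTCCGAATAA"

print(translate(first_codons))  # MASNGD
print(translate(last_codons))   # ATSE*

# 419 residues plus one stop codon account for the full 1260 bp transcript.
print(1260 == 3 * (419 + 1))    # True
```

Passing the full nucleotide sequence to `translate` in the same way would allow a residue-by-residue comparison with the protein record.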