Monarch geneset OGS2.0

DPOGS206320
Transcript: DPOGS206320-TA (1263 bp)
Protein: DPOGS206320-PA (420 aa)
Genomic position: DPSCF300082 - 500333-501595
RNAseq coverage: 1982x (Rank: top 6%)
Annotation
Heliconius      HMEL012595          0.0      98.33%
Bombyx          BGIBMGA014136-TA    0.0      97.62%
Drosophila      Rpn6-PA             0.0      76.54%
EBI UniRef50    UniRef50_E0W0L5     3e-177   74.05%   26S proteasome non-ATPase regulatory subunit, putative n=19 Tax=Eukaryota RepID=E0W0L5_PEDHC
NCBI RefSeq     XP_391945.1         0.0      80.85%   PREDICTED: similar to Proteasome p44.5 subunit CG10149-PB, isoform B isoform 1 [Apis mellifera]
NCBI nr blastp  gi|48097764         0.0      80.85%   PREDICTED: 26S proteasome non-ATPase regulatory subunit 11-like isoform 1 [Apis mellifera]
NCBI nr blastx  gi|48097764         0.0      80.85%   PREDICTED: 26S proteasome non-ATPase regulatory subunit 11-like isoform 1 [Apis mellifera]
Group
Gene Ontology     GO:0005515   1.7e-18   protein binding
KEGG pathway      ame:408397   0.0
                  K03036 (PSMD11, RPN6)  maps-> Proteasome
InterPro domain   [141-318]  IPR013143   9.6e-58   PCI/PINT associated module
                  [314-395]  IPR011991   3.5e-22   Winged helix-turn-helix transcription repressor DNA-binding
                  [319-402]  IPR000717   1.7e-18   Proteasome component (PCI) domain
Orthology group   MCL15252   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206320-TA
ATGGCTGGGGCAATGTTGTTTGAGAGATCACGCGTGTCGTCTTCTAACAGAGAGGAAGACGTCCGCATGACTGACAAAATGGTTAGCACAGGTGAAGTGCCCGAAGATGACGAGGAAAATATCAGAGCTAAGGAACAAGGAATACTGAATCTTGGAGAGAAATACAAGAAGGAAGGGAAAGCCAAGGAGTTGGCTGAATTAATAAAAGCTACCAGGCCTTTTTTAAGTCTGATAAGCAAGGCAAAAGCAGCTAAACTTGTCCGTTCTCTTGTTGATTTCTTTCTTGATTTAGAAGCTGGTATTGGTATTGAAGTTCAATTATGCAAAGAATGTATAGAATGGGCCAAGGAAGAGCGCCGCACCTTCTTAAGACAGTCTCTGGAAGCCAGACTCGTCGCCCTTTACTTCGACACTGGTATGTACACCGAGGCCTTAGATTTAGCAACTGCCCTCTTAAAAGAACTCAAGAAGTTAGATGACAAAAATTTGTTAGTGGAAGTGCTACTTTTGGAGAGTAAAACCTATCATGCCCTCAGTAATCTTCCTAAGGCACGCGCTTCGTTGACGTCAGCTAGGACCACAGCAAATGCCATCTACTGCCCTCCAAAAATGCAGGCAGCATTGGATTTGCAATCAGGTATACTCCACGCTGCTGATGAGAGGGACTTCAAAACTGCTTACTCTTACTTCTACGAAGCCTTTGAAGGTTACGACGGCGCTGACAGTCCTAAGGCTTTAACGGCTCTAAAATATATGTTATTGTCCAAAATCATGCTCAGTCAAGCAGAGGAGGTGGCTACAGTTTGCAGTAGTAAAGCAGCTCTGAAATATGCAGGTAAAGAATTAGAAGCCATGAGAGCAGTTGCTACTGCTTCTCACAAAAGATCACTTGCTGATTTCCAAGCTGCATTAAAGACATATAAGCCCGAATTAGAAGAAGATGCCGTTGTCAGAGCTCACCTCGGCTCTCTATATGATACCATGCTGGAACAGAACTTGTGTCGCATCGTAGAGCCTTATATGAGAGTTCAAGTGGACCATGTAGCTAAGTGTATCCGTCTGCCAGTTGTTCAAGTAGAGAAGAAATTATCACAGATGATACTCGATAAGAAGCTGAATGGTATACTTGACCAGGGAGAGGGCGTTTTAATTGTTTTCGACGAATCCCCCCTCGAGAAGACTTACGAAACAGTTTTAGAAACAATACATCATATGAGCAAAGTTGTTGACACACTCTACCAGAAAGCTAAAAAACTATCATAG

Protein sequence:

>DPOGS206320-PA
MAGAMLFERSRVSSSNREEDVRMTDKMVSTGEVPEDDEENIRAKEQGILNLGEKYKKEGKAKELAELIKATRPFLSLISKAKAAKLVRSLVDFFLDLEAGIGIEVQLCKECIEWAKEERRTFLRQSLEARLVALYFDTGMYTEALDLATALLKELKKLDDKNLLVEVLLLESKTYHALSNLPKARASLTSARTTANAIYCPPKMQAALDLQSGILHAADERDFKTAYSYFYEAFEGYDGADSPKALTALKYMLLSKIMLSQAEEVATVCSSKAALKYAGKELEAMRAVATASHKRSLADFQAALKTYKPELEEDAVVRAHLGSLYDTMLEQNLCRIVEPYMRVQVDHVAKCIRLPVVQVEKKLSQMILDKKLNGILDQGEGVLIVFDESPLEKTYETVLETIHHMSKVVDTLYQKAKKLS-
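
Consistency check (illustrative):

The record gives a 1263 bp transcript, a 420 aa protein, and a genomic span of 500333-501595. The sketch below shows one way to check that these figures agree and that the coding sequence translates to the listed protein. It assumes the standard genetic code and inclusive coordinates; the helper name `translate` is made up for this example and is not part of the database. The trailing "-" in the protein record is treated as the stop-codon symbol.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cds[i:i + 3]].replace("*", stop_symbol)
        for i in range(0, len(cds), 3)
    )


# Figures taken from the record above.
transcript_bp = 1263
protein_aa = 420
span_start, span_end = 500333, 501595

# Inclusive genomic span length equals the transcript length.
assert span_end - span_start + 1 == transcript_bp
# 421 codons: 420 amino acids plus one stop codon.
assert transcript_bp // 3 - 1 == protein_aa

# First five and last three codons of DPOGS206320-TA.
print(translate("ATGGCTGGGGCAATG"))  # MAGAM
print(translate("CTATCATAG"))        # LS-
```

Passing the full DPOGS206320-TA sequence to `translate` and comparing the result with DPOGS206320-PA extends the same check to the whole record.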