Monarch geneset OGS2.0

DPOGS215611
TranscriptDPOGS215611-TA1257 bp
ProteinDPOGS215611-PA418 aa
Genomic positionDPSCF300041 - 2236255-2239467
RNAseq coverage2308x (Rank: top 5%)
Annotation
HeliconiusHMEL0059161e-9656.78% 
BombyxBGIBMGA003671-TA2e-14270.86% 
DrosophilaCG13349-PE5e-8447.21% 
EBI UniRef50UniRef50_F4WT806e-8449.06%Proteasomal ubiquitin receptor ADRM1-like protein n=7 Tax=Formicidae RepID=F4WT80_ACREC
NCBI RefSeqXP_001949000.12e-9046.86%PREDICTED: similar to Protein ADRM1 homolog (p42E) [Acyrthosiphon pisum]
NCBI nr blastpgi|3838509575e-9246.59%PREDICTED: proteasomal ubiquitin receptor ADRM1-like isoform 1 [Megachile rotundata]
NCBI nr blastxgi|1700333635e-10650.12%ADRM1 [Culex quinquefasciatus]
Group
Gene OntologyGO:00056348.3e-122nucleus
GO:00057378.3e-122cytoplasm
KEGG pathway 
InterPro domain[5-418] IPR0067738.3e-12226S proteasome complex ubiquitin receptor, subunit Rpn13
Orthology groupMCL15756 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215611-TA
ATGTCCGCGACAGCATTGTTTGGCAACACTTCGGGACTGGGAGGCAGTTCTGGAGGGAATAAACACTTAGTTGAGTTTCGGGCTGGTAGAATGACTCTTAAAGGACGAATGGTACATCCCGACAAGAGGAAGGGTTTATTATATGTGTATCAAGGTGAAGACTCATTGATGCATTTCTGTTGGAAAGATCGTACAACTGGAGAAGTTGAAGATGACCTATTGATCTTTCCGGATGACTGTGAATTTGTAAGAGTTAATGAATGTACAACCGGAAGAGTATATGTATTGAAATTTAAATCATTTTCAAAGAAATATTTCTTTTGGATGCAGGAGCCAAAAACAGACAAGGATGATGAATATTGTCGCCGTCTCAATGAAGCGTTGAATAACCCACCAACTTCCGGTGGTCGTGGTGGTAGCGGCGGTGGTGCCCAAGATGGAGACTTACACAACCTTCTTAACAACATGTCACAACAGCAATTGATGCAGTTATTTGGTGGTGTTGGTCAGATTGGTGGATTATCATCACTGCTGGGAACTATGGGCAACAACAGTAGCAGTGGCAACGCAACTCGTCCATCTGGAAATAGCAGCAATTCCCGTGGTGGTTCGGCACCTCGGTCTGAGCCCACAACACGTACATCAGCACGAGCTCGTGATGAACGCACACCTGTTCCCATCGTTGCCCCCACACCCGCTACTGTTCCCGCCCCTACTGCCACCCCACCCAACACTGCCACTGGAACTCAACCTCGCAGTGGTCAGATATTTCTATCCGACCTGCAGCGGTACTTCTCCGGTCTAGGCAACGCACCTCCGGAGGGCGAAGGCGCGGTGGGTGGGACGGGGGCTTCCCGCGTGGAGTTGGGGGCTGCGCTGGCGACTCCGGAGGTGGTGTCAACGGCCAGCGAGCCGGCTAACTCCCAGCGCCTCGCCCCTCACCTGCCGCCCGCGCCGCCCGCCGCCCCCCAGGACGATGTAAGGACCACACTGCTCTCACCGCAGTTCGCCCAGGCAGCCAATCAGTTTTCATCGGCTCTTACATCCGGTCAAATGGGACCAGTCATGACACAGTTTGGGCTTCCGGCTGACGTCACTACAGCCGCCAACACGGGAGACATGCAGGCCTTCTTTAAAGCCTTGGAGAGTGCGTCTTCATCCGAAAGCGGAAAGTCGGAAGGAGACAGAAAGAAAGATAAACCTCAAGATGACAAAAATGACAAAAAAGATGGTGATGCTGGAATGTCACTCGATTAA

Protein sequence:

>DPOGS215611-PA
MSATALFGNTSGLGGSSGGNKHLVEFRAGRMTLKGRMVHPDKRKGLLYVYQGEDSLMHFCWKDRTTGEVEDDLLIFPDDCEFVRVNECTTGRVYVLKFKSFSKKYFFWMQEPKTDKDDEYCRRLNEALNNPPTSGGRGGSGGGAQDGDLHNLLNNMSQQQLMQLFGGVGQIGGLSSLLGTMGNNSSSGNATRPSGNSSNSRGGSAPRSEPTTRTSARARDERTPVPIVAPTPATVPAPTATPPNTATGTQPRSGQIFLSDLQRYFSGLGNAPPEGEGAVGGTGASRVELGAALATPEVVSTASEPANSQRLAPHLPPAPPAAPQDDVRTTLLSPQFAQAANQFSSALTSGQMGPVMTQFGLPADVTTAANTGDMQAFFKALESASSSESGKSEGDRKKDKPQDDKNDKKDGDAGMSLD-