Monarch geneset OGS2.0

DPOGS211134
TranscriptDPOGS211134-TA825 bp
ProteinDPOGS211134-PA274 aa
Genomic positionDPSCF300007 - 285279-286429
RNAseq coverage2515x (Rank: top 5%)
Annotation
HeliconiusHMEL0172209e-15694.14% 
BombyxBGIBMGA003009-TA2e-15091.67% 
DrosophilaProsbeta2-PA3e-10470.83% 
EBI UniRef50UniRef50_Q994361e-9663.31%Proteasome subunit beta type-7 n=120 Tax=Eukaryota RepID=PSB7_HUMAN
NCBI RefSeqNP_001040536.15e-14991.67%proteasome subunit beta 7 [Bombyx mori]
NCBI nr blastpgi|1140530731e-14791.67%proteasome subunit beta 7 [Bombyx mori]
NCBI nr blastxgi|1140530735e-14291.67%proteasome subunit beta 7 [Bombyx mori]
Group
Gene OntologyGO:00516033.1e-50proteolysis involved in cellular protein catabolic process
GO:00042983.1e-50threonine-type endopeptidase activity
GO:00058393.1e-50proteasome core complex
GO:00041753.4e-07endopeptidase activity
KEGG pathwaynvi:1001213985e-115 
 K02739 (PSMB7)maps-> Proteasome
InterPro domain[37-217] IPR0013533.1e-50Proteasome, subunit alpha/beta
[47-62] IPR0002433.4e-07Peptidase T1A, proteasome beta-subunit
Orthology groupMCL14317 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211134-TA
ATGGCGTCGGTTCTGGTACCTGAAGTGCCTACCCCAGGTTTCTCTTTCGAAAATTGTCAACGCAATGCATTTTTATCTCAAAAAGGCTTTCCTGCCCCGAAAGCTACGAAGACTGGTACGACCATTGTGGGTATAATTTACGCAGACGGGGTTATTTTGGGTGCTGATACACGGGCGACAGAAAACACTGTTGTTTCGGATAAGAATTGTCAGAAGATTCACTATTTGGCTGGGAATATGTACTGCTGTGGTGCCGGCACTGCTGCTGACACTGAGATGACGACACAAACTGTAGCTTCTCAGCTCGAACTGCAGCGTCTTCACACGGGCCGCACTGTCCCCGTCGAGACTGCTGCTACTTTGTTAAAGCGCATGTTATTCCGATACCAAGGGCATATAGGTGCTGCACTTGTTTTGGGAGGTGTAGATCGCACCGGACCACATATTTACTGTATTTACCCCCATGGATCTGTTGATAAGTTACCTTATGCAACCATGGGTTCGGGATCACTGGCTGCTATGGCAGTGTTTGAATCTCGGTGGAAGCCAAATATGTCTGAGGAAGAAGGCAAAAAGTTGGTGAGGGATGCCATTGCAGCTGGTATCTTTAATGATCTTGGTTCTGGTTCTAACGTGGACCTGTGTGTCATCCGTTCTTCAGGACCCGCTCAATATCTAAGAACTTATGAGGAAGCCAATGTGAAGGGTAAAAAACAAGGGTCTTATAGATATCCTCTCGGTACTACAGCAGTCTTGAGACAAAGAGTTATTCCACTTGAAGTAGAGTCTGTTGCGGTCCGACCGGTACAGCCAATGGAAACATAG

Protein sequence:

>DPOGS211134-PA
MASVLVPEVPTPGFSFENCQRNAFLSQKGFPAPKATKTGTTIVGIIYADGVILGADTRATENTVVSDKNCQKIHYLAGNMYCCGAGTAADTEMTTQTVASQLELQRLHTGRTVPVETAATLLKRMLFRYQGHIGAALVLGGVDRTGPHIYCIYPHGSVDKLPYATMGSGSLAAMAVFESRWKPNMSEEEGKKLVRDAIAAGIFNDLGSGSNVDLCVIRSSGPAQYLRTYEEANVKGKKQGSYRYPLGTTAVLRQRVIPLEVESVAVRPVQPMET-