Monarch geneset OGS2.0

DPOGS202037
TranscriptDPOGS202037-TA786 bp
ProteinDPOGS202037-PA261 aa
Genomic positionDPSCF300053 - 100277-101254
RNAseq coverage23236x (Rank: top 0%)
Annotation
HeliconiusHMEL0099414e-14999.62% 
BombyxBGIBMGA000867-TA3e-14899.23% 
DrosophilaRpS2-PA7e-11489.81% 
EBI UniRef50UniRef50_B4G7B46e-10690.59%GL18987 n=9 Tax=Eukaryota RepID=B4G7B4_DROPE
NCBI RefSeqNP_001037564.18e-14799.23%ribosomal protein S2 [Bombyx mori]
NCBI nr blastpgi|3423563173e-14699.62%ribosomal protein S2 [Heliconius melpomene cythera]
NCBI nr blastxgi|3423563172e-14899.62%ribosomal protein S2 [Heliconius melpomene cythera]
Group
Gene OntologyGO:00058402.1e-174ribosome
GO:00064122.1e-174translation
GO:00056222.1e-174intracellular
GO:00037352.1e-174structural constituent of ribosome
GO:00159351.7e-94small ribosomal subunit
GO:00037231.9e-29RNA binding
KEGG pathwayapi:1001675321e-114 
 K02981 (RP-S2e, RPS2)maps-> Ribosome
InterPro domain[1-259] IPR0008512.1e-174Ribosomal protein S5
[44-249] IPR0057111.7e-94Ribosomal protein S5, eukaryotic/archaeal
[163-238] IPR0147217.5e-33Ribosomal protein S5 domain 2-type fold, subgroup
[92-153] IPR0147201.9e-29Double-stranded RNA-binding-like
[92-157] IPR0138106.2e-28Ribosomal protein S5, N-terminal
[168-249] IPR0205681.1e-27Ribosomal protein S5 domain 2-type fold
[174-245] IPR0053241.7e-21Ribosomal protein S5, C-terminal
Orthology groupMCL10678 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202037-TA
ATGGCGGACGCAGCTCCAGCCGGTGGACGCGGTGGTTTCCGTGGCGGATTCGGATCACGTGGTGGTGATAGAGGACGTGGTGGCCCACGCGGTCGTGGTCGTGGTCGCGGCCGTGGCCGTGGACGCGGTAAGGAAGAGCAGAAAGAATGGGTGCCTGTAACCAAGCTGGGACGTCTCGTGCGAGAGGGAAAGATCGACAAGCTTGAGAGCATTTATCTGTTCTCATTGCCCATTAAAGAATTCGAAATTATCGATTTCTTCCTTGGTGCCTCTTTGAACGATGAGGTTTTGAAGATCATGCCCGTACAGAAGCAGACTAGGGCTGGTCAACGTACCCGTTTCAAGGCTTTCGTGGCTATCGGGGACAACAATGGACACATTGGTCTTGGAGTTAAGTGCAGCAAGGAAGTAGCTACAGCTATCCGAGGCGCTATTATTCTAGCAAAATTGTCCGTTCTGCCTGTCCGCAGAGGATACTGGGGTAACAAGATCGGCAAGCCTCATACTGTACCCTGCAAGGTAACTGGTAAGTGTGGTTCAGTCACTGTCCGTCTTATTCCTGCTCCCCGTGGTACTGGTATTGTGTCGGCCCCTGTTCCCAAGAAGCTTCTTCAGATGGCCGGAGTCCAGGATTGCTACACCTCAGCTCGTGGTTCAACTGGAACCCTCGGCAACTTTGCCAAAGCCACCTATGCGGCCATTGCAAAGACCTATGCCTACTTGACACCAGACTTATGGAGGGACATTCCATTGACCAAATCACCATACTCTGAATTCAAAGTCTAA

Protein sequence:

>DPOGS202037-PA
MADAAPAGGRGGFRGGFGSRGGDRGRGGPRGRGRGRGRGRGRGKEEQKEWVPVTKLGRLVREGKIDKLESIYLFSLPIKEFEIIDFFLGASLNDEVLKIMPVQKQTRAGQRTRFKAFVAIGDNNGHIGLGVKCSKEVATAIRGAIILAKLSVLPVRRGYWGNKIGKPHTVPCKVTGKCGSVTVRLIPAPRGTGIVSAPVPKKLLQMAGVQDCYTSARGSTGTLGNFAKATYAAIAKTYAYLTPDLWRDIPLTKSPYSEFKV-