Monarch geneset OGS2.0

DPOGS205845
TranscriptDPOGS205845-TA792 bp
ProteinDPOGS205845-PA263 aa
Genomic positionDPSCF300081 + 22526-25712
RNAseq coverage43097x (Rank: top 0%)
Annotation
HeliconiusHMEL0099262e-14587.54% 
BombyxBGIBMGA010867-TA4e-13897.14% 
DrosophilaRpS4-PB6e-12781.85% 
EBI UniRef50UniRef50_P410427e-12581.85%40S ribosomal protein S4 n=199 Tax=root RepID=RS4_DROME
NCBI RefSeqNP_001037257.14e-14897.34%40S ribosomal protein S4 [Bombyx mori]
NCBI nr blastpgi|3151154072e-14998.86%ribosomal protein S4 [Euphydryas aurinia]
NCBI nr blastxgi|3151154074e-14898.86%ribosomal protein S4 [Euphydryas aurinia]
Group
Gene OntologyGO:00058407.9e-221ribosome
GO:00064127.9e-221translation
GO:00056227.9e-221intracellular
GO:00037357.9e-221structural constituent of ribosome
GO:00037231.7e-08RNA binding
KEGG pathwaytca:6577273e-132 
 K02987 (RP-S4e, RPS4)maps-> Ribosome
InterPro domain[2-263] IPR0008767.9e-221Ribosomal protein S4e
[87-181] IPR0138454e-48Ribosomal protein S4e, central
[3-40] IPR0138431.6e-20Ribosomal protein S4e, N-terminal
[43-90] IPR0029421.7e-08RNA-binding S4
[178-211] IPR0058244.4e-06KOW
Orthology groupMCL15166 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205845-TA
ATGGCTCGTGGACCTAAAAAGCATTTGAAGCGTTTAAACGCGCCCAAGGCATGGATGTTGGACAAACTGGGTGGTGTGTACGCACCACGGCCCTCCACCGGTCCACACAAGTTGCGAGAGTGTTTGCCACTTGTTATTTTTCTACGTAACCGCCTGAAGTACGCCCTAACCGGCAATGAGGTACTGAAAATTGTAAAGCAGCGTCTAATCAAGGTCGATGGCAAAGTCAGAACTGATCCGACCTACCCCGCTGGCTTTATGGATGTTGTATCAATTGAGAAGACAAACGAACTGTTCCGTCTGATCTATGATGTCAAGGGTAGATTCACTATTCACAGGATAACTCCCGAGGAGGCCAAGTACAAGCTGTGCAAGGTCCGCCATGTCGGTACTGGTCCCAAGAACGTACCATACCTTGTGACGCACGATGGCCGTACCATCCGTTACCCGGATCCACTCATTAAGGTCAATGACTCCATCCAACTGGACATTGCCTCGTCCAAGATTATGGACTTCATCAAGTTTGAGTCTGGTAACCTGTGCATGATCACCGGAGGTCGTAACTTGGGTCGTGTGGGCACCATCGTGTCCCGTGAGCGTCACCCCGGTTCCTTCGACATTGTACACATCAAGGACTCTATGGGACATACTTTTGCTACCAGATTGAACAACGTGTTCATCATCGGCAAAGGCACGAAGGCTTACATTTCTCTGCCACGCGGTAAAGGTATCCGCCTCACCATCGCCGAGGAGCGCGACAAGCGCATCGCAGCGAAGGTCGCCGCGCACTAG

Protein sequence:

>DPOGS205845-PA
MARGPKKHLKRLNAPKAWMLDKLGGVYAPRPSTGPHKLRECLPLVIFLRNRLKYALTGNEVLKIVKQRLIKVDGKVRTDPTYPAGFMDVVSIEKTNELFRLIYDVKGRFTIHRITPEEAKYKLCKVRHVGTGPKNVPYLVTHDGRTIRYPDPLIKVNDSIQLDIASSKIMDFIKFESGNLCMITGGRNLGRVGTIVSRERHPGSFDIVHIKDSMGHTFATRLNNVFIIGKGTKAYISLPRGKGIRLTIAEERDKRIAAKVAAH-