Monarch geneset OGS2.0

DPOGS207421
TranscriptDPOGS207421-TA738 bp
ProteinDPOGS207421-PA245 aa
Genomic positionDPSCF300087 + 298643-300518
RNAseq coverage29330x (Rank: top 0%)
Annotation
HeliconiusHMEL0020703e-13198.68% 
BombyxBGIBMGA009319-TA5e-13595.90% 
DrosophilaRpS3-PA2e-10579.06% 
EBI UniRef50UniRef50_P233964e-11282.79%40S ribosomal protein S3 n=166 Tax=root RepID=RS3_HUMAN
NCBI RefSeqNP_001037253.12e-13395.90%ribosomal protein S3 [Bombyx mori]
NCBI nr blastpgi|13509901e-13396.72%ribosomal protein S3 [Manduca sexta]
NCBI nr blastxgi|3151153419e-12996.68%ribosomal protein S3 [Euphydryas aurinia]
Group
Gene OntologyGO:00064125.1e-71translation
GO:00037355.1e-71structural constituent of ribosome
GO:00159355.1e-71small ribosomal subunit
GO:00058405.8e-28ribosome
GO:00056225.8e-28intracellular
GO:00037236.4e-25RNA binding
KEGG pathwaynvi:1001142254e-114 
 K02985 (RP-S3e, RPS3)maps-> Ribosome
InterPro domain[9-211] IPR0057035.1e-71Ribosomal protein S3, eukaryotic/archaeal
[93-194] IPR0013515.8e-28Ribosomal protein S3, C-terminal
[11-102] IPR0090196.4e-25K Homology, prokaryotic type
[7-92] IPR0159468.1e-14K homology domain-like, alpha/beta
[46-84] IPR0040441e-05K Homology, type 2
Orthology groupMCL13875 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207421-TA
ATGGCGGTGAACAACATTTCCAAAAAGCGAAAATTCGTCGGGGACGGAGTGTTCAAGGCTGAATTGAACGAGTTCCTTACTCGAGAGTTGGCCGAAGATGGCTACTCTGGCGTGGAGGTGCGTGTCACTCCTACCCGATCAGAAATCATCATTATGGCTACCAGGACACAAAGCGTACTGGGAGAGAAAGGTCGCAGGATCCGTGAATTGACGTCAGTTGTACAAAAGAGGTTTAATATCCCCGAGCACTCAGTCGAGCTTTACGCCGAGAAGGTCGCCACCCGCGGTCTCTGTGCCATCGCTCAAGCTGAATCCCTACGTTACAAACTCATTGGAGGATTGGCCGTCCGTCGTGCGTGCTACGGTGTGCTGAGATTCATCATGGAATCTGGTGCTCGTGGCTGTGAAGTCGTAGTCTCCGGCAAGTTGAGAGGTCAGAGAGCTAAGTCTATGAAGTTTGTGGATGGACTCATGATCCACTCGGGAGACCCCTGCAATGACTACGTTAACACCGCCACCCGACATGTACTGCTCAGACAAGGAGTACTTGGTATTAAGGTGAAGATCATGTTACCGTGGGACCAACAAGGCAAGAACGGACCCAAGAAGCCCCAGCCGGACCACATCTTGGTGACGGAGCCCAAGGACGAGCCCGCGCCCCTGGAGCCCACCTCGGACGTGAGGTCGTTGGCCCCGGTGCCCGCCGCCGCACCAACGCCCGCGCCTGTCGCTGTCTAG

Protein sequence:

>DPOGS207421-PA
MAVNNISKKRKFVGDGVFKAELNEFLTRELAEDGYSGVEVRVTPTRSEIIIMATRTQSVLGEKGRRIRELTSVVQKRFNIPEHSVELYAEKVATRGLCAIAQAESLRYKLIGGLAVRRACYGVLRFIMESGARGCEVVVSGKLRGQRAKSMKFVDGLMIHSGDPCNDYVNTATRHVLLRQGVLGIKVKIMLPWDQQGKNGPKKPQPDHILVTEPKDEPAPLEPTSDVRSLAPVPAAAPTPAPVAV-