Monarch geneset OGS2.0

DPOGS205356
TranscriptDPOGS205356-TA930 bp
ProteinDPOGS205356-PA309 aa
Genomic positionDPSCF300295 + 193749-196029
RNAseq coverage18606x (Rank: top 1%)
Annotation
HeliconiusHMEL0045502e-13685.91% 
BombyxBGIBMGA007311-TA1e-13385.05% 
Drosophilasta-PD4e-10169.14% 
EBI UniRef50UniRef50_A2I3Z22e-10166.56%40S ribosomal protein SA n=43 Tax=Eukaryota RepID=RSSA_MACHI
NCBI RefSeqNP_001106143.11e-13184.72%40S ribosomal protein SA [Bombyx mori]
NCBI nr blastpgi|3151154095e-14085.11%ribosomal protein SA [Euphydryas aurinia]
NCBI nr blastxgi|3151154096e-14385.11%ribosomal protein SA [Euphydryas aurinia]
Group
Gene OntologyGO:00064122.6e-177translation
GO:00037352.6e-177structural constituent of ribosome
GO:00159352.6e-177small ribosomal subunit
GO:00058404.4e-31ribosome
GO:00056224.4e-31intracellular
KEGG pathwayphu:Phum_PHUM3273706e-107 
 K02998 (RP-SAe, RPS0)maps-> Ribosome
InterPro domain[1-302] IPR0057072.6e-177Ribosomal protein S2, eukaryotic/archaeal
[8-201] IPR0235919.9e-71Ribosomal protein S2, flavodoxin-like domain
[15-33] IPR0018654.4e-31Ribosomal protein S2
Orthology groupMCL11030 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205356-TA
ATGTCGGGAGGATTAGACATACTAGCCCTCAACGAGGAAGATGTAACCAAAATGTTGGCGGCGACAACCCATTTGGGGTCCGAGAATGTCCACTTCCAGATGGAAACCTACGTGTACAAGCGTCGTGCTGATGGCACCCACGTCATCAACCTTCGCCGTACATGGGAGAAGCTAGTGTTGGCCGCACGTGCGGTAGTCGCTATCGAGAATCCCGCGGATATCTTCGTGATATCTTCTCGTCCTTTCGGTCAACGTGCCGTGCTTAAGTTCGCAGCTCACACTGGAGCCACACCTATCGCTGGACGTTTCACACCTGGTGCCTTCACCAACCAGATCCAAGCCGCTTTCCGTGAGCCTCGTCTCTTGATTGTCTTGGACCCAGCTCAAGATCATCAGCCGATCACCGAGGCTTCATATGTGAACATTCCAGTCATTGCTTTCTGCAACACTGACTCCCCACTCAGATTTGTTGACATTGCCATCCCCTGCAACACTAAGTCGTCCCATTCCATTGGTTTGATGTGGTGGCTGTTGGCACGCGAGGTTCTACGCCTGCGGGGTGTGCTAGCTAGAGATCAGAAGTGGGATGTTGTTGTGGACCTGTTCTTCTACAGAGACCCTGAGGAGAGCGAGAAGGAGGAACAACAGGCTAAGGAACAGGCTGTTGTAGCTGCTAAGGCTGAGGAGGCTGTGCTTCCTGTGGAGTATGTGGAGCCGGTTGAGGCTCCAACTGTATGGGCTGATGATACACCTGGAGCACCTGCTGCTGCTGTAGCCTTCACCGCCACACCACCCAAGGAAGACTGGGCTGCTCAGGTCCAAGAGGAATGGCCGAACGCGCCAACAGCCCCCGCCCCGGCCGCGGCCGCTGCACCGTCATGGGGTGGATCCACCCCAGGGATGGAATCCAGCTTAAACGTCCACTCCTAA

Protein sequence:

>DPOGS205356-PA
MSGGLDILALNEEDVTKMLAATTHLGSENVHFQMETYVYKRRADGTHVINLRRTWEKLVLAARAVVAIENPADIFVISSRPFGQRAVLKFAAHTGATPIAGRFTPGAFTNQIQAAFREPRLLIVLDPAQDHQPITEASYVNIPVIAFCNTDSPLRFVDIAIPCNTKSSHSIGLMWWLLAREVLRLRGVLARDQKWDVVVDLFFYRDPEESEKEEQQAKEQAVVAAKAEEAVLPVEYVEPVEAPTVWADDTPGAPAAAVAFTATPPKEDWAAQVQEEWPNAPTAPAPAAAAAPSWGGSTPGMESSLNVHS-