Monarch geneset OGS2.0

DPOGS203182
Transcript: DPOGS203182-TA, 1488 bp
Protein: DPOGS203182-PA, 495 aa
Genomic position: DPSCF300035 - 159577-168462
RNAseq coverage: 3x (Rank: top 90%)
Annotation
Heliconius       HMEL007190         2e-124   95.74%
Bombyx           BGIBMGA011416-TA   2e-122   93.59%
Drosophila       RpS3A-PA           4e-100   72.80%
EBI UniRef50     UniRef50_Q6EV04    2e-109   82.20%   40S ribosomal protein S3a n=101 Tax=Eukaryota RepID=RS3A_BIPLU
NCBI RefSeq      NP_001037255.1     6e-121   93.59%   40S ribosomal protein S3a [Bombyx mori]
NCBI nr blastp   gi|251831260       3e-131   94.51%   s3a protein [Actias selene]
NCBI nr blastx   gi|315115455       4e-137   97.25%   ribosomal protein S3A [Euphydryas aurinia]
Group
Gene Ontology     GO:0005840   8.6e-169   ribosome
                  GO:0006412   8.6e-169   translation
                  GO:0005622   8.6e-169   intracellular
                  GO:0003735   8.6e-169   structural constituent of ribosome
                  GO:0016020   6.3e-59    membrane
                  GO:0007155   6.3e-59    cell adhesion
KEGG pathway      tca:656438   7e-110
                  K02984 (RP-S3Ae, RPS3A)   maps-> Ribosome
InterPro domain   [1-239]     IPR001593   8.6e-169   Ribosomal protein S3Ae
                  [255-474]   IPR002159   6.3e-59    CD36 antigen
Orthology group   MCL10509   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS203182-TA
ATGGCGGTCGGTAAAAATAAAGGCCTATCGAAAGGCGGTAAAAAGGGAGTTAAAAAGAAGATCGTCGACCCGTTCACACGCAAAGACTGGTACGATGTCAAAGCACCGTCGATGTTCACCAAGAGGCTGGTGGGAACCACGCTCGTCAACCGCACCCAGGGAACAAAAATCGCTTCTGAGGGTCTGAAGGGCCGCGTCTTCGAGGTTTCCCTCGCTGATTTACAAGCTGACACAGATGCGGAAAGGTCTTTCCGCAAGTTCCGCTTGATCGCTGAAGATGTCCAGGGACGTAATGTCCTCTGTAACTTCCATGGCATGGATCTTACTACTGACAAGTTGAGATGGATGGTTAAGAAATGGCAGACATTAGTTGAAGCCAACATTGATGTGAAGACCACCGACGGCTACCTCCTGAGGGTGTTCTGCATCGGCTTCACTAACAAGGACTCGCTCAGTCAGAGGAAGACATGCTATGCTCAGCACACCCAGGTCCGCGCCATCAGGAAGAAGATGTGCGAGATCATAACCCGTGATGTAGCGGGATCTGAATTGAAGGAAGTTGTCAACAAACTGATCCCCGACTCCATCGCCAAGGATATTGAAAAAGCCTGCCTCAGCATCTACCCGCTGAGGGATGTGTGCATCCGTAAGGTGAAAGTGCTCAAACGTCCGAGGTTTGAGATTGCCAAGCTGATGGAGCTGCACGGAGAAGGCGGTGGCAAGAGGGGAGAAGGAGGTGACAAGTCAGACAGACCTGAAGGCATAACCTCCTCCGACGGCACCATTTTCCCGCCGTCCCTGCTGAACAAGAAGACCAGGCTGTTCGTGTTCAACTCCAACATGTGCAGGAGACTGCCGTTCGACTACCTCAAAGACGTTGAGATGGAACAAGGGATCCGCCTCATGAGGTATGTCATGCCATCAAACGTGTTCGATGATCCCCAAAGCAATCCCGACAACCAGTGCTACTGTGATGTGGACAGCGGTACCTGCCCTCCGAGAGGAATCATAAACGTCACAGCCTGTTCTATGGGCGCTCCCCTGGTGGCTTCCTTTCCCCATTTCTACCGCGGAGACCCCAAACTGTATGAGGACATCCAAGGCCTGAGCCCCAACGGCGAGCTCCATGACTCCTTTATTGACATACACCCAACCCTTGGCATCGCACTGAACGGCAGATCAAGCCTCCAGCTGAACATCCAAGTCAAGAAGTCCAGCGTGTTCGGCGCTCTCAGTTTCCTGCCAGAAGGAATTATCCTGCCAATCGCGTGGATAGAAATGGCGCTCGAAGAACTACCAGAGAGTCTTCAATCGTTGGTCTACCACGGGACTTTTTCCACAGCGGCCGTGCAACTCGGCCTGGCCATGTTCTGTTCCGTAACCCTGCTCATTTCTTCCATATGCATGCTGATGATGATCATCACTAGAAGAAGGAAACCATGCGCCACCCTCAAAATTATACCAGCCGACATTGAACTGAAGACTTAA

Protein sequence:

>DPOGS203182-PA
MAVGKNKGLSKGGKKGVKKKIVDPFTRKDWYDVKAPSMFTKRLVGTTLVNRTQGTKIASEGLKGRVFEVSLADLQADTDAERSFRKFRLIAEDVQGRNVLCNFHGMDLTTDKLRWMVKKWQTLVEANIDVKTTDGYLLRVFCIGFTNKDSLSQRKTCYAQHTQVRAIRKKMCEIITRDVAGSELKEVVNKLIPDSIAKDIEKACLSIYPLRDVCIRKVKVLKRPRFEIAKLMELHGEGGGKRGEGGDKSDRPEGITSSDGTIFPPSLLNKKTRLFVFNSNMCRRLPFDYLKDVEMEQGIRLMRYVMPSNVFDDPQSNPDNQCYCDVDSGTCPPRGIINVTACSMGAPLVASFPHFYRGDPKLYEDIQGLSPNGELHDSFIDIHPTLGIALNGRSSLQLNIQVKKSSVFGALSFLPEGIILPIAWIEMALEELPESLQSLVYHGTFSTAAVQLGLAMFCSVTLLISSICMLMMIITRRRKPCATLKIIPADIELKT-
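
The transcript and protein lengths reported in this record (1488 bp and 495 aa) can be cross-checked against each other. The following is a minimal sketch in Python, not part of the original record: the function names `parse_fasta` and `cds_matches_protein` are hypothetical, and the check assumes the transcript is a complete coding sequence ending in a single stop codon (which the trailing `-` in the protein sequence appears to mark).

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a {header: sequence} dict."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line  # join wrapped sequence lines
    return records


def cds_matches_protein(nt_len, aa_len):
    """Return True if a CDS of nt_len bases encodes aa_len residues
    plus one stop codon (assumption: complete CDS, no UTRs)."""
    return nt_len == 3 * (aa_len + 1)


# Lengths as reported in the record: 1488 bp transcript, 495 aa protein.
print(cds_matches_protein(1488, 495))
```

With the values above, 3 x (495 + 1) = 1488, so the reported lengths are consistent under that assumption. To run the same check on the sequences themselves, pass the two FASTA blocks through `parse_fasta` and strip the trailing `-` from the protein before taking its length.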