Monarch geneset OGS2.0

DPOGS208341
TranscriptDPOGS208341-TA1014 bp
ProteinDPOGS208341-PA337 aa
Genomic positionDPSCF300383 + 73425-74438
RNAseq coverage163x (Rank: top 51%)
Annotation
HeliconiusHMEL0139857e-17785.03% 
BombyxBGIBMGA004078-TA3e-17283.58% 
DrosophilamRpS9-PA4e-13364.69% 
EBI UniRef50UniRef50_G6CLF80.0100.00%Mitochondrial ribosomal protein S9 n=2 Tax=Arthropoda RepID=G6CLF8_DANPL
NCBI RefSeqXP_002019568.13e-13465.58%GL12144 [Drosophila persimilis]
NCBI nr blastpgi|2897400593e-13968.25%mitochondrial ribosomal protein S9 [Glossina morsitans morsitans]
NCBI nr blastxgi|2897400591e-13568.25%mitochondrial ribosomal protein S9 [Glossina morsitans morsitans]
Group
Gene OntologyGO:00058406.7e-113ribosome
GO:00064126.7e-113translation
GO:00056226.7e-113intracellular
GO:00037356.7e-113structural constituent of ribosome
KEGG pathwaymxa:MXAN_19944e-19 
 K02996 (RP-S9, rpsI)maps-> Ribosome
InterPro domain[1-337] IPR0007546.7e-113Ribosomal protein S9
[218-337] IPR0147213.2e-32Ribosomal protein S5 domain 2-type fold, subgroup
[210-337] IPR0205688.4e-31Ribosomal protein S5 domain 2-type fold
Orthology groupMCL15766 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208341-TA
ATGAAGGCTTATTTGGAACGAGCTCGAGAGCATGACGAGTTTATAAAAAAACAACAATTCGAATATAACATAGGTAAACGTCACCTCGCTAACATGATGGGCGAGGATCCAGAAACATTCACTCAAAAAGATGTGGATCGCGCCATTGAATATTTATTTCCCAGTGGTATTTACGATCCAGCTGCCAGACCACTAATGAAGCCACCAGAAGAAGTATTCCCAACTCGAAAGGCCGCAGAGTTCGATGAAGCCGGACGTCCGCATCATTTTCTATTTTACACAGGAAAACCAAATTTTTTCAAATTATTATATGATGCAACTGAAACCTTACAGAGTTTGTATAAGTTTGAGGACAAGGTGATAAGAAAGAAAGAAATTCCTGATCCAAATGCAAAACTTGAATTATTAGCAAGTGTTTGGATTACAAAAGATGAATTAGAACAGTCTTTGGTAGAGAAGTTAACTGACTTAGAATATGATAACTTTAAATTAGTTATGGAAAGAATAGTTTCATCACCATACTCTTATAGGTGCAAAGATTTCATTGAAAAGTACAGGAAACCCCTTGCATCTCAAAGATTTGCATTAGAGATTCCAAAACCGAGTTATGATCAAGATGGCCGTGCTTTTATTACAACATATGAATGTTTACGTAAAAAAGCTAGAGGTGATGTAACAATCAGATCACCTGGAACAGGCAAGATAACAATAAATGGCAAAGATTTGACCTATTTCCATGATGTACAGTCAAGAGAGCAAGTCATTTTCCCTTTAATATTTAGTGAAATGTTAGGAAAGGTAGATGTGGAATGTAATATTGAAGGAGGTGGACCTTCAGGGCAGTCTGGAGCTATAAGGTGGGGTATAGCTTGGGGATTACGTAGCTTCGTTGATAAGAGTATGCTTGAAGCCATGCAAGTAGCCGGTCTACTAACAAGAGACCATAGAAGGCGTGAACGTAAGAAGCCAGGGCAACCAGGAGCAAGAAAGAAGCCCACTTGGAAAAAGAGATAG

Protein sequence:

>DPOGS208341-PA
MKAYLERAREHDEFIKKQQFEYNIGKRHLANMMGEDPETFTQKDVDRAIEYLFPSGIYDPAARPLMKPPEEVFPTRKAAEFDEAGRPHHFLFYTGKPNFFKLLYDATETLQSLYKFEDKVIRKKEIPDPNAKLELLASVWITKDELEQSLVEKLTDLEYDNFKLVMERIVSSPYSYRCKDFIEKYRKPLASQRFALEIPKPSYDQDGRAFITTYECLRKKARGDVTIRSPGTGKITINGKDLTYFHDVQSREQVIFPLIFSEMLGKVDVECNIEGGGPSGQSGAIRWGIAWGLRSFVDKSMLEAMQVAGLLTRDHRRRERKKPGQPGARKKPTWKKR-