Monarch geneset OGS2.0

DPOGS204482
TranscriptDPOGS204482-TA1839 bp
ProteinDPOGS204482-PA612 aa
Genomic positionDPSCF300002 + 1076813-1079788
RNAseq coverage8977x (Rank: top 2%)
Annotation
HeliconiusHMEL0156890.076.31% 
BombyxBGIBMGA007710-TA1e-12094.98% 
Drosophiladare-PA1e-10753.58% 
EBI UniRef50UniRef50_E2A1581e-11054.85%NADPH:adrenodoxin oxidoreductase, mitochondrial n=9 Tax=Coelomata RepID=E2A158_CAMFO
NCBI RefSeqNP_001037259.14e-11994.98%ribosomal protein S5 [Bombyx mori]
NCBI nr blastpgi|3151154057e-12297.25%ribosomal protein S5 [Euphydryas aurinia]
NCBI nr blastxgi|3151154052e-11697.25%ribosomal protein S5 [Euphydryas aurinia]
Group
Gene OntologyGO:00064122.2e-93translation
GO:00037352.2e-93structural constituent of ribosome
GO:00159352.2e-93small ribosomal subunit
GO:00054883.3e-07binding
KEGG pathway 
InterPro domain[424-612] IPR0057162.2e-93Ribosomal protein S7, eukaryotic/archaeal
[425-612] IPR0237981.6e-90Ribosomal protein S7 domain
[302-374] IPR0160403.3e-07NAD(P)-binding domain
Orthology groupMCL14931 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204482-TA
ATGCATATAACCAAGCATTTCAGACCAGTCAAGATTGACATTCTAGAAAAACTTCCTGTACCCTTCGGGCTTGTAAGGTATGGTGTGGCTCCTGATCATCCGGAAGTTAAAAATGTTATCAATCAATTTTCAAAATGTGCTCAACAAGATAATGTAAATTTTTATGGTAATATTACATTAGGAAAGGATATTAGTTTAAAACAAATAAGACAGCACTATGATGCTGTATTGCTGACTTATGGGGCTGAGGAAGACAGGATTCTAGGTATAGATAATGAAGATGCTCATAATGTTATAGCAGCCAGGAATTTTGTTGGATGGTATAATGGACACCCCAGGGATAGAAATTTAAAGGTTGATTTATCTCAACCAACAGCAGCTATTCTTGGTCAAGGCAATGTTGCTTTGGATGTGGCAAGAATACTTCTCTCACCTATTGATGAATTAAAGAAAACAGACATCACCGAATATGCCCTTCAAGCATTGGCTGATTCAAGAGTCAAGGAATTATATCTTGTTGGCAGAAGAGGACCCTTACAAGTAGCATTTACTATAAAGGAACTTCGTGAACAAATCAAGTTAAGAAATTGTTCAACTTTATGGAGAGAAAATGATTTTCAAGGTGTAGCTGATGCCGTTAGTCAATTAGAAAGACCAAGGAAGAGGTTGACCGAACTAATGCTCAAATCATTAGCTGAAAATAGCATAAAAGAAGGTTATGAAAAATGCTTTAAACCAATTTTCTTCAGAAGCCCAAAAAGGTTTTTAGTTGATGGTGATAAAAACCTTACTGGTATTGAATTAGTGTGTAATAAGCTTGTAGGGGATAGTATAGAAAATCAGAAGTGTGTACCAACAGAAAAATTAGAAATTTTAAAATGTAATCTTGCTTTTCGTAGCATAGGATATAAAAGTATTAAGGTTGATAATGATCTAACATTCAATTCATATGGGTATGTTCATAATTCTAAAGGTCGTATTGAAGATTTGGAGTGTAAAGGTTTGGCAAAAGTCTATGTCTCTGGTTGGCTAGGAACAGGCCCAGTCGGTGTTATTTTACACACAATGGGAAATGCATTTCAAGTTGCAAAAATGATTTGTGAAGATTTAAAACAGGATAAATTTGATATGGATAAAGGTAACACTTTGATTATGGCAACACCGATGTTCTACGTGTTAACAATGGCCGAGGAAAATTGGACCGAAGATGGCGAGGCAGGCAGCATGGCTGTCGATGCCATGCCCCCGCCACAACCGGCAGATATCCCAGAGATCAAACTCTTTGGAAGATGGAGTTGTTATGATGTCCAAGTTTCGGACATGTCCCTACAGGATTATATTTCTGTGAAAGAAAAGTACGCCAAGTACTTACCTCACTCAGCGGGCAGGTATGCACACAAACGGTTCCGCAAAGCTCAGTGCCCAATTGTTGAGCGTCTGACCAATTCCCTGATGATGCACGGCAGAAACAACGGCAAAAAACTTATGGCTGTTCGTATTGTCAAACACGCATTCGAGATCATCCACCTTCTGACAGGTGAAAATCCCCTTCAGGTTCTTGTGACAGCTATCATTAACTCGGGCCCTCGTGAAGATTCCACTAGGATCGGTCGTGCTGGTACGGTGCGTCGTCAGGCCGTGGATGTTTCTCCCCTACGTCGTGTGAACCAGGCTATTTGGCTGTTATGCACTGGCGCTCGTGAAGCTGCCTTCAGGAACATCAAAACGATTGCTGAATGTGTTGCTGATGAGCTCATCAATGCCGCTAAAGGCTCATCCAACTCATATGCCATCAAGAAGAAAGATGAGCTGGAACGTGTTGCTAAATCCAACCGTTAA

Protein sequence:

>DPOGS204482-PA
MHITKHFRPVKIDILEKLPVPFGLVRYGVAPDHPEVKNVINQFSKCAQQDNVNFYGNITLGKDISLKQIRQHYDAVLLTYGAEEDRILGIDNEDAHNVIAARNFVGWYNGHPRDRNLKVDLSQPTAAILGQGNVALDVARILLSPIDELKKTDITEYALQALADSRVKELYLVGRRGPLQVAFTIKELREQIKLRNCSTLWRENDFQGVADAVSQLERPRKRLTELMLKSLAENSIKEGYEKCFKPIFFRSPKRFLVDGDKNLTGIELVCNKLVGDSIENQKCVPTEKLEILKCNLAFRSIGYKSIKVDNDLTFNSYGYVHNSKGRIEDLECKGLAKVYVSGWLGTGPVGVILHTMGNAFQVAKMICEDLKQDKFDMDKGNTLIMATPMFYVLTMAEENWTEDGEAGSMAVDAMPPPQPADIPEIKLFGRWSCYDVQVSDMSLQDYISVKEKYAKYLPHSAGRYAHKRFRKAQCPIVERLTNSLMMHGRNNGKKLMAVRIVKHAFEIIHLLTGENPLQVLVTAIINSGPREDSTRIGRAGTVRRQAVDVSPLRRVNQAIWLLCTGAREAAFRNIKTIAECVADELINAAKGSSNSYAIKKKDELERVAKSNR-