Monarch geneset OGS2.0

DPOGS212089
TranscriptDPOGS212089-TA1131 bp
ProteinDPOGS212089-PA376 aa
Genomic positionDPSCF300038 - 1013504-1015904
RNAseq coverage650x (Rank: top 20%)
Annotation
HeliconiusHMEL0125583e-17674.20% 
BombyxBGIBMGA006722-TA9e-13060.71% 
DrosophilamRpS22-PA4e-5633.61% 
EBI UniRef50UniRef50_D6WJ699e-6938.54%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJ69_TRICA
NCBI RefSeqXP_975160.12e-6938.54%PREDICTED: similar to AGAP007519-PA [Tribolium castaneum]
NCBI nr blastpgi|910839753e-6838.54%PREDICTED: similar to AGAP007519-PA [Tribolium castaneum]
NCBI nr blastxgi|1700413641e-6739.71%mitochondrial 28S ribosomal protein S22 [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain[45-290] IPR0193741.2e-83Ribosomal protein S22, mitochondrial
Orthology groupMCL13665 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212089-TA
ATGTCCTTATTAAAATTGTGTCAGAGTAGTACCAAAACCATAGTCTTTAAATGTACAGAATCACAGCGCCTGTATATTTCTGCCAGAAAATTGAGTATCGTACCTTCAATTTATGATGGCGAAAATCCTGCGCCCAAATTTTTCTCCTCGGGTGTCCAAATAATTCTTAAGAGATTAACAAGACCTAATTTCGAAAAAGTATTTCGGAAAAGAACAAATAGCAACATATCTGTATTAAGGACTCCCGTGTATAAATTTCTAACAGACGAAGAACTAAAAATTGAACAGTCCAATGCATACAAAAGTGCTGAAAGGCTATTACAAATGCCACCAGTTGTAAAGGTACAAGAACCCGTTGATGACATTTTATCGAAAGATCCAGCGTTAGTCGGATATGATTCATCAACTTATTTGTTCACCGATATAACTTATGGTGTTGCAAATGAGCACAGGTTAATAATACAAAGAGAACCGGATGGAACTCTAAGAAGCTGTGACCATGATGTTAGAAAAAAACTGAATCAAATATATTTTCCCATGCAGGGACGTAAATTGAGGGATCCCTTAGTATTTTCTGATCCTGAAAAGTTCAACAGCTTATTGGAGAGACAGAAATATGAATATATATTAGACCGAGCCTGTGTCCAGTATGAGCCGGATGATCCGCAGTACCAGAAATTAACGAGCATCACTTATCAGCATGTAGATATGAATAATCAGTATAATTTAATTCGATCGACTCGTCATTTCGGTCCTCTAGCATTTTACTTAACCTGGCATGATAGTATTGACAATCTTATGCTAGAAATGGTACAGACTGGTATTATTCGTGAAGCTGTCCTACTAATGTCACTTCGGCAAGCTATCAAAGAAGATCTGGTCAATGGTGAAGAGAGTACTGCTTTGGTGTCAGAGATATTACCAACTCGTATACAACTGAGAAAACCTGATTATGTCTCCGAGGATGACATCCAACTAGATACTAAATGTATGGAATGTCTTGACAAATACATAAATAACAACTCGGCTATGAAGAGTCAACAAGGGCTGGCTTTACAGGGCTTCAGGGAATATTACCAACAGTTGATAGAAATCAGTAGGGGATTGCAGAAGGCCCATGGAAGTGCTTAA

Protein sequence:

>DPOGS212089-PA
MSLLKLCQSSTKTIVFKCTESQRLYISARKLSIVPSIYDGENPAPKFFSSGVQIILKRLTRPNFEKVFRKRTNSNISVLRTPVYKFLTDEELKIEQSNAYKSAERLLQMPPVVKVQEPVDDILSKDPALVGYDSSTYLFTDITYGVANEHRLIIQREPDGTLRSCDHDVRKKLNQIYFPMQGRKLRDPLVFSDPEKFNSLLERQKYEYILDRACVQYEPDDPQYQKLTSITYQHVDMNNQYNLIRSTRHFGPLAFYLTWHDSIDNLMLEMVQTGIIREAVLLMSLRQAIKEDLVNGEESTALVSEILPTRIQLRKPDYVSEDDIQLDTKCMECLDKYINNNSAMKSQQGLALQGFREYYQQLIEISRGLQKAHGSA-