Monarch geneset OGS2.0

DPOGS205619
TranscriptDPOGS205619-TA1116 bp
ProteinDPOGS205619-PA371 aa
Genomic positionDPSCF300023 - 955766-957730
RNAseq coverage228x (Rank: top 44%)
Annotation
HeliconiusHMEL0073421e-14683.04% 
BombyxBGIBMGA001131-TA5e-16773.07% 
DrosophilaRrp45-PA5e-5937.58% 
EBI UniRef50UniRef50_D2A3643e-9051.19%Putative uncharacterized protein GLEAN_07574 n=1 Tax=Tribolium castaneum RepID=D2A364_TRICA
NCBI RefSeqXP_974147.16e-9151.19%PREDICTED: similar to AGAP002348-PA [Tribolium castaneum]
NCBI nr blastpgi|910806131e-8951.19%PREDICTED: similar to AGAP002348-PA [Tribolium castaneum]
NCBI nr blastxgi|910806131e-8844.62%PREDICTED: similar to AGAP002348-PA [Tribolium castaneum]
Group
KEGG pathwaytca:6629872e-90 
 K03678 (RRP45, EXOSC9)maps-> RNA degradation
InterPro domain[22-170] IPR0205683e-29Ribosomal protein S5 domain 2-type fold
[31-163] IPR0012472.2e-21Exoribonuclease, phosphorolytic domain 1
Orthology groupMCL15379 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205619-TA
ATGAGGGATCTTTATTTATCGAATTGTGAAAAGAATTTTATTCAAAAAATTATTTCGGAAGGGCATCGATTAGATGGCAGAATTTACAATGAGAGCCGAAAATTAGATATATCATATGGATCAGAATATGGCAGTTGTATTGTTTCTTTAGGAGAAACTAAGATATTAGCCCAAGTATCATGTGAAGTAGTACAGCCTAAGCAAATAAGGCCCAACGAAGGTATTCTATTTATTAATGTAGAAATAAGTCCTATGGCCGCACCACAGTTTGAGGCCAACAGACAGACAGATCTTACAGTTTATCTTAACAGACTTTTGGAGAAGTGCTATAAAGACTCTAAATGTATAGATTTAGAATCCTTATGCATTGTTGTAGAAGAGAAGGTGTGGTCATTAAGAGTAGATATAAAAGTTTTAAATCATGACGGAAACTTAATTGAATGTGCCAGCATCGCTACACTAGCATCATTGGCCCATTTTAAAAGACCCGATGTAACTCGTAGTGGAGATAGTATTATAATTCACACATTATCTGAAAAAGACCCCATACCTACCGTCTTATACCACTACCCTGTTTGTGTAACTTTTGCTATATTTAATAACAATATTCTAGTATCGGACCCTAGTTTTATAGAAGAATCAGTGTGCACAAGTAACTCTGAGGAAGGCAGTACAGGAGGATTGCTAGTCGTTGGAATGAATCAGTATAAGGAGTTGTGTGTATTGAATCTAAGTGGTGCTGCTATATATAATTCAAATGTTGTGCATAAAGCCATACTTAATGCGGCTGAGAAGTGTAAGAATATTGTCGAAGAGGTTAAGAGTAAAATTGTTACCGATGATAATTTTAGGCAAAAACGTGTTAAGATCAATTTCGCCGATATTATTACAACAGATCACATACAGACACTGTGTAAGAAGGATCTTAGTATTTGTCTTAAAAATTTTAAAATCAATGATGTGAAACAAGAGGACGATTCGGAAATGTCAGAGGATAATGTAACAGAAGACAGTAAATACGACGTTGTAGCAACAAAGCCAAATGTAGCTGAAATTAAGTCCAAGGAAGCTAAATCGTCGTGGATAGAAATATCATCAGAATCTGAAGAAGATTAA

Protein sequence:

>DPOGS205619-PA
MRDLYLSNCEKNFIQKIISEGHRLDGRIYNESRKLDISYGSEYGSCIVSLGETKILAQVSCEVVQPKQIRPNEGILFINVEISPMAAPQFEANRQTDLTVYLNRLLEKCYKDSKCIDLESLCIVVEEKVWSLRVDIKVLNHDGNLIECASIATLASLAHFKRPDVTRSGDSIIIHTLSEKDPIPTVLYHYPVCVTFAIFNNNILVSDPSFIEESVCTSNSEEGSTGGLLVVGMNQYKELCVLNLSGAAIYNSNVVHKAILNAAEKCKNIVEEVKSKIVTDDNFRQKRVKINFADIITTDHIQTLCKKDLSICLKNFKINDVKQEDDSEMSEDNVTEDSKYDVVATKPNVAEIKSKEAKSSWIEISSESEED-