Monarch geneset OGS2.0

DPOGS211168
TranscriptDPOGS211168-TA1863 bp
ProteinDPOGS211168-PA620 aa
Genomic positionDPSCF300007 + 300187-305975
RNAseq coverage340x (Rank: top 34%)
Annotation
HeliconiusHMEL0172230.081.92% 
BombyxBGIBMGA003154-TA0.078.71% 
DrosophilaCYLD-PE3e-16156.40% 
EBI UniRef50UniRef50_E0VY653e-16457.95%40S ribosomal protein S3a, putative n=9 Tax=Coelomata RepID=E0VY65_PEDHC
NCBI RefSeqXP_002431059.15e-16557.95%40S ribosomal protein S3a, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700077582e-16949.31%hypothetical protein TcasGA2_TC014455 [Tribolium castaneum]
NCBI nr blastxgi|2700077583e-16649.61%hypothetical protein TcasGA2_TC014455 [Tribolium castaneum]
Group
Gene OntologyGO:00058405.9e-122ribosome
GO:00064125.9e-122translation
GO:00056225.9e-122intracellular
GO:00037355.9e-122structural constituent of ribosome
KEGG pathwayphu:Phum_PHUM5095601e-164 
 K08601 (CYLD, USLP2)maps-> RIG-I-like receptor signaling pathway
InterPro domain[35-620] IPR0015935.9e-122Ribosomal protein S3Ae
[118-284] IPR0009382.2e-22Cytoskeleton-associated protein, Gly-rich domain
Orthology groupMCL14354 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211168-TA
ATGAGTCATGAGGTTCATCATGAGGAGAACGAAAATGCAAGCAGCAGTACTTCACAGCTTGAATTATTCAACCTCATTGGAGGGAGTTGGCCACCAGCTCCAAGGGAACCTGAGCGTTTAGAGTATGGGTCAGTGCCCCGTGGTGCGAGACGAACTATGCAGCACTCGGGACACAATGAAAGGATCAAGTCTGTAAACCTGATTAATCACCTGGCACCGGAAAAGAGAAAAATAAGACAAAATACGTCAAACAGATCTGGGATCAGTGAGGAGTCTGTACAAAGTAACAAAATGGTGAACGAAACTAAAAAGGTTGAGACTGGAGTCTTAATTGATGTCTGCGGTGGCGATGTCAACGAAGAGTTTTATGACTCTCCGTCGTTGGGATCCCCGCGTGCTGTTAATTTGGAGGCTTCGAGTCAGTCCCCGCCTGTTTTAGATGACATCGGAGTGGGTTCTCTGGTCGAAGTAGCGACTGACGTTGATCAAAACTACTACGGAGTTGTGAGATGGATCGGACTCATTGATGACATAGCGACAGCCGGCGTGGAGTTGGAGCAGAGCGTGTGTGGTCTCGGTGACGGCCTGCACCGCGGGTCACGTGTTTTCACGTGCGCCTCTGGACGGGCGCTGTTCGTGCCCCTTCCTCTGTGCAGGAGAGACGCACGGTTCACGGACACGCCGCCCCCGGAGACCCACCACACTGAGATCGGAGAACAGCCAGAGTGTCCCGTGGTGACCGGTGTAGTTCCCCCGCTGACTTCGCTGGGTGACCTGGCGGGGAAGAACCGCGGTATCCAGGGACATCACAATTCATGTTACTTGGACGCCACGTTATTCGCGATGTTCACTTTCACCAGCGTCTTTGATGCCTTGCTCTATAGGCCCCCTGAACCTGAGGATTCCCCTCACTACTCGGAGGTGCAGCGTGTTCTCCGTGAGGAGGTGGTGAACCCACTCCGTCGTCACGGGTACGTGAGGGCCGACCGCGTCATGAAGCTGCGTACACTGCTAGAGAGACTCTCGGATGTGCCAGGACTCACGTGCGAGGAGAAGGATCCTGAAGAATTCCTCAACGGACTGGTGGCGCAGCTGCTGAGGGCTGAGCCGTTCCTGAAATTGTCCTCCGGCCAGGAGGCGTTTTGCTACCAGCTGTTCGTGGAGAAGGATGAACACATCACATTACCCAGTGTTCAGCAGCTGTTAGAACAATCATTTGCCACATCTGGAGTGAAACTAAGCGAGGTTCCCGCTGCGTTTATAATACAAATGCCCAGATTCGGCAAACAATACAAGCTGTATCAAAGAGTGCAGCCATCTCCCTTGCTGGATGTCACTGATCTCATCGAAGGATTGCCTCGCCAGTGCACTGTATGTGGCGGTCTTGCTCGCTGGGAATGTTCTTCGTGTGCAGGGGGCGCTCTCGACGCGGGGGCCTTGTGCAACTCCTGCCTGAGGTTGGCACACGCCACCCGACCCACACATAAGGCCGTTCCTTTAACTATCAGTGAAGAATATGCGAATATTCTCGAGTCGTGTCCAGTGCCGAGGGTGTACATGGAGTTGTTTGCTGTGCTCTGTATAGAGACCAGTCACTACGTAGCCTTCGTCAAGACAGGAGTCGGCCATGACGCGCCCTGGTGTTTCTTCGACTCAATGGCCGACAGGAAGGGGGAGCGCAACGGTTACAACATCCCAGAGATCGTGTGTGTGAGTGAGCTGGGGTCGTGGCTGAGCGAGGAGGGCCGCGCGCTCGCTCGGGCTGCGCCCCTCGACCGCCACCTGCCCGCGCCGGCAAAACGACTCCTGTCAGACGCATACATGTGCTTCTACCGCAGCCCCGATGTCGCTATGTACAGATAA

Protein sequence:

>DPOGS211168-PA
MSHEVHHEENENASSSTSQLELFNLIGGSWPPAPREPERLEYGSVPRGARRTMQHSGHNERIKSVNLINHLAPEKRKIRQNTSNRSGISEESVQSNKMVNETKKVETGVLIDVCGGDVNEEFYDSPSLGSPRAVNLEASSQSPPVLDDIGVGSLVEVATDVDQNYYGVVRWIGLIDDIATAGVELEQSVCGLGDGLHRGSRVFTCASGRALFVPLPLCRRDARFTDTPPPETHHTEIGEQPECPVVTGVVPPLTSLGDLAGKNRGIQGHHNSCYLDATLFAMFTFTSVFDALLYRPPEPEDSPHYSEVQRVLREEVVNPLRRHGYVRADRVMKLRTLLERLSDVPGLTCEEKDPEEFLNGLVAQLLRAEPFLKLSSGQEAFCYQLFVEKDEHITLPSVQQLLEQSFATSGVKLSEVPAAFIIQMPRFGKQYKLYQRVQPSPLLDVTDLIEGLPRQCTVCGGLARWECSSCAGGALDAGALCNSCLRLAHATRPTHKAVPLTISEEYANILESCPVPRVYMELFAVLCIETSHYVAFVKTGVGHDAPWCFFDSMADRKGERNGYNIPEIVCVSELGSWLSEEGRALARAAPLDRHLPAPAKRLLSDAYMCFYRSPDVAMYR-