Monarch geneset OGS2.0

DPOGS212020
Transcript        DPOGS212020-TA  915 bp
Protein           DPOGS212020-PA  304 aa
Genomic position  DPSCF300054 - 841004-843239
RNAseq coverage   378x (Rank: top 32%)
Annotation
Heliconius      HMEL013601        7e-119  68.75%
Bombyx          BGIBMGA010173-TA  9e-136  73.53%
Drosophila      mRpL19-PA         1e-92   64.88%
EBI UniRef50    UniRef50_E3WJ31   1e-90   64.88%  Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WJ31_ANODA
NCBI RefSeq     XP_322023.2       3e-96   67.36%  AGAP001139-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|58396608       5e-95   67.36%  AGAP001139-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|58396608       5e-94   67.63%  AGAP001139-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0005840  4.3e-120  ribosome
                 GO:0006412  4.3e-120  translation
                 GO:0005622  4.3e-120  intracellular
                 GO:0003735  4.3e-120  structural constituent of ribosome
KEGG pathway     pub:SAR11_0253  4e-08
                 K02884 (RP-L19, rplS) maps-> Ribosome
InterPro domain  [58-293]  IPR001857  4.3e-120  Ribosomal protein L19
                 [81-193]  IPR008991  2e-26     Translation protein SH3-like
Orthology group  MCL13933  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212020-TA
ATGTCACTTGCTATGAGGAAGTCCCCTATAGAATATTTTAATGCGGCTAAATGGATCATCAGAAGAGACTGCCGTTACTTATCAAATTTACCAGAAGTAGCTGAAACAACTGAAGTGGTTGAAAATAAGCTTCCAAAAGGACGGAAACCAAGATTTCTCAATCCAACCTTGCAGTACAGACATGTCTATCCTGAATTTTTGCCAGATCCAAATCCGAAATTTCGTAATAGTCTCAGAGAGAAGCTGGAGAGAGCTGATATGCTTAAAAGACGGAGTCAGGTTGATATACCTGAGTTTTATGTAGGTACTATATTAGCTGTAACTATATCAGATCCTCATGCTCAAGGAAAAACAAACAAATTTGTTGGTATCTGTATTGAGCGTAAAGGTTGTGGACTCAGAGCTGAATTCACTCTGAGAAATGTTATTGATCATCAGGGAATTGAGGTCCGATATGATCTCTATGATCCCACAATACAAAACATTCAAGTACTAAGGCTGGAGAAGAGGTTAGATGATAAGTTGTTATACTTGCGTGATGCTCTACCTGAGTATTGCACCTTCCCTATTGATATGGATCCAGAAATACTCCCTGAGGGTAGTCCCGTGCCTGTTAATACTGTTCAGGTAAAATTGAAACCAAGGCCATGGCTTGAGAGATGGGAAAGACAAGAACTTAAAGGCGTTTCAAATATTGAGGAGCATTTGAAAGAAAAGGACAGAGTTAGAAGAGAGTTAAGGAAAACGCCTTGGGAGAAGTTTGATTTGATGAAAGATTACAGGAAAACCATCCCTATTGAAGACCAAGCGGAGATTTGGGGTGAAGTTTATAACCAACTTCAACAAATGCGTGTATCGCGTAAGAAAATGTCAAAACAACGAACATTTACTGCTCCCAAGACGCAATTGGGATAG

Protein sequence:

>DPOGS212020-PA
MSLAMRKSPIEYFNAAKWIIRRDCRYLSNLPEVAETTEVVENKLPKGRKPRFLNPTLQYRHVYPEFLPDPNPKFRNSLREKLERADMLKRRSQVDIPEFYVGTILAVTISDPHAQGKTNKFVGICIERKGCGLRAEFTLRNVIDHQGIEVRYDLYDPTIQNIQVLRLEKRLDDKLLYLRDALPEYCTFPIDMDPEILPEGSPVPVNTVQVKLKPRPWLERWERQELKGVSNIEEHLKEKDRVRRELRKTPWEKFDLMKDYRKTIPIEDQAEIWGEVYNQLQQMRVSRKKMSKQRTFTAPKTQLG-
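
The reported lengths can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the transcript sequence is a complete coding sequence with no UTRs, and that the trailing "-" in the protein sequence marks the stop codon. The function name and the stop-codon set (standard genetic code) are illustrative choices.

```python
# Stop codons of the standard genetic code (assumed to apply here).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def coding_length_consistent(transcript_bp: int, protein_aa: int,
                             has_stop: bool = True) -> bool:
    """Check that a coding sequence length matches a protein length.

    A complete CDS has three bases per residue, plus one extra codon
    when the terminal stop codon is included in the sequence.
    """
    if transcript_bp % 3 != 0:
        return False  # not a whole number of codons
    codons = transcript_bp // 3
    expected_residues = codons - 1 if has_stop else codons
    return expected_residues == protein_aa


# Values taken from the record: 915 bp transcript, 304 aa protein.
print(coding_length_consistent(915, 304))

# The transcript above begins with ATG and ends with TAG.
first_codon, last_codon = "ATG", "TAG"
print(first_codon == "ATG" and last_codon in STOP_CODONS)
```

Both checks pass for this entry: 915 bp is 305 codons, that is, 304 residues plus the stop codon.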