Monarch geneset OGS2.0

DPOGS214363
TranscriptDPOGS214363-TA1485 bp
ProteinDPOGS214363-PA494 aa
Genomic positionDPSCF300020 + 709866-712799
RNAseq coverage669x (Rank: top 19%)
Annotation
HeliconiusHMEL0063590.088.33% 
BombyxBGIBMGA003977-TA0.088.75% 
DrosophilaCG17259-PA0.076.24% 
EBI UniRef50UniRef50_P495910.069.70%Serine--tRNA ligase, cytoplasmic n=58 Tax=Eumetazoa RepID=SYSC_HUMAN
NCBI RefSeqXP_001962529.10.077.69%GF15510 [Drosophila ananassae]
NCBI nr blastpgi|3286708850.089.17%seryl-tRNA synthetase [Helicoverpa armigera]
NCBI nr blastxgi|3286708850.089.00%seryl-tRNA synthetase [Helicoverpa armigera]
Group
Gene OntologyGO:00055241.2e-272ATP binding
GO:00048281.2e-272serine-tRNA ligase activity
GO:00001661.2e-272nucleotide binding
GO:00064341.2e-272seryl-tRNA aminoacylation
GO:00057371.2e-272cytoplasm
GO:00064183.2e-37tRNA aminoacylation for protein translation
GO:00048123.2e-37aminoacyl-tRNA ligase activity
KEGG pathwaydan:Dana_GF155100.0 
 K01875 (SARS, serS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[1-494] IPR0023171.2e-272Seryl-tRNA synthetase, class IIa
[204-385] IPR0023143.2e-37Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain
[2-81] IPR0158661.7e-18Seryl-tRNA synthetase, class IIa, N-terminal
[2-146] IPR0109781.5e-12tRNA-binding arm
Orthology groupMCL13443 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214363-TA
ATGGTGTTAGATCTAGACCTTTTTCGTGCCGATAAAGATGGGGATCCTGAAAAAATTCGCGAAAACCAACGGAAACGTTTCAAAGACGTTGCTTTAGTGGACACTGTAGTGGAACAAGATACCTTATGGAGAAAATTACGCCACGATGCCGACAACCTAAACAAACTGAAAAACGTATGTAGTAAAGAGATTGGCCAAAAGATGAAGAGTAAAGAACCCGTTGGCGCTGAAGATGAGGCCGTACCTCAAGAAATCGCTGATAACCTTATTTATCTTACCGGTGAACAATTGCGACCTCTCACCGTCAATCAAATTAAAAAGGTTAGAGTAATGATTGATGAAGCAATTATGAAAAATGAGCAAGCATTGGTAGCAGCAGAAAAGACTCGATCTGCTGCACTTAGAGAAGTAGGCAACCATTTACATGAGTCTGTGCCTGTTGATGATGACGAGGATCATAATGCTGTGGAAAGGACGTTTGGAGACTGTAGCTTTAGACAAAAATATTCTCATGTAGATCTTATATGCATGATCGATGGAATGGATGGAGAAAGAGGTGCAGCTGTGTCCGGGGGCAGGGGATATTACCTGAAGGGTCCAGCTGTATTCCTTGAACAGGCTCTAGTGCAACTTTCTCTTAGAATGCTTTTAGAGAGAGGTTATACACCATTATATACACCATTTTTTATGAAAAAAGAGGTAATGCAAGAGGTTGCACAACTTGCACAATTTGATGAAGAGCTTTACAAGGTCATTGGCAAAGGTTCAGAAAACAAAGGTGATAGTGCTGTAGAGGAAAAATATCTCATAGCTACTTCAGAGCAGCCCATTGCTGCCTACCACAGAGATGAATGGCTTCCGGAAGCATCTTTACCTATTAAATATGCAGGTCTATCCACATGTTTCCGACAGGAAGTGGGTTCACATGGTCGAGACACCCGTGGTATCTTTAGAGTTCATCAATTTGAAAAGGTAGAGCAGTTTGTTCTTACCTCTCCCCATGATAATGCCTCATGGGTAATGATGGAGGAGATGATAAAAAACGCAGAGGACTTTTACCAGAGCCTTGGTATTCCATACCGAGTCGTCAATATAGTGTCGGGAGCTCTAAATCACGCAGCTTCTAAGAAGCTGGACTTGGAGGCATGGTTCCCTGGCTCTGGAGCATTTCGTGAACTTGTATCATGTAGCAACTGTCTGGAGTATCAAGCTAGACGTTTACTTGTTAGATACGGCCAAACAAAGAAAATGAATGCAGCAACTGAATATGTCCACATGCTTAATGCCACAATGTGCGCCACCACACGCGTCATCTGCGCTATCTTAGAGGTCAACCAGACAGAGGAGGGTGTTAAGGTGCCCGAGGCCCTCAAACTCTGGATGCCAGAACAGTATCAAGAATTAATTCCATTTGTTAAGCCCGCACCCATAGACCTAGAGGCTGCAGCTTCTGCCAAGAAAGGCAAGAAGAATGACAAGAAATAG

Protein sequence:

>DPOGS214363-PA
MVLDLDLFRADKDGDPEKIRENQRKRFKDVALVDTVVEQDTLWRKLRHDADNLNKLKNVCSKEIGQKMKSKEPVGAEDEAVPQEIADNLIYLTGEQLRPLTVNQIKKVRVMIDEAIMKNEQALVAAEKTRSAALREVGNHLHESVPVDDDEDHNAVERTFGDCSFRQKYSHVDLICMIDGMDGERGAAVSGGRGYYLKGPAVFLEQALVQLSLRMLLERGYTPLYTPFFMKKEVMQEVAQLAQFDEELYKVIGKGSENKGDSAVEEKYLIATSEQPIAAYHRDEWLPEASLPIKYAGLSTCFRQEVGSHGRDTRGIFRVHQFEKVEQFVLTSPHDNASWVMMEEMIKNAEDFYQSLGIPYRVVNIVSGALNHAASKKLDLEAWFPGSGAFRELVSCSNCLEYQARRLLVRYGQTKKMNAATEYVHMLNATMCATTRVICAILEVNQTEEGVKVPEALKLWMPEQYQELIPFVKPAPIDLEAAASAKKGKKNDKK-