Monarch geneset OGS2.0

DPOGS209719
TranscriptDPOGS209719-TA2046 bp
ProteinDPOGS209719-PA681 aa
Genomic positionDPSCF300105 - 265731-267776
RNAseq coverage322x (Rank: top 35%)
Annotation
HeliconiusHMEL0113590.083.85% 
BombyxBGIBMGA008930-TA0.079.24% 
DrosophilaAats-arg-PA0.057.92% 
EBI UniRef50UniRef50_Q5ZM110.058.27%Arginine--tRNA ligase, cytoplasmic n=38 Tax=Bilateria RepID=SYRC_CHICK
NCBI RefSeqXP_975392.10.065.00%PREDICTED: similar to arginyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastpgi|3838622190.061.97%PREDICTED: arginine--tRNA ligase, cytoplasmic-like [Megachile rotundata]
NCBI nr blastxgi|3838622190.062.37%PREDICTED: arginine--tRNA ligase, cytoplasmic-like [Megachile rotundata]
Group
Gene OntologyGO:00048141.1e-296arginine-tRNA ligase activity
GO:00055241.1e-296ATP binding
GO:00064201.1e-296arginyl-tRNA aminoacylation
GO:00057371.1e-296cytoplasm
GO:00001661.9e-116nucleotide binding
GO:00064186.2e-34tRNA aminoacylation for protein translation
GO:00048126.2e-34aminoacyl-tRNA ligase activity
KEGG pathwaytca:6642920.0 
 K01887 (RARS, argS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[96-681] IPR0012781.1e-296Arginyl-tRNA synthetase, class Ia
[210-542] IPR0159451.9e-116Arginyl-tRNA synthetase, class Ia, core
[207-547] IPR0147291.7e-112Rossmann-like alpha/beta/alpha sandwich fold
[550-681] IPR0090806.2e-34Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
[556-681] IPR0089092.9e-28DALR anticodon binding
[99-187] IPR0051483.1e-19Arginyl tRNA synthetase, class Ia, N-terminal
Orthology groupMCL13849 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209719-TA
ATGACAGATATAAATCAAAAAGCACTTGAAGAAAGGACTGCCAAAGCCGAAAAGGAAGCAAAGGCAATTGAAGAGGAATTAACTAAGCTAGCAAAAGGAGATCTGAGTTATATTGGTGATGAACGTCTTGACAAACTAATGTCTGACAATGAAAAATATAAACATCGCTTAGCCATTCTTCAAAACGCTATTGAAGTTGAAAGATCAGTAGGGTCTTCTAAAAAGACGTCTAAAAAACGTCCTAATCTACTAAACGATATTAATTCTATCGAAGGTACTATCTGCATTTTAGATGAACTTAAAAATATATTTGAAATTGCAATTACATCTGCTTACCCAGAACTGGAGGATCCACCTGTTGTGTTAACATTATCGGGCAACAACCCTAAATTTGGAGACTATCAGTGTAATTCTGCTATGCCTATATCACAATTGCTTAAATCTAAAAATATAAAGACTAATCCTAGAGAAGTAGCAAACAATATTTTAAATAAGATGCCTGCATCTCCACTCATTGAGAAAACAGAAGTTGCCGGTGCTGGTTTTCTAAATATCTACTTGAATAGGGCATTTGCTGAACATGTCCTCAGCTGTATTTTACGACTTGGTGTAAAACCACCTCCTGTCAAACGAGAGAGGATAATTGTAGACTTTTCTTCACCAAATATTGCAAAAGAAATGCATGTAGGACATTTAAGGTCGACTATTATAGGAGATAGTATTTGTCGGGCCTTGGAGTTTCTGGACCATGATGTTCTCCGCTTAAACCATCTTGGTGACTGGGGAACTCAGTTTGGAATGCTCATTGCTCATTTACAGGACAAGTATCCAAATTTCAAAACACATTCTCCTCCAATATCAGATTTACAAGCATTTTACAAAGAGTCAAAGAAAAGATTTGATGAAGATGAAGAATTTAAAAAGAGGGCATATTCCTGTGTGGTTAAACTACAGTCCGGTGATCCTGATTACATATCTGCTTGGAAGTTGATATGCGAAGTCTCCAGGCAAGAGTTCCAAAAGATATACGATCGCTTGGACATTAAAATTGAAGACAGGGGGGAATCCTTCTACCAAAGTAGAATGGAAAAAATTGTTAAGGAATTAAAAGATGGTGGTTACTTAGAGGAAGATGAAGGACGTCTTATAATGTGGAGCGACCCCAACAACCATGATGGAATACCACTTACAATTGTGAAGTCTGACGGTGGTTATACCTATGACACTTCTGATATGGCTACCATAAGAAATCGCGTGGAGGAAGAGAAAGGTGACCGATTTATCTATGTGACTGATGTTGGACAATACACTCACTTCGTGCTCATCGAGGCATGCGCCAGAAGGTTCGGTATCCTCAAGGACGGTAAGAAAATAGAACATGTCGGATTTGGTGTTGTTCTTGGCGAAGACAAGAAGAAATTTAAAACACGATCTGGCGATACTATAAAGTTGATACAATTATTAGATGAAGGTCTTAAAAGAGCTCTCGACAAACTGGTAGAAAAAGGACGCGACAAGGTTCTTACGCCTGAGGAATTAAAGCAAGCCCAGGAAGCGGTGGCGTATGGGTGCATAAAATACGCTGATCTGTCTCACAACAGAATCAATGATTACATATTTTCGTTTGATAAAATGTTAGATGATAAAGGCAACACCGCGGTATACTTATTGTATGCTTTAACTCGCATCAGATCTATTGCGAGGACGGCACAGATATCCACAGATAAACTGTTGGAGGAGGTTCACAAGTCTGGGTTTAAACTGGTACATGATGCCGAGTGGAAACTTGGCAAAGTCCTTCTCAGATTCCCTGAGGTTATTCTAAAAGTTGCTAATGATCTTTATTTGCATAGTCTGTGTGAATATTTGTACGAAATCAGTTCCGCGTTCACAGACTTTTATGACAAGTGTTACTGTGTTGAGAAAGACAAAAGCGGTAACGTTGTTAAAATATTATATGAAAGATTGATGCTTTGCGAAGTTACTGCCAGAGTTATGGAACGATGTCTCGACATCCTGGGCATCAAGACGGTCTCTAAAATGTAA

Protein sequence:

>DPOGS209719-PA
MTDINQKALEERTAKAEKEAKAIEEELTKLAKGDLSYIGDERLDKLMSDNEKYKHRLAILQNAIEVERSVGSSKKTSKKRPNLLNDINSIEGTICILDELKNIFEIAITSAYPELEDPPVVLTLSGNNPKFGDYQCNSAMPISQLLKSKNIKTNPREVANNILNKMPASPLIEKTEVAGAGFLNIYLNRAFAEHVLSCILRLGVKPPPVKRERIIVDFSSPNIAKEMHVGHLRSTIIGDSICRALEFLDHDVLRLNHLGDWGTQFGMLIAHLQDKYPNFKTHSPPISDLQAFYKESKKRFDEDEEFKKRAYSCVVKLQSGDPDYISAWKLICEVSRQEFQKIYDRLDIKIEDRGESFYQSRMEKIVKELKDGGYLEEDEGRLIMWSDPNNHDGIPLTIVKSDGGYTYDTSDMATIRNRVEEEKGDRFIYVTDVGQYTHFVLIEACARRFGILKDGKKIEHVGFGVVLGEDKKKFKTRSGDTIKLIQLLDEGLKRALDKLVEKGRDKVLTPEELKQAQEAVAYGCIKYADLSHNRINDYIFSFDKMLDDKGNTAVYLLYALTRIRSIARTAQISTDKLLEEVHKSGFKLVHDAEWKLGKVLLRFPEVILKVANDLYLHSLCEYLYEISSAFTDFYDKCYCVEKDKSGNVVKILYERLMLCEVTARVMERCLDILGIKTVSKM-