Monarch geneset OGS2.0

DPOGS203115
Transcript: DPOGS203115-TA, 1575 bp
Protein: DPOGS203115-PA, 524 aa
Genomic position: DPSCF300094 - 157979-159553
RNAseq coverage: 645x (Rank: top 20%)
Annotation
Heliconius       HMEL016044        0.0    89.33%
Bombyx           BGIBMGA001525-TA  1e-91  83.16%
Drosophila       Aats-tyr-PA       0.0    68.71%
EBI UniRef50     UniRef50_P54577   0.0    68.00%  Tyrosine--tRNA ligase, cytoplasmic n=103 Tax=Opisthokonta RepID=SYYC_HUMAN
NCBI RefSeq      XP_002432437.1    0.0    70.77%  tyrosyl-tRNA synthetase, cytoplasmic, putative [Pediculus humanus corporis]
NCBI nr blastp   gi|242024038      0.0    70.77%  tyrosyl-tRNA synthetase, cytoplasmic, putative [Pediculus humanus corporis]
NCBI nr blastx   gi|242024038      0.0    70.77%  tyrosyl-tRNA synthetase, cytoplasmic, putative [Pediculus humanus corporis]
Group
Gene Ontology
GO:0004831  1.7e-239  tyrosine-tRNA ligase activity
GO:0005524  5.7e-71   ATP binding
GO:0005737  5.7e-71   cytoplasm
GO:0000166  5.7e-71   nucleotide binding
GO:0006418  5.7e-71   tRNA aminoacylation for protein translation
GO:0004812  5.7e-71   aminoacyl-tRNA ligase activity
GO:0006437  7.3e-53   tyrosyl-tRNA aminoacylation
GO:0000049  8.2e-34   tRNA binding
KEGG pathway
phu:Phum_PHUM584030  0.0
K01866 (YARS, tyrS) maps-> Aminoacyl-tRNA biosynthesis
InterPro domain
[5-507]    IPR023617  1.7e-239  Tyrosyl-tRNA synthetase, archaeal-type
[4-203]    IPR014729  6e-72     Rossmann-like alpha/beta/alpha sandwich fold
[29-321]   IPR002305  5.7e-71   Aminoacyl-tRNA synthetase, class Ic
[8-259]    IPR002307  7.3e-53   Tyrosyl-tRNA synthetase
[361-517]  IPR012340  7.9e-49   Nucleic acid-binding, OB-fold
[355-524]  IPR016027  1.6e-47   Nucleic acid-binding, OB-fold-like
[365-460]  IPR002547  8.2e-34   tRNA-binding domain
Orthology group: MCL13306 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203115-TA
ATGGATAATTGGCAAGAAAAGAAAACGCTCATAACCCGTAACTTACAAGAAGTGTTGGGTGATGAAAGACTCACAGAAATTTTGAAACAACGGGACTTAAAAATATATTGGGGCACCGCGACAACTGGAAGACCTCATGTGGCATATTTCGTGCCAATGTCTAAGATTGCTGATTTTCTAAATGCAGGGTGTGAGGTAACAATTTTATTTGCTGACTTACATGCCTATTTAGATAACATGAAAGCACCATGGGAGTTATTATCCTTGCGAACACAATACTATGAAGCAGCAATCAAAGCTATGTTAACTTCCATAAATGTTCCCCTAGACAAACTAAAATTTGTTAGAGGTACAGAATATCAGCTAAGCAAAGAATACACATTGGATGTATACCGCATGTCATCTGTTATAACAGATCATGATGCAAGAAAAGCTGGTGCAGAGGTTGTAAAACAAGTTGAGCATCCATTACTAAGTGGTCTCTTGTATCCTGGCTTGCAAGCATTGGATGAAGAGTATTTAAAGGTTGATGCACAATTTGGTGGTGTTGACCAGAGAAAAATATTCACAATGTCTGAGAAGTATCTACCTCAGCTGGGCTATGCCAAAAGGGTGCACCTTATGAATCCAATGGTACCAGGTCTGACAGGTGGTAAAATGTCTGCTTCAGAAGAGGACAGTAAGATAGACTTATTGGACAATCCAGCAAATGTTAAGAAAAAGCTTAAGAAAGCTTTCTGTGAACCAGGCAACATTACTGAAAATGGTGTCCTGTCATTTACTAAACATGTTGTGTTTCCGTTAATGAAAGCTGGTGAAACATTCAAAATTTATAGAGCTGAAGAGCATGGTGGCAATGCCGAGTATGATAATTTTGAAGCTTTGGAAGCAGCTTTTGCAAAACAAGATATCCATCCAGGTGACTTAAAAGCATCTGTAGAACAAGCTATAAACAAGTTGCTTGCACCTGTTCAAGAAATATTTAAGGATCCTAAACTTCAGGAACTCACTAAGAAAGCTTATCCACCTCCAGTAAAAGTAAAGGGAAATGTTAATTCAAACTCTGATGAAATAAGCCCTGTCAAATTAGATATCCGAGTTGGTCGTATAGTTGATGTATCAAGACATCCCGATGCTGACGCTCTCTATGTTGAGAAGATTGACCTTGGTGAAGATGAGCCTAGAACAATTGTATCTGGTCTTGTCAACTTTGTACCAATAGAGGAAATGCAGAACAGAGATGTAGTTGTTCTATGCAATTTAAAACCAGCTAAGATGCGTGGTGTCGAATCCAAGGGTATGGTTCTCTGTGCATCTATTGACGAACCAAAACAAGTAGAACCTCTTGTAGTACCTAAAGACAGCAAACCTGGCGATAGAATTGTTATTGAAGGCTATGAAACAGGGGAGCCAGATGATGTATTGAATCCCAAGAAAAAAGTTTGGGAGAAACTACAGGTTGATCTAAAGACTAATGATGATTTGTTTGCTGTGTGGCAAGGTAATAAACTTATCAGTAAAGTGAATGGCAATCCAGTAACATCCGGTTCCATGAAGAACGCACCCATCAAGTAA

Protein sequence:

>DPOGS203115-PA
MDNWQEKKTLITRNLQEVLGDERLTEILKQRDLKIYWGTATTGRPHVAYFVPMSKIADFLNAGCEVTILFADLHAYLDNMKAPWELLSLRTQYYEAAIKAMLTSINVPLDKLKFVRGTEYQLSKEYTLDVYRMSSVITDHDARKAGAEVVKQVEHPLLSGLLYPGLQALDEEYLKVDAQFGGVDQRKIFTMSEKYLPQLGYAKRVHLMNPMVPGLTGGKMSASEEDSKIDLLDNPANVKKKLKKAFCEPGNITENGVLSFTKHVVFPLMKAGETFKIYRAEEHGGNAEYDNFEALEAAFAKQDIHPGDLKASVEQAINKLLAPVQEIFKDPKLQELTKKAYPPPVKVKGNVNSNSDEISPVKLDIRVGRIVDVSRHPDADALYVEKIDLGEDEPRTIVSGLVNFVPIEEMQNRDVVVLCNLKPAKMRGVESKGMVLCASIDEPKQVEPLVVPKDSKPGDRIVIEGYETGEPDDVLNPKKKVWEKLQVDLKTNDDLFAVWQGNKLISKVNGNPVTSGSMKNAPIK-
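
Consistency check (sketch):

The figures in this record can be cross-checked against each other: the genomic span should match the transcript length, and translating the nucleotide sequence should reproduce the protein sequence. The following is a minimal Python sketch of that check, not part of the original record. It assumes the standard genetic code, assumes the genomic coordinates are 1-based and inclusive, and writes a stop codon as "-" to match the protein sequence above. The names `translate` and `span_length` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (standard code assumed).

    Stop codons are written as stop_symbol. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# First seven codons and last four codons of DPOGS203115-TA.
print(translate("ATGGATAATTGGCAAGAAAAG"))   # MDNWQEK
print(translate("CCCATCAAGTAA"))            # PIK-
# Genomic span versus the stated transcript length of 1575 bp.
print(span_length(157979, 159553))          # 1575
```

Under these assumptions, the 1575 bp span equals the transcript length, and 1575 / 3 = 525 codons gives 524 amino acids plus the stop codon, in agreement with the 524 aa stated for DPOGS203115-PA.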