Monarch geneset OGS2.0

DPOGS214693
TranscriptDPOGS214693-TA1215 bp
ProteinDPOGS214693-PA404 aa
Genomic positionDPSCF300022 - 1534180-1536493
RNAseq coverage291x (Rank: top 38%)
Annotation
Heliconius% 
BombyxBGIBMGA014443-TA0.074.24% 
DrosophilaCG16912-PA2e-14055.61% 
EBI UniRef50UniRef50_Q9W1072e-13855.61%Probable tyrosine--tRNA ligase, mitochondrial n=13 Tax=Diptera RepID=SYYM_DROME
NCBI RefSeqXP_968995.13e-14456.24%PREDICTED: similar to tyrosyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastpgi|910923986e-14356.24%PREDICTED: similar to tyrosyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastxgi|910923983e-13856.24%PREDICTED: similar to tyrosyl-tRNA synthetase [Tribolium castaneum]
Group
Gene OntologyGO:00048318.7e-172tyrosine-tRNA ligase activity
GO:00055245.8e-91ATP binding
GO:00064375.8e-91tyrosyl-tRNA aminoacylation
GO:00057375.8e-91cytoplasm
GO:00001665.8e-91nucleotide binding
GO:00064185.7e-69tRNA aminoacylation for protein translation
GO:00048125.7e-69aminoacyl-tRNA ligase activity
GO:00037232.9e-16RNA binding
KEGG pathwaytca:6574429e-144 
 K01866 (YARS, tyrS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[3-405] IPR0240888.7e-172Tyrosyl-tRNA synthetase, bacterial-type
[12-402] IPR0023075.8e-91Tyrosyl-tRNA synthetase
[12-234] IPR0147298.9e-82Rossmann-like alpha/beta/alpha sandwich fold
[24-305] IPR0023055.7e-69Aminoacyl-tRNA synthetase, class Ic
[311-403] IPR0029422.9e-16RNA-binding S4
Orthology groupMCL15256 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214693-TA
ATGTATCAGGACATGTTCCCAAAAACAGCAGCTAATGAAATATTAGACATATTAAATGGGTCTCCACAATGTGTGTATGCGGGATTTGATCCAACAGCTAGAAGTCTGCATGTGGGAAATCTTTTAGTTATTATCAATCTCCTACACTGGCAGCGCGGTGGACACAATGTGATAGCCTTGGTAGGAGGAGCCACAGGGTTTATAGGAGATCCAAGTGGTAGAAGTTCAGAGCGAACAGCATTACAGGAAGAGATTCTTAGGAAAAATCTAGCTGGTATTAAGAATAATCTAGAAACGGTGTTTGAAAATCATAAAAAGTACATTTGGACCGAAGATGAGAATAAGTTGAAACCTCCAATAATACTTAATAATGAAGAGTGGTACAGAAATATAGATTCAATTAGGTTTGTCAGTGAGATTGGAAGACATTTTAGGATGGGTACAATGTTGTTAAAGCAATCTGTACAAAACAGAATAAACTCTGATATAGGTATGAGCTTCACGGAGTTCTCGTATCAAATATTCCAATCTTATGATTGGTTGCATCTATTAAACAAATATAATTGTAAATTTCAGATAGGTGGAAGTGATCAGATGGGTAACATAAGCGCCGGTCATGAACTCATCAGTAGAACAGCAAAGAAAAATGTTTATGGTTTAACGTTACCGTTAGTAACGACAGAGGAAGGTGACAAGTTTGGAAAGTCCGCGGGAAATGCAATTTGGTTGGATGCAACCATGACAAGCCCATATTCGATGTATCAATTTTTCATAAGGACGAAAGACAGTGACGTCGAAAAATTATTGAAGTTGCTCACATTTTACAGTTTAGGAGAAATAAAAGACATAATGTACAAACATACACAACACCCCGAGCAGAGATACCCGCAGCAGTGTCTGGCTGAACATAAAGACGTCAAGTCCTTGGTGTCTCTTACATCGACTGAGCTGCAGCAAGTGTTCGAAGGTGCTTCAACTGTACCCCTGCTGCTTTCTCCGGGTATAACAGTACTGGAACTTGGATTGAAAGCGAAATGTTTCGCTACAGAGGGCGACGCTATGAGGATCATCCAAGCCGGAGGTTTTTATATAAATCATCAGAGAATGAAGAAGATCGATGAAGTCATCACCGAGTCCGCCCACATACTGCCGAATCATACCTCGTTACTACGAGTCGGCAAACGTAATTACTACATCGTGAAGTGGCAAACATAA

Protein sequence:

>DPOGS214693-PA
MYQDMFPKTAANEILDILNGSPQCVYAGFDPTARSLHVGNLLVIINLLHWQRGGHNVIALVGGATGFIGDPSGRSSERTALQEEILRKNLAGIKNNLETVFENHKKYIWTEDENKLKPPIILNNEEWYRNIDSIRFVSEIGRHFRMGTMLLKQSVQNRINSDIGMSFTEFSYQIFQSYDWLHLLNKYNCKFQIGGSDQMGNISAGHELISRTAKKNVYGLTLPLVTTEEGDKFGKSAGNAIWLDATMTSPYSMYQFFIRTKDSDVEKLLKLLTFYSLGEIKDIMYKHTQHPEQRYPQQCLAEHKDVKSLVSLTSTELQQVFEGASTVPLLLSPGITVLELGLKAKCFATEGDAMRIIQAGGFYINHQRMKKIDEVITESAHILPNHTSLLRVGKRNYYIVKWQT-