Monarch geneset OGS2.0

DPOGS214597
Transcript: DPOGS214597-TA (1203 bp)
Protein: DPOGS214597-PA (400 aa)
Genomic position: DPSCF300050 - 271207-274003
RNAseq coverage: 398x (Rank: top 30%)
Annotation
Database | Hit | E-value | Identity | Description
Heliconius | HMEL006973 | 0.0 | 91.50% |
Bombyx | BGIBMGA005126-TA | 3e-159 | 87.25% |
Drosophila | Aats-trp-PB | 5e-172 | 66.92% |
EBI UniRef50 | UniRef50_P17248 | 2e-156 | 65.72% | Tryptophan--tRNA ligase, cytoplasmic n=166 Tax=root RepID=SYWC_BOVIN
NCBI RefSeq | XP_002431147.1 | 4e-174 | 70.15% | Tryptophanyl-tRNA synthetase, putative [Pediculus humanus corporis]
NCBI nr blastp | gi|242021429 | 7e-173 | 70.15% | Tryptophanyl-tRNA synthetase, putative [Pediculus humanus corporis]
NCBI nr blastx | gi|91093979 | 2e-171 | 69.15% | PREDICTED: similar to Tryptophanyl-tRNA synthetase, cytoplasmic (Tryptophan--tRNA ligase) (TrpRS) (Interferon-induced protein 53) (IFP53) (hWRS) [Tribolium castaneum]
Group
Gene Ontology
GO:0004830 | 2.1e-235 | tryptophan-tRNA ligase activity
GO:0005524 | 2.1e-235 | ATP binding
GO:0006436 | 2.1e-235 | tryptophanyl-tRNA aminoacylation
GO:0000166 | 2.1e-235 | nucleotide binding
GO:0005737 | 2.1e-235 | cytoplasm
GO:0006418 | 1.3e-25 | tRNA aminoacylation for protein translation
GO:0004812 | 1.3e-25 | aminoacyl-tRNA ligase activity
KEGG pathway
phu:Phum_PHUM513420 | 1e-173
K01867 (WARS, trpS) maps-> Aminoacyl-tRNA biosynthesis
K01867 (WARS, trpS) maps-> Tryptophan metabolism
InterPro domain
[13-400] | IPR002306 | 2.1e-235 | Tryptophanyl-tRNA synthetase
[32-286] | IPR014729 | 3.6e-90 | Rossmann-like alpha/beta/alpha sandwich fold
[87-370] | IPR002305 | 1.3e-25 | Aminoacyl-tRNA synthetase, class Ic
Orthology group: MCL13009 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214597-TA
ATGACTGAGGAGCAAATAAATAATCTATCTATACATGAAGATGATGATGTGGTGGATCCCTGGAATGTGGCTGGAAAGTCTGAAACGGGGATAGATTATGACAAGCTTATCAAAAGATTTGGTAGTCAGAAAATAGATGAAGAGGTTATAGCAAGATTCGAGAAGGTTACTGGCAAGAAAGCACATCACTTCCTAAGACGAGGCATATTCTTCTCTCACAGGGATATCCACAGTATTTTGAACCTGGTGGAATCCGGAAAGAAGTTATATTTATACACAGGAAGAGGTCCATCTTCAGATAGTATGCATATTGGTCATATGATACCATTTATGTTCACAAAGTGGTTGCAAGAAGTGTTTGATGTTCCACTGATAATACAACTCACTGACGACGAGAAGGTTATGTGGAGGGATATAAAGGTAGAAGATGCAAGACAGATGGCTTACAGCAACGCCAAGGATATTATTGCTGTGGGATTTGATCCTTCCAATACATTTATATTCAATGATTTAGATTTTATTGGACAATGTCCAGCATTCTATCAAAATATGTTGCGGATACAGAGGTGTGTGACATTCAATCAGGTGAAGGGGATCTTTGGGTTCGGTGACTCGGATGTTATCGGGAAAATTACCTTTCCGTCTATAGAGGCTGCACCTGCCTTTTCCACTACATTCCCATTCATTTTTGATAATAAAGTGGTACCTTGTCTTGTTCCTTGCGCCATCGACCAGGACCCGTACTTCCGTCTGACCCGTGATGTAGCTCCACGTCTTCGTCTCCCAAAGCCAGCGCTTCTGCATGCTACGTTCCTACCAGCCTTACAGGGAGCACAACACAAAATGTCTGCAAGTGATCCAAACGCGTCCATCTTCCTTAATGACACTCCGAAGCAGATTAAGAATAAGATCAACAAGTATGCGTTCTCCGGCGGGCGGGCGACGGTTGAAGAGCACAGAGAAAAGGGCGGGAACACTGACGTTGACATTTCATACAAATACCTCACGTTTTTCCTAGAAGATGACGACAGACTAGCTGAGATAAAGAAGCAGTACGAATCTGGCGAAATGTTGACCGGTGAACTGAAGAAGATAGCCATCGAGACCATAACCCCCATCATCACGGACTACCAGCAGAGGAGGGTCAAGGTCACCGATGACGTCATGAACGAGTTCTTCGCTATCAGGAAACTCAACTTCTAG

Protein sequence:

>DPOGS214597-PA
MTEEQINNLSIHEDDDVVDPWNVAGKSETGIDYDKLIKRFGSQKIDEEVIARFEKVTGKKAHHFLRRGIFFSHRDIHSILNLVESGKKLYLYTGRGPSSDSMHIGHMIPFMFTKWLQEVFDVPLIIQLTDDEKVMWRDIKVEDARQMAYSNAKDIIAVGFDPSNTFIFNDLDFIGQCPAFYQNMLRIQRCVTFNQVKGIFGFGDSDVIGKITFPSIEAAPAFSTTFPFIFDNKVVPCLVPCAIDQDPYFRLTRDVAPRLRLPKPALLHATFLPALQGAQHKMSASDPNASIFLNDTPKQIKNKINKYAFSGGRATVEEHREKGGNTDVDISYKYLTFFLEDDDRLAEIKKQYESGEMLTGELKKIAIETITPIITDYQQRRVKVTDDVMNEFFAIRKLNF-
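
The record lists a 1203 bp transcript and a 400 aa protein, which fits 401 codons (400 residues plus one stop codon, shown as "-" at the end of the protein sequence). As a minimal sketch for checking that the two sequences agree, the Python below translates a coding sequence with the standard genetic code. The function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database; the sketch also assumes the transcript is a complete, in-frame coding sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in the order T, C, A, G).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into a protein string.

    Stop codons are written as `stop_symbol` (this record uses "-").
    Codons containing characters other than A, C, G, T become "X".
    """
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = _CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons of DPOGS214597-TA, copied from the record above.
print(translate("ATGACTGAGGAGCAAATAAAT"))
```

Passing the full DPOGS214597-TA sequence to `translate` and comparing the result with the DPOGS214597-PA sequence would be one way to confirm that the two entries are consistent.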