Monarch geneset OGS2.0

DPOGS202350
Transcript: DPOGS202350-TA, 1470 bp
Protein: DPOGS202350-PA, 489 aa
Genomic position: DPSCF300104 - 562297-566919
RNAseq coverage: 371x (Rank: top 32%)
Annotation
Heliconius: HMEL022032  7e-154  84.77%
Bombyx: BGIBMGA014337-TA  5e-101  79.05%
Drosophila: CG2263-PA  0.0  69.98%
EBI UniRef50: UniRef50_Q9Y285  0.0  62.47%  Phenylalanine--tRNA ligase alpha subunit n=120 Tax=Eukaryota RepID=SYFA_HUMAN
NCBI RefSeq: NP_001182394.1  0.0  82.69%  phenylalanyl-tRNA synthetase alpha subunit [Bombyx mori]
NCBI nr blastp: gi|306518662  0.0  82.69%  phenylalanyl-tRNA synthetase alpha subunit [Bombyx mori]
NCBI nr blastx: gi|306518662  0.0  82.69%  phenylalanyl-tRNA synthetase alpha subunit [Bombyx mori]
Group
Gene Ontology:
  GO:0006432  1.1e-107  phenylalanyl-tRNA aminoacylation
  GO:0004826  1.1e-107  phenylalanine-tRNA ligase activity
  GO:0005524  1.1e-107  ATP binding
  GO:0000166  1.1e-107  nucleotide binding
  GO:0005737  1.1e-107  cytoplasm
  GO:0043039  1.7e-92  tRNA aminoacylation
  GO:0004812  1.7e-92  aminoacyl-tRNA ligase activity
  GO:0000049  1.7e-92  tRNA binding
KEGG pathway: phu:Phum_PHUM399940  0.0
  K01889 (FARSA, pheS) maps-> Aminoacyl-tRNA biosynthesis
InterPro domain:
  [155-475]  IPR004529  1.1e-107  Phenylalanyl-tRNA synthetase, class IIc, alpha subunit
  [204-479]  IPR002319  1.7e-92  Phenylalanyl-tRNA synthetase
Orthology group: MCL12584  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202350-TA
ATGGAATTAAACGAGAGGATTTTAAAGTATTTAGAGGAAAATGAAAAGGCTGATACGTTGAAGCTCGCCAATGAATTCAATGAAGATCATCAAAAAATAGTGGGCGCTGTGAAAAGTCTTGAAGCTTTGGACATGATAGTATCTGAACCGGTGAAAAGTACGAAATGGGAACTTACGGAAGAGGGGAAGTTAGTAGCTGAAAACGGAAGTCATGAAGCCGTCTTGTACCGGAGTATTCCTGAAAATGGAATGTCTCAAGCTGAAGTCATGAAGACCGTGCCAAATGCAAAAGTTGGATTCAGCAAAGCAATGTCATCTGGATGGATAGTTTTAGATAAATCTGGAGGAACGCCTCTTGTTAAGAAGAAGGTTGACTTGATAAAAGATACTGTACAAAACCATCTCAATGAGATTAAGAATGGCGTTGACAATATTCCAGATAAAGAGAGAAGTGATTACAAGAAAAGAAAATTACTTCAAGAGATCACTTTCAAGAGTTTTGTTCTATCAAAGGGACTTCAGTTTGCGACAACTATAAAGAAACTGGAGACCGATATAACAAGTGAAATGCTTATGACTGGGGCGTGGAAGGATTTGCAGTTCAAGCCCTACAACTTTGATGCTCTCGGTCAGCCGCCCGACTCGGGTCACCTCCACCCTCTACTGAAAGTCAGATCCGAATTCCGTGAAATATTCCTAGAAATGGGTTTCACAGAAATGCCAACAAATCGTTATGTGGAGAGCTCCTTCTGGAATTTCGATGCTCTGTTCCAACCCCAGCAACACCCGGCCAGAGACGCCCACGACACATTCTTCATTTCCTCACCAGCTGTTTCATCACAGTTTCCCATGGACTATTTGGAGAAAGTCAAAAAGGTTCACAGTGAAGGCGGCTATGGCTCTCAGGGTTACCGTTATACTTGGAAGCTGGAGGAAGCTCAGAAGAACCTCCTCCGGACTCACACGACGGCCGTCAGCGCTCGGACGTTGTACAAACGAGCAGACAGACACACTCCCATCAAGTGCTTCAGTATAGATAAGGTGTTCCGCAACGAGACCCTGGACGCGACCCACCTGGCGGAGTTCCACCAGGTGGAGGGAGTCATCGCTGACAAGGACCTGGGCCTGGCGGACCTCATCACAGTCCTGGACGACTTCTTCAAACGACTCGGCTTCGACAAACTGCAGTTCAAACCGGCATACAACCCGTACACCGAACCCAGTATGGAGATATTCGCTTACCATAATGGTCTAGAAAAGTGGATAGAAATTGGAAATTCTGGTGTGTTCCGACCGGAGATGCTTCTACCCATGGGCTTGCCTGAAGATGTCAATGTAATCGCATGGGGCTTGTCCCTCGAAAGACCTACTATGATCAAATATGGTTTGAATAACATCAGAGACCTCGTGGGACCCAAGGTGGACCTGCAAATGGTTCAGAGCAACCCTATATGCAGGCTCGACAAGTGA

Protein sequence:

>DPOGS202350-PA
MELNERILKYLEENEKADTLKLANEFNEDHQKIVGAVKSLEALDMIVSEPVKSTKWELTEEGKLVAENGSHEAVLYRSIPENGMSQAEVMKTVPNAKVGFSKAMSSGWIVLDKSGGTPLVKKKVDLIKDTVQNHLNEIKNGVDNIPDKERSDYKKRKLLQEITFKSFVLSKGLQFATTIKKLETDITSEMLMTGAWKDLQFKPYNFDALGQPPDSGHLHPLLKVRSEFREIFLEMGFTEMPTNRYVESSFWNFDALFQPQQHPARDAHDTFFISSPAVSSQFPMDYLEKVKKVHSEGGYGSQGYRYTWKLEEAQKNLLRTHTTAVSARTLYKRADRHTPIKCFSIDKVFRNETLDATHLAEFHQVEGVIADKDLGLADLITVLDDFFKRLGFDKLQFKPAYNPYTEPSMEIFAYHNGLEKWIEIGNSGVFRPEMLLPMGLPEDVNVIAWGLSLERPTMIKYGLNNIRDLVGPKVDLQMVQSNPICRLDK-
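
The transcript and protein records above can be cross-checked against each other: 489 residues plus one stop codon give (489 + 1) x 3 = 1470 bp, which matches the stated transcript length. The Python sketch below shows one way to run that check and to translate a coding sequence. It assumes the standard nuclear genetic code and that the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE`, `translate`, and `lengths_consistent` are hypothetical helpers written for this illustration and are not part of the geneset tooling.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Assumption: the standard nuclear code applies to this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" to match the record above);
    codons containing unexpected characters become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 x (residues + one stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six and last three codons of DPOGS202350-TA, copied from the record.
print(translate("ATGGAATTAAACGAGAGG"))
print(translate("GACAAGTGA"))
print(lengths_consistent(1470, 489))
```

Passing the full DPOGS202350-TA sequence to `translate` and comparing the result with the DPOGS202350-PA record would extend the same check to every codon.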