Monarch geneset OGS2.0

DPOGS212352
TranscriptDPOGS212352-TA1218 bp
ProteinDPOGS212352-PA405 aa
Genomic positionDPSCF300019 - 32652-34323
RNAseq coverage147x (Rank: top 54%)
Annotation
HeliconiusHMEL0053027e-15280.54% 
BombyxBGIBMGA012008-TA4e-15576.74% 
DrosophilaCG7441-PA2e-9348.84% 
EBI UniRef50UniRef50_E2ARE73e-9547.21%Tryptophanyl-tRNA synthetase, mitochondrial n=20 Tax=Eumetazoa RepID=E2ARE7_CAMFO
NCBI RefSeqXP_001865561.14e-11153.09%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700598948e-11053.09%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700598941e-10552.63%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00048303.4e-130tryptophan-tRNA ligase activity
GO:00055243.4e-130ATP binding
GO:00064363.4e-130tryptophanyl-tRNA aminoacylation
GO:00001663.4e-130nucleotide binding
GO:00057373.4e-130cytoplasm
GO:00064182.8e-64tRNA aminoacylation for protein translation
GO:00048122.8e-64aminoacyl-tRNA ligase activity
KEGG pathwaycqu:CpipJ_CPIJ0152731e-110 
 K01867 (WARS, trpS)maps-> Aminoacyl-tRNA biosynthesis
    Tryptophan metabolism
InterPro domain[31-382] IPR0023063.4e-130Tryptophanyl-tRNA synthetase
[49-250] IPR0147299.1e-71Rossmann-like alpha/beta/alpha sandwich fold
[49-324] IPR0023052.8e-64Aminoacyl-tRNA synthetase, class Ic
Orthology groupMCL14089 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212352-TA
ATGGCGTTAACATCAGGAAGGCTAGCGTATTCAAAACACTTATGTAAAGTTAATATCCGGTGCATATCGTTACAGGGCTATAAAGCAAAGGAAACTAGCATCAAAAATTCACCTACAGCAGGAACGAGTGAGACTTGGAACCGAAGAGTTGTGACTGGGTTGCAACCGACAGGGGCACTGCATGTGGGCAATTACTTTGCGGCTGTACAGCGTTGCGTAAAGCTTCAGCAGCAGGGCGATGATCTTATGATATTTATCGCAGATCTCCATGCCCACACTACACAACAAAATCCAACACAAATTCAGAAAAATATATTGGAATTGACAGCATACTTGCTTGCTAGTGGGATTGACCCTGAACAAAGCATATTATTCGCACAGTCGGCTGTGCCGCGTCATGCCGAACTTTGCTGGCTTCTAGCCTGCCTCGCGACCCATGCACGTCTTGCACATCTGCCTCAATTTAAAGAGAAATCTGCTACTATGAAGGAGGTGCCAATAGGACTTCTTTTGTATCCAGTACTTCAGGCAGCAGACGTGCTGGTGTATCGCGGCACGCATGTGCCAGTGGGCACAGACCAACTGCAGCACCTGCAGGTTGCAGCTCAACTCGTGCGCACCTTCCACCACCGATATGGACGGCTTTTTCCTACGCCCTCCCCAATCCTACCAGATGATGGAAGTGATCGTCTACGTAGTCTTCGTGATCCCACAAAAAAGATGTCCAAATCCGACACGGACCCAAAATCAAGGATACTTCTAAGTGATTCAGATGATGTCATCAGATTGAAAATAAGAAAAGCCGTTACCGATTTCAACCCACAAGTAACCTTCGAACCCGATAGCCGTCCTGGCGTGTCCAATCTCGTGACACTCCACTGTCTGGCAGCCGACAAACTTCCAGAAGAGGTCGTTCTGGAGGCAGAGGGATTAACTACGGCGCGGTACAAGCAAATGGTAGCGGACGCACTCGTGGCAGCGATCCGGCCGATCCGTGAACGAGCAAACGAGCTGCTGGCTCGACCGGGTCTGCTGCGACACGTGCTGCGTCACGGTGCCGCGCGCGCTCGCCGCCGGGCTGACATCGTCTACGGAGACGTCGCGGAACGTCTCGGGCTCGCCAACGCCACACTCCAAACTTCTGATGAGTTTAACATTTCAAATCAGATAGTTCTTTTAGGATCCAAATCAAAGAAAGCAGTCGAACATGGTGCATGA

Protein sequence:

>DPOGS212352-PA
MALTSGRLAYSKHLCKVNIRCISLQGYKAKETSIKNSPTAGTSETWNRRVVTGLQPTGALHVGNYFAAVQRCVKLQQQGDDLMIFIADLHAHTTQQNPTQIQKNILELTAYLLASGIDPEQSILFAQSAVPRHAELCWLLACLATHARLAHLPQFKEKSATMKEVPIGLLLYPVLQAADVLVYRGTHVPVGTDQLQHLQVAAQLVRTFHHRYGRLFPTPSPILPDDGSDRLRSLRDPTKKMSKSDTDPKSRILLSDSDDVIRLKIRKAVTDFNPQVTFEPDSRPGVSNLVTLHCLAADKLPEEVVLEAEGLTTARYKQMVADALVAAIRPIRERANELLARPGLLRHVLRHGAARARRRADIVYGDVAERLGLANATLQTSDEFNISNQIVLLGSKSKKAVEHGA-