Monarch geneset OGS2.0

DPOGS204649
Transcript        DPOGS204649-TA  2139 bp
Protein           DPOGS204649-PA  712 aa
Genomic position  DPSCF300462 + 20262-29492
RNAseq coverage   1974x (Rank: top 6%)
Annotation
Heliconius        HMEL005531        0.0  87.93%
Bombyx            BGIBMGA001849-TA  0.0  89.22%
Drosophila        Aats-thr-PA       0.0  74.85%
EBI UniRef50      UniRef50_F4W5M6   0.0  73.27%  Threonyl-tRNA synthetase, cytoplasmic n=11 Tax=cellular organisms RepID=F4W5M6_ACREC
NCBI RefSeq       XP_967345.1       0.0  76.27%  PREDICTED: similar to RH56418p [Tribolium castaneum]
NCBI nr blastp    gi|91087799       0.0  76.27%  PREDICTED: similar to RH56418p [Tribolium castaneum]
NCBI nr blastx    gi|91087799       0.0  76.27%  PREDICTED: similar to RH56418p [Tribolium castaneum]
Group
Gene Ontology     GO:0005524  1.1e-202  ATP binding
                  GO:0004829  1.1e-202  threonine-tRNA ligase activity
                  GO:0006435  1.1e-202  threonyl-tRNA aminoacylation
                  GO:0005737  1.1e-202  cytoplasm
                  GO:0000166  7.3e-54   nucleotide binding
                  GO:0006418  6.3e-46   tRNA aminoacylation for protein translation
                  GO:0004812  6.3e-46   aminoacyl-tRNA ligase activity
                  GO:0016876  3.7e-19   ligase activity, forming aminoacyl-tRNA and related compounds
                  GO:0043039  3.7e-19   tRNA aminoacylation
KEGG pathway      tca:655691  0.0
                  K01868 (TARS, thrS) maps-> Aminoacyl-tRNA biosynthesis
InterPro domain   [142-697]  IPR002320  1.1e-202  Threonyl-tRNA synthetase, class IIa
                  [133-308]  IPR018163  7.3e-54   Threonyl/alanyl tRNA synthetase, class II-like, putative editing domain
                  [341-502]  IPR002314  6.3e-46   Aminoacyl-tRNA synthetase, class II (G/H/P/S), conserved domain
                  [597-702]  IPR004154  8.1e-31   Anticodon-binding
                  [71-133]   IPR012675  3.2e-22   Beta-grasp fold, ferredoxin-type
                  [237-286]  IPR012947  3.7e-19   Threonyl/alanyl tRNA synthetase, SAD
                  [71-132]   IPR012676  1.2e-17   TGS-like
                  [72-130]   IPR004095  4.4e-17   TGS
Orthology group   MCL10715  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS204649-TA
ATGGGAGACGCGGCAGTGGAAGAAGTTAAGAATCTAAATATTAACGAAAAGTCAAAATCAAAAGCAATGAAAGAAAAAAAGAAGACAGAGTCATCCTCTGGTGTTTCTGAACTGCAACCCTGGCCGGAGTTCATTCAGAAGCGGATAGACCTATGGGATAAGTACAAAGCACAGTATGCAGAAGCCCTAGCCTCTAAACCTGATGTGAGTGTTGTTGTGACACTCCCTGATGGCAAAACCGTTGAGGCCAAAGCCTGGCATACAACCCCATATGACGTGGCCAAAGGCATCAGCCAGGGTCTAGCAGATGCTACTATCATAGCGAGGGTGAACGGTGTGCTTTGGGACCTTGATCGGCCGTTGGAAGGAGATTGCAAGTTGGAGTTACTGAGATGGGACAACACGGACGCACAGGCAGTGTTCTGGCATTCATCTGCCCACATGCTCGGAGAGGCCATGGAGAGGGTTTATGGTGGGTGTCTCTGCTACGGTCCCCCCATCGAGGAGGGTTTCTACTATGACATGTTCTATCCAGAGAAGGGGATATCGTCAACGGATTTCGGCGTCATAGACGCTCTTGTTAAGAAGATAGCTAAGGAGAAGCAGCCGTTTGAGCGTCTGGAGCTCACCAAGGAACAACTCCTGGAGATGTTCGACTACAACCCGTTCAAAGTGAGGATCCTCAAGGAGAAGGTGGACACGCCCACCACCACCGTGTACAGGTGCGGGCCCCTCATCGATCTCTGCCGGGGACCCCACGTGAGGCACACCGGCAAGGTCAAAGCGTTCAAGGTCACTAAGAACTCGTCCACGTACTGGGAGGGGAAGGCCGACGCTGAGACCCTTCAAAGGGTGTACGGCGTGTCGTTCCCGGAACCCAAACAGCTCAAGGAGTGGGAGCTGATGCAGGAGGAGGCGGCCAAGAGGGATCACAGGAAGATAGGCAGGGAACAAGAGTTATATTTCTTCCACGAGCTGTCTCCTGGGTCGTGTTTCTTCCAGCCGAGGGGAGCTCACATATACAACACGTTGGTCAACTTCATCAGGGAACAGTACAGGAAACGCGGCTTCCAGGAGGTGGTGACTCCTAACATGTACAACGCCAAGCTGTGGCAGACGTCGGGTCACTGGGCGCACTACGCCGAAAACATGTTCTCCTTCGACGTCGAGAAAGAGACCTTCGCTCTCAAACCCATGAACTGTCCCGGACACTGTTTGATGTTCGACAACCGTGTCCGGTCGTGGCGCGAGCTGCCCCTGCGGCTGGCGGACTTCGGGGTGCTGCACAGGAACGAGCTGAGCGGAGCGCTCACTGGACTAACCCGCGTCAGGCGCTTCCAACAGGACGACGCGCACATCTTCTGTACGCCGCAGAACATTGAGCAGGAGATGATCGGTTGCTTAGAATTCCTCGAGCAGGTGTACTCGACCTTCGGCTTCACGTTTCAACTGAAGCTCTCCACGCGCCCTGAGAAATACCTCGGAGACCTGGCGACCTGGAATCAGGCTGAGAAGGCCCTGGAAGACTCCCTCAACAGATTCGGCAAGGTCTGGCAGCTCAACCCGGGCGACGGCGCATTCTACGGGCCCAAAATAGACATTACTATACAGGATGCTCTGAGACGATCACATCAATGTGCGACCATACAACTGGATTTCCAACTGCCGGAGAGGTTCAACCTGAGTTACATCAGTGAGACGGGTGAGAAGAAGCGGCCAGTGATAATCCACCGGGCGGTGTTGGGTTCCGTGGAGCGTATGATAGCTATCCTGAGCGAGTCGTACGCCGGCAAGTGGCCGTTCTGGCTAAGTCCGAGGCAAACTTGTGTGGTGCCCGTCGGTCCCAGCTTCGACGATTACGCCACATACGTGAATGAGAAGTTGTTTGCCGCTGGGTTCATGTCTGAAGTTGACACGGATGCCGGAGACACACTCAACAAGAAAGTCCGGAACGCACAGCTGGCACAGTTCAACTATATACTCGTTGTGGGCGAGCGAGA
GAAGTCCTCCAACACAGTGAATGTCCGTACAAGAGATAACAAAGTCCACGGAGAAATGTCCATCGAGGGTCTGATAGAGCACCTCAACAAGCTGGTCGCTGAGAAAACGCTCTCCGAGGACAGTGAGCTGCTCAAATAG

Protein sequence:

>DPOGS204649-PA
MGDAAVEEVKNLNINEKSKSKAMKEKKKTESSSGVSELQPWPEFIQKRIDLWDKYKAQYAEALASKPDVSVVVTLPDGKTVEAKAWHTTPYDVAKGISQGLADATIIARVNGVLWDLDRPLEGDCKLELLRWDNTDAQAVFWHSSAHMLGEAMERVYGGCLCYGPPIEEGFYYDMFYPEKGISSTDFGVIDALVKKIAKEKQPFERLELTKEQLLEMFDYNPFKVRILKEKVDTPTTTVYRCGPLIDLCRGPHVRHTGKVKAFKVTKNSSTYWEGKADAETLQRVYGVSFPEPKQLKEWELMQEEAAKRDHRKIGREQELYFFHELSPGSCFFQPRGAHIYNTLVNFIREQYRKRGFQEVVTPNMYNAKLWQTSGHWAHYAENMFSFDVEKETFALKPMNCPGHCLMFDNRVRSWRELPLRLADFGVLHRNELSGALTGLTRVRRFQQDDAHIFCTPQNIEQEMIGCLEFLEQVYSTFGFTFQLKLSTRPEKYLGDLATWNQAEKALEDSLNRFGKVWQLNPGDGAFYGPKIDITIQDALRRSHQCATIQLDFQLPERFNLSYISETGEKKRPVIIHRAVLGSVERMIAILSESYAGKWPFWLSPRQTCVVPVGPSFDDYATYVNEKLFAAGFMSEVDTDAGDTLNKKVRNAQLAQFNYILVVGEREKSSNTVNVRTRDNKVHGEMSIEGLIEHLNKLVAEKTLSEDSELLK-
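
The lengths given in the record can be cross-checked against each other. The sketch below is a minimal Python example, not part of the original record. It assumes that the 2139 bp transcript is coding sequence only, ending in a single stop codon, and that the genomic coordinates 20262-29492 are 1-based and inclusive; the record states neither. The function names are illustrative.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode `protein_aa` residues, plus one stop codon if requested."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 2139
protein_aa = 712
genomic_start, genomic_end = 20262, 29492

# 712 residues + 1 stop codon = 713 codons = 2139 nt, matching the transcript length.
print(expected_cds_length(protein_aa) == transcript_bp)

# Genomic span covered by the gene model under the coordinate assumption.
print(span_length(genomic_start, genomic_end))
```

Under these assumptions the transcript and protein lengths agree, and the genomic span is longer than the transcript, as expected for a gene model that contains introns.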