Monarch geneset OGS2.0

DPOGS200737
TranscriptDPOGS200737-TA3552 bp
ProteinDPOGS200737-PA1183 aa
Genomic positionDPSCF300030 + 139893-146094
RNAseq coverage425x (Rank: top 29%)
Annotation
HeliconiusHMEL0089600.080.37% 
BombyxBGIBMGA001037-TA0.080.08% 
DrosophilaCG33123-PA0.063.49% 
EBI UniRef50UniRef50_E0VRG60.059.71%Leucyl-tRNA synthetase, putative n=5 Tax=Bilateria RepID=E0VRG6_PEDHC
NCBI RefSeqXP_395743.20.063.20%PREDICTED: similar to CG33123-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|665016290.063.20%PREDICTED: leucyl-tRNA synthetase, cytoplasmic-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|2700062030.062.78%hypothetical protein TcasGA2_TC008372 [Tribolium castaneum]
Group
Gene OntologyGO:00048233.1e-286leucine-tRNA ligase activity
GO:00055243.1e-286ATP binding
GO:00001663.1e-286nucleotide binding
GO:00064293.1e-286leucyl-tRNA aminoacylation
GO:00057373.1e-286cytoplasm
GO:00064188.2e-30tRNA aminoacylation for protein translation
GO:00048128.2e-30aminoacyl-tRNA ligase activity
KEGG pathwayame:4122820.0 
 K01869 (LARS, leuS)maps-> Aminoacyl-tRNA biosynthesis
    Valine, leucine and isoleucine biosynthesis
InterPro domain[23-1062] IPR0044933.1e-286Leucyl-tRNA synthetase, class Ia, archaeal/eukaryotic cytosolic
[514-749] IPR0147292.9e-102Rossmann-like alpha/beta/alpha sandwich fold
[261-517] IPR0090088.2e-30Valyl/Leucyl/Isoleucyl-tRNA synthetase, class Ia, editing domain
[183-759] IPR0023001.2e-26Aminoacyl-tRNA synthetase, class Ia
[768-992] IPR0090801.2e-24Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
[797-914] IPR0131553.5e-14Valyl/Leucyl/Isoleucyl-tRNA synthetase, class I, anticodon-binding
Orthology groupMCL12248 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200737-TA
ATGACAAATCTTTCCAGTACGGCAACCCTTGACCGCAAAGGAACCTTTAAGGTTGAATATCTCCAAGAGATTGAAAAGAAAGTTCAGGAGCGATGGGATCGAGAAAAAATTTTCGATATGGAAGCGCCGGATGACGGAAAAGACTATGAGAAGTTTTTGTGCACCTTTCCTTATCCATATATGAACGGACGTTTACACCTCGGACACACATTTTCATTATCAAAATGTGAGTTTGCCACCAGGTACTACAGGTTAAAAGGGAGGAAGGTTCTTTTTCCATTTGGTTTCCACTGCACCGGAATGCCTATCAAAGCATGCGCTGATAAGCTTAAAAGAGAAATGGCATTATATGGATGCCCACCAATCTTTCCCGATGACGAAATTGTAGAAGAAAAGGAACAAGGGGATATAGTCCCTAAAGATAAAAGTAAGGGCAAAAAAAGCAAAGCCGTGGCTAAGACAGGAGCTGCAAAGTATCAGTGGCAGATCATGCAGAGCATTGGAGTTCCCGAGGAGGAAATTAAGGAGTTTGCTAATGAGAGTTACTGGCTGGAGTACTTCCCACCTCGTGCCGTAGCTGACTTAAAAAGGATGGGAATCCATGTTGACTGGCGTAGAAAATTCATAACAACAGATGCGAATCCATTCTATGATTCATTCATCAGGTGGCAATTTCATCATCTGAAACAACGGAATAAGATTATGTATGGCAAACGCTATACTATATTTTCTCCGCTAGATAAGCAACCTTGCATGGACCATGACAGAAGTACTGGCGAAGGAGCTGGGCCACAGGAATATACACTTATTAAAATGGAAGTTTTGGAGCCTTTTCCTGAAGTTTTAAAACAATTTCAGGGTAAAACCTTAAACTTTGTAGCGGCAACACTCCGGCCTGAGACCATGTATGGCCAGACAAACTGTTGGGTCCATCCTGAAATTAAGTATATTGCATTTGAAACTGTCAAACACGGTGTGTTCATATGTACAAGACGAGCAGCTCGGAACATGTCGTATCAAGGATTTACCGAAAAAGATGGCGAATATAAAATCATTGCTGAAATTGTAGGGCTGGATCTATTAGGTGTGGCCTTGAAATCACCATTTACTTGCTATCAGAAAATTTACTCGCTTCCGATGTTAACAATTAAGGAGGATAAAGGAACGGGAATCGTTACCAGCGTGCCGTCTGATTCTCCCGATGATTACGCCGCATTGGTTGACCTACAAAAGAAAGCCCCGTTCAGAGAAAAGTACGGCATCCAAGACTATATGGTCATGCCATTTAAGCCTGTCTCCATATTAGAAATACCTGAATTCGGTAACCTCACAGCCGTGTTCCTATATGATAAACTTAAAATCCAAAGTCAAAATGACAAAGATAAACTGACCCAGGCCAAAGAAATGGCGTACCTGAAAGGATTTTACGACGGCGTGCTACTGGTCGGTGATTACAAAGGCGAGAAAATTCAGGATGTGAAGAAAAAATTGCAACAGAGGCTGATAGATGATAACTCCGCTGTAATCTATTACGAACCGGAAAAGACAATCATTTCCAGATCCGGTGACGAATGCGTTGTCGCACTTTGCAATCAATGGTATTTGGATTATGGTAATGCAGAATGGAAGGGCCAAGCAGAAAAAGCTTTGGCCGCAATGAACACGTATCACGATGAAGTGAGGAAAAACTTCCAGGCTACTCTTAAATGGCTTCACGAGTATGCTTGTTCCCGTACCTACGGTCTCGGCACCAAATTGCCGTGGGACACGCAATGGGTCATCGAATCGCTCTCCGACTCGACTATATACAACGCGTACTACACTATATCCCATTATCTGCAAGGCGACAGCTTTAGGGGTAATGTCGAAAATGATTTGAAAATTAAACCCGAAGAGATGTCGATTGAAGTTTGGGATTATATTTTCTTCAAAGACGCTCCCATACCTAAGAACACGAAAATATCTAAAAATAAATTAGATCTGATGAAGAAGTCTTTCCAATTCTGGTACCCAGTAGACCTCAGAGTGTCCGGAAAGGATCTCATTCAGAATCATCTGACGTTCTATATTTACAATCACTGTGCAATGTGGGAGAAAGAAGAAGACAAATGGCCAAAAGGGATCCGAGCAAATGGTCACCTCATGTTAAATTCAGCAAAAATGTCCAAATCTGACGGAAACTTCTTAACACTATCTGAGAGTATCGACAAATTCAGTGCCGATGGAATGAGACTGACTCTAGCTGACGCCGGAGACTCCGTTGAAGATGCTAACTTTGTTGAAAGTACAGCCGATGCCGCTATTTTAAGACTTTACACTTTTATTGAATGGGTGAAGGAGGTCATGGTCACTAAATCAAACTTCAGGACAGGAGAGTACAATTTCCATGATAAAGTTTTTGTCAGTGAAATGAACACAAAAATTATTCAGACTGATGATAACTACAACAAACTGCTGTTCAAGGAGGCCTTGAAAACTGGTTTCTTTGAGCTTCAGGCTGCCAGAGATAAATATAGGGAGTTGTGTTCCGAGGGAGGCATGCACGAGAGCCTTATAACACAGTACATTAGCACCCAGGCGAAACTCATTTCACCAATATGCCCGCATGTCGCTGAACATGTTTGGGAACTACTTGGTAATAAAGGTAGCATTCTTCATGAAAGATGGCCAGTTGCTGGAGAAGTGGATGAGATAGCAGTGAAAGCGAGCAACTATCTCATGGAAGCGGCTCACTCCTTCCGAGTTTATCTCAAAAATCATTGTGCTGTCAAAAAACCAAAGAAAGGAGAAGTCGTCAAACAGGAGTCTAAACCGAACAAAGCTGTTATATGGGTGGCCAAGGAATATCCTAAATGGCAACATATTATTTTGAGCACACTTAAAGAAATGCATGGACCAAATGGTCTTCCCGATAACAAAACAATATCCAGCAAGTTAGCAGAAATAAATGATCTGAAAAAGTATATGAAGAGGGTTATGCCGTTTGTTCAGGCGACCAGAGAGAACATAGAGCGTATTGGCCTTGAAGCTCTCCGCGTGGGATTGGCGTTTGATGAAGCGGCTGTACTACAGGATAATGCACAGTATTTAAGAGATACCCTCGATCTAGAGTACATAGAAATTAAATTGGTAGATGAGGACGCTCCAGAACGGACTCGTACTGAATGCGCTCCCGGGTCACCTCACGCCAGCTTCTTCACACATGTGACCCCGGCGGCGGACGTCGTGTTACTGAACCCCGACCAGCGCTCGGGTCTGTTCACAGTTAGCCTGAAGTTAGGGGAGGGGGAAACACTTGATTCCCTTAAGGAGAAGTTGGCGAAACAAGTCAAGGGGATACGAGATATGGATGCGCTTAAAATTTGGCGATACAAGGACCCGGTCCTCGGACCACGAAAGATTCCCGTCATAGGGGATTACGTCACCAAGTGTGTTGTGTTGGGAGCCGGCTCTGCGTTCAATGTTGACGTTGACAAGAACATAATTGAACTGGTTAATAACGGAACCAATATTAATGTCGGCAACCAACTGCTGTACACATACGACAACTAA

Protein sequence:

>DPOGS200737-PA
MTNLSSTATLDRKGTFKVEYLQEIEKKVQERWDREKIFDMEAPDDGKDYEKFLCTFPYPYMNGRLHLGHTFSLSKCEFATRYYRLKGRKVLFPFGFHCTGMPIKACADKLKREMALYGCPPIFPDDEIVEEKEQGDIVPKDKSKGKKSKAVAKTGAAKYQWQIMQSIGVPEEEIKEFANESYWLEYFPPRAVADLKRMGIHVDWRRKFITTDANPFYDSFIRWQFHHLKQRNKIMYGKRYTIFSPLDKQPCMDHDRSTGEGAGPQEYTLIKMEVLEPFPEVLKQFQGKTLNFVAATLRPETMYGQTNCWVHPEIKYIAFETVKHGVFICTRRAARNMSYQGFTEKDGEYKIIAEIVGLDLLGVALKSPFTCYQKIYSLPMLTIKEDKGTGIVTSVPSDSPDDYAALVDLQKKAPFREKYGIQDYMVMPFKPVSILEIPEFGNLTAVFLYDKLKIQSQNDKDKLTQAKEMAYLKGFYDGVLLVGDYKGEKIQDVKKKLQQRLIDDNSAVIYYEPEKTIISRSGDECVVALCNQWYLDYGNAEWKGQAEKALAAMNTYHDEVRKNFQATLKWLHEYACSRTYGLGTKLPWDTQWVIESLSDSTIYNAYYTISHYLQGDSFRGNVENDLKIKPEEMSIEVWDYIFFKDAPIPKNTKISKNKLDLMKKSFQFWYPVDLRVSGKDLIQNHLTFYIYNHCAMWEKEEDKWPKGIRANGHLMLNSAKMSKSDGNFLTLSESIDKFSADGMRLTLADAGDSVEDANFVESTADAAILRLYTFIEWVKEVMVTKSNFRTGEYNFHDKVFVSEMNTKIIQTDDNYNKLLFKEALKTGFFELQAARDKYRELCSEGGMHESLITQYISTQAKLISPICPHVAEHVWELLGNKGSILHERWPVAGEVDEIAVKASNYLMEAAHSFRVYLKNHCAVKKPKKGEVVKQESKPNKAVIWVAKEYPKWQHIILSTLKEMHGPNGLPDNKTISSKLAEINDLKKYMKRVMPFVQATRENIERIGLEALRVGLAFDEAAVLQDNAQYLRDTLDLEYIEIKLVDEDAPERTRTECAPGSPHASFFTHVTPAADVVLLNPDQRSGLFTVSLKLGEGETLDSLKEKLAKQVKGIRDMDALKIWRYKDPVLGPRKIPVIGDYVTKCVVLGAGSAFNVDVDKNIIELVNNGTNINVGNQLLYTYDN-