Monarch geneset OGS2.0

DPOGS204025
Transcript        DPOGS204025-TA    2856 bp
Protein           DPOGS204025-PA    951 aa
Genomic position  DPSCF300138 - 73995-81242
RNAseq coverage   568x (Rank: top 22%)
Annotation
  Heliconius       HMEL004955          0.0      58.40%
  Bombyx           BGIBMGA004877-TA    0.0      60.11%
  Drosophila       Aats-ala-m-PA       1e-168   36.50%
  EBI UniRef50     UniRef50_Q16YK4     5e-167   35.78%   Alanyl-tRNA synthetase n=3 Tax=Culicidae RepID=Q16YK4_AEDAE
  NCBI RefSeq      XP_002068628.1      2e-173   37.19%   GK20581 [Drosophila willistoni]
  NCBI nr blastp   gi|195441674        3e-172   37.19%   GK20581 [Drosophila willistoni]
  NCBI nr blastx   gi|195441674        8e-167   36.88%   GK20581 [Drosophila willistoni]
Group
Gene Ontology
  GO:0000166   8.3e-157   nucleotide binding
  GO:0005737   8.3e-157   cytoplasm
  GO:0005524   2.3e-130   ATP binding
  GO:0006419   2.3e-130   alanyl-tRNA aminoacylation
  GO:0004813   2.3e-130   alanine-tRNA ligase activity
  GO:0016876   4.1e-07    ligase activity, forming aminoacyl-tRNA and related compounds
  GO:0043039   4.1e-07    tRNA aminoacylation
KEGG pathway
  dwi:Dwil_GK20581   5e-173
  K01872 (AARS, alaS) maps to Aminoacyl-tRNA biosynthesis
InterPro domain
  [7-763]     IPR002318   8.3e-157   Alanyl-tRNA synthetase, class IIc
  [7-555]     IPR018164   2.3e-130   Alanyl-tRNA synthetase, class IIc, N-terminal
  [240-455]   IPR018162   1.5e-28    Alanyl-tRNA synthetase, class IIc, anti-codon-binding domain
  [571-725]   IPR018163   1.1e-25    Threonyl/alanyl tRNA synthetase, class II-like, putative editing domain
  [672-717]   IPR012947   4.1e-07    Threonyl/alanyl tRNA synthetase, SAD
Orthology group   MCL15632   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204025-TA
ATGAAATCTAGCGGATTCATAAGAAGTTCTTTTATAGACTATTTTGTAAATAAACATGGTCATAAAAATATTAAGTCTAGTTCCGTTGTTTCTCTATGCGACCGTACAGTGCCTTTCGTGAATGCTGGTATGAACCAGTTTAAAGGAATTTTTCTCGGCCTCGCAAACGCTCCATGCACTCGAGTAGTAAACTCTCAGAAATGTGTCAGAGTAGGCGGTAAACACAATGATCTGGATCTTGTCGGAACAGACGGACATCATCACACTTTCTTTGAGATGCTGGGAAACTGGTCATTCAACAATTATTACAAGAAAGAAGCATGCCAAATGGCATGGGACTTGTTACTGGGTCCATACAGAATGAAGCCAGAGAGTCTGCTGGTGACCTACTTCTCCGGCGATGCTGTTATAGGACTGCAGGAGGATAAAGAGTGCAGAGATATATGGAAGAAGATTGGAGTACCGCAAAATCGTCTCAAAGGCCATGGAGCGAGGGACAATTTTTGGGAGATGGGTCCGTCGGGGCCATGTGGTCCCTGCACTGAAATACACTACATACATCCAGACGGTAGTCGTACAGAAATATGGAATCTAGTCTTCATACAATGCAACAGGGAGGTAGATGGCTCTGTGACAGCCCTGCGCCATCATCACGTTGACACTGGTCTTGGTCTGGAGCGACTCGCGGCTCTCCTTCAGGGCGTGCCCTCCAACTACGACACGGATCTCTTCAGACCTCTCATAAAAACTATTGAAAAGTCTGCCAAAGTCCCTGTGTACGAAGGTCGCTTCGGCGAGGGCGCAGAGCTAGACACTAGCTACCGGCGGCTGGCGGACCACGCTCGGCTCCTGGCTATCTGTCTCGCGGATGGAGCCCTACCCTCAACAAAGTCAGTATACATTTTTCATATAAGGGAACTGATCGCCAAAGAGAGGGAAGCTAAGCTCATAATAGAACAGGAAAAAGAAAATTATACGAAATTGAGAGCTGATTTGGCCAAGAAATGGAAGAATTTGGCCAAGAGATATCCTGAGGTTGAGGCGTTGAGTGATATAGAGATATCAGGCTTTGCTTTGGGATACGAGGAGTTTAAAGAAACAATGACGAAAATTAATTCAAAAGTAATACCGGGGGATCTGGTGTTCAAAATGTACGACACTCATGGCTTCCAAGAAGACGTTATAGAAAGAATAGCAAAGTTAAATAATATGGAGATCGATAAAAAAGAGTTCTGGAAACTTCTGACCAATCACAAGCTGAGGCACAAGACGGCCTTCAAAGAACAAACGTCCAAGAACGGTATAAAGTTCGACAAAGCCGTAGAGAAACTAGCGGAAAGCGGCATAAGACATACAAACGACTTGCCAAAATACGATTTTATGTACTCGGACAATAAAGTCACTTTTCGTCCTTTGAAAACTAAACTAGTAGGAATTCTGAACGAAGACGGCGAATGGCTTGACTTCTCGGAGCCGTGCGAAAATAGACCTTATTACTTGGTGACCGAGAGCACAAACTTTTATTGCGAGGAGGGAGGTCAAGCGGCAGACGACGGGGTCGTTCAGATCAGAGAAAATATTGGCTTCAATGTTCACAGCGTCTTCAAAATACGAGGCATTATATTCCATAAGGGAGAATTCAGTTTGAAAAGCGCTGACAACATATACGTGTCTATTGGACTGGACGTTAGTATGGCCATTAACAGAGAGAAGAGACTGAGCACGATGAGGAATCACACGGGGGTGCATCTGTTGAACGCCGCTGTCAGGAAGGTGTTGCCAGATAGCGTGATCTCTCAGACAGGGTCCAGTGTCACTGACAGAGGGCTCTCATTGAGTTTGTCCATCTACGGCGAGAAATTATCGCCGAGGGCTGTTGAAGATGCACAGGAATTAATAAGCTCGAGTATATCAGCGAACGTGCCAATCCTGAGCCGTACCCTGTCAACCACGGAGCTCTCCGGGGAGGCGGACATACTCACAGTGCCGGGGGAAGTGTA
CCCTGAAACCGGGTTAAGAATGGTCTCGTGCCCGCAGCCCCTGGCTTCTAAGGAGCTCTGTTGCGGTACCCACGTACCTTCAACAGGTGAGCTGCAATCCCTGCTAGTGACGTCAGTGCGCGCCATGGGTTCCAGGTCCCCCACTATATACGCGCTGACCGGTTCCGCTGCGATGCAGGCCCGCGAGCTGTTCTGCCGCGTTCAGAAGCTGACGGAGGTCATTGAACTGGCTGAGCCGTCGAGGGTTGACGAGGAGGTTGCTATCATCAGACATCAGCTGAAGGATCTGTGCGGGAGCAGCGGCACCCCGGCTGGGGACTATCACAGGAGTCTGCAGCTGATGGACCACTTGAAGAAGACAGCCGCTAACAGAAACGATGCCGCACTGCTAGACATAGCTCGTACTGAGATCGATGAAGCGTGTTCCGAAGCACAGCGAAGCGGACGCCGCTTCACGGTGCACTTCCTGCGTTGTTCATATCTCATGCGAAGTGACGCTGCGAGGAGTGTGCTGGCTGGCCGAGGGAGCGCGGACCCTCACAACTCCTGGGGACCTGTCATGCTGATAGGGTGTGCGGGGGGTGTTGTCATAGCAACCGCCAAAGTACCACAGGAGCTGGTGACGTCATCGTTCACGGCTGACAAGTGGCTGTCCTGTATACTGCCGATATTCGAGGCCACCGTACTGCCACCCACTGAGCAGTTCTCGACGCTCACCCACGCTGAGATGAGCGCCACTAAAGTAAGTCTCATAAACTGCGAGCAGATGGTCCAGGACGCTATGAGGGTAGCCATCAAGTATGCGCAGGCGCACCTCAAGGACGACGAGAGAAAGACAGACATACGACACAACTAG

Protein sequence:

>DPOGS204025-PA
MKSSGFIRSSFIDYFVNKHGHKNIKSSSVVSLCDRTVPFVNAGMNQFKGIFLGLANAPCTRVVNSQKCVRVGGKHNDLDLVGTDGHHHTFFEMLGNWSFNNYYKKEACQMAWDLLLGPYRMKPESLLVTYFSGDAVIGLQEDKECRDIWKKIGVPQNRLKGHGARDNFWEMGPSGPCGPCTEIHYIHPDGSRTEIWNLVFIQCNREVDGSVTALRHHHVDTGLGLERLAALLQGVPSNYDTDLFRPLIKTIEKSAKVPVYEGRFGEGAELDTSYRRLADHARLLAICLADGALPSTKSVYIFHIRELIAKEREAKLIIEQEKENYTKLRADLAKKWKNLAKRYPEVEALSDIEISGFALGYEEFKETMTKINSKVIPGDLVFKMYDTHGFQEDVIERIAKLNNMEIDKKEFWKLLTNHKLRHKTAFKEQTSKNGIKFDKAVEKLAESGIRHTNDLPKYDFMYSDNKVTFRPLKTKLVGILNEDGEWLDFSEPCENRPYYLVTESTNFYCEEGGQAADDGVVQIRENIGFNVHSVFKIRGIIFHKGEFSLKSADNIYVSIGLDVSMAINREKRLSTMRNHTGVHLLNAAVRKVLPDSVISQTGSSVTDRGLSLSLSIYGEKLSPRAVEDAQELISSSISANVPILSRTLSTTELSGEADILTVPGEVYPETGLRMVSCPQPLASKELCCGTHVPSTGELQSLLVTSVRAMGSRSPTIYALTGSAAMQARELFCRVQKLTEVIELAEPSRVDEEVAIIRHQLKDLCGSSGTPAGDYHRSLQLMDHLKKTAANRNDAALLDIARTEIDEACSEAQRSGRRFTVHFLRCSYLMRSDAARSVLAGRGSADPHNSWGPVMLIGCAGGVVIATAKVPQELVTSSFTADKWLSCILPIFEATVLPPTEQFSTLTHAEMSATKVSLINCEQMVQDAMRVAIKYAQAHLKDDERKTDIRHN-