New model in OGS2.0 | DPOGS204025  |
---|---|
Genomic Position | scaffold452:- 73995-81242 |
See gene structure | |
CDS Length | 2874 |
Paired RNAseq reads   | 2082 |
Single RNAseq reads   | 5166 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004877 (0.0) |
Best Drosophila hit   | mitochondrial alanyl-tRNA synthetase (4e-159) |
Best Human hit | alanyl-tRNA synthetase, mitochondrial precursor (1e-107) |
Best NR hit (blastp)   | GK20581 [Drosophila willistoni] (3e-176) |
Best NR hit (blastx)   | GK20581 [Drosophila willistoni] (8e-171) |
GeneOntology terms    | GO:0005739 mitochondrion GO:0006419 alanyl-tRNA aminoacylation GO:0004813 alanine-tRNA ligase activity GO:0005524 ATP binding GO:0003676 nucleic acid binding |
InterPro families    | IPR018165 Alanyl-tRNA synthetase, class IIc, core domain IPR018164 Alanyl-tRNA synthetase, class IIc, N-terminal IPR012947 Threonyl/alanyl tRNA synthetase, SAD IPR018162 Alanyl-tRNA synthetase, class IIc, anti-codon-binding domain IPR018163 Threonyl/alanyl tRNA synthetase, class II-like, putative editing domain IPR002318 Alanyl-tRNA synthetase, class IIc |
Orthology group | MCL16331 |
Nucleotide sequence:
ATGAAATCTAGCGGATTCATAAGAAGTTCTTTTATAGACTATTTTGTAAATAAACATGGT
CATAAAAATATTAAGTCTAGTTCCGTTGTTTCTCTATGCGACCGTACAGTGCCTTTCGTG
AATGCTGGTATGAACCAGTTTAAAGGAATTTTTCTCGGCCTCGCAAACGCTCCATGCACT
CGAGTAGTAAACTCTCAGAAATGTGTCAGAGTAGGCGGTAAACACAATGATCTGGATCTT
GTCGGAACAGACGGACATCATCACACTTTCTTTGAGATGCTGGGAAACTGGTCATTCAAC
AATTATTACAAGAAAGAAGCATGCCAAATGGCATGGGACTTGTTACTGGGTCCATACAGA
ATGAAGCCAGAGAGTCTGCTGGTGACCTACTTCTCCGGCGATGCTGTTATAGGACTGCAG
GAGGATAAAGAGTGCAGAGATATATGGAAGAAGATTGGAGTACCGCAAAATCGTCTCAAA
GGCCATGGAGCGAGGGACAATTTTTGGGAGATGGGTCCGTCGGGGCCATGTGGTCCCTGC
ACTGAAATACACTACATACATCCAGACGGTAGTCGTACAGAAATATGGAATCTAGTCTTC
ATACAATGCAACAGGGAGGTAGATGGCTCTGTGACAGCCCTGCGCCATCATCACGTTGAC
ACTGGTCTTGGTCTGGAGCGACTCGCGGCTCTCCTTCAGGGCGTGCCCTCCAACTACGAC
ACGGATCTCTTCAGACCTCTCATAAAAACTATTGAAAAGTCTGCCAAAGTCCCTGTGTAC
GAAGGTCGCTTCGGCGAGGGCGCAGAGCTAGACACTAGCTACCGGCGGCTGGCGGACCAC
GCTCGGCTCCTGGCTATCTGTCTCGCGGATGGAGCCCTACCCTCAACAAATCTGAACGTT
AAGCAAATCATGCGAAAGTCATTCAAACTCAGGGAACTGATCGCCAAAGAGAGGGAAGCT
AAGCTCATAATAGAACAGGAAAAAGAAAATTATACGAAATTGAGAGCTGATTTGGCCAAG
AAATGGAAGAATTTGGCCAAGAGATATCCTGAGGTTGAGGCGTTGAGTGATATAGAGATA
TCAGGCTTTGCTTTGGGATACGAGGAGTTTAAAGAAACAATGACGAAAATTAATTCAAAA
GTAATACCGGGGGATCTGGTGTTCAAAATGTACGACACTCATGGCTTCCAAGAAGACGTT
ATAGAAAGAATAGCAAAGTTAAATAATATGGAGATCGATAAAAAAGAGTTCTGGAAACTT
CTGACCAATCACAAGCTGAGGCACAAGACGGCCTTCAAAGAACAAACGTCCAAGAACGGT
ATAAAGTTCGACAAAGCCGTAGAGAAACTAGCGGAAAGCGGCATAAGACATACAAACGAC
TTGCCAAAATACGATTTTATGTACTCGGACAATAAAGTCACTTTTCGTCCTTTGAAAACT
AAACTAGTAGGAATTCTGAACGAAGACGGCGAATGGCTTGACTTCTCGGAGCCGTGCGAA
AATAGACCTTATTACTTGGTGACCGAGAGCACAAACTTTTATTGCGAGGAGGGAGGTCAA
GCGGCAGACGACGGGGTCGTTCAGATCAGAGAAAATATTGGCTTCAATGTTCACAGCGTC
TTCAAAATACGAGGCATTATATTCCATAAGGGAGAATTCAGTTTGAAAAGCGCTGACAAC
ATATACGTGTCTATTGGACTGGACGTTAGTATGGCCATTAACAGAGAGAAGAGACTGAGC
ACGATGAGGAATCACACGGGGGTGCATCTGTTGAACGCCGCTGTCAGGAAGGTGTTGCCA
GATAGCGTGATCTCTCAGACAGGGTCCAGTGTCACTGACAGAGGGCTCTCATTGAGTTTG
TCCATCTACGGCGAGAAATTATCGCCGAGGGCTGTTGAAGATGCACAGGAATTAATAAGC
TCGAGTATATCAGCGAACGTGCCAATCCTGAGCCGTACCCTGTCAACCACGGAGCTCTCC
GGGGAGGCGGACATACTCACAGTGCCGGGGGAAGTGTACCCTGAAACCGGGTTAAGAATG
GTCTCGTGCCCGCAGCCCCTGGCTTCTAAGGAGCTCTGTTGCGGTACCCACGTACCTTCA
ACAGGTGAGCTGCAATCCCTGCTAGTGACGTCAGTGCGCGCCATGGGTTCCAGGTCCCCC
ACTATATACGCGCTGACCGGTTCCGCTGCGATGCAGGCCCGCGAGCTGTTCTGCCGCGTT
CAGAAGCTGACGGAGGTCATTGAACTGGCTGAGCCGTCGAGGGTTGACGAGGAGGTTGCT
ATCATCAGACATCAGCTGAAGGATCTGTGCGGGAGCAGCGGCACCCCGGCTGGGGACTAT
CACAGGAGTCTGCAGCTGATGGACCACTTGAAGAAGACAGCCGCTAACAGAAACGATGCC
GCACTGCTAGACATAGCTCGTACTGAGATCGATGAAGCGTGTTCCGAAGCACAGCGAAGC
GGACGCCGCTTCACGGTGCACTTCCTGCGTTGTTCATATCTCATGCGAAGTGACGCTGCG
AGGAGTGTGCTGGCTGGCCGAGGGAGCGCGGACCCTCACAACTCCTGGGGACCTGTCATG
CTGATAGGGTGTGCGGGGGGTGTTGTCATAGCAACCGCCAAAGTACCACAGGAGCTGGTG
ACGTCATCGTTCACGGCTGACAAGTGGCTGTCCTGTATACTGCCGATATTCGAGGCCACC
GTACTGCCACCCACTGAGCAGTTCTCGACGCTCACCCACGCTGAGATGAGCGCCACTAAA
GTAAGTCTCATAAACTGCGAGCAGATGGTCCAGGACGCTATGAGGGTAGCCATCAAGTAT
GCGCAGGCGCACCTCAAGGACGACGAGAGAAAGACAGACATACGACACAACTAG
Protein sequence:
MKSSGFIRSSFIDYFVNKHGHKNIKSSSVVSLCDRTVPFVNAGMNQFKGIFLGLANAPCT
RVVNSQKCVRVGGKHNDLDLVGTDGHHHTFFEMLGNWSFNNYYKKEACQMAWDLLLGPYR
MKPESLLVTYFSGDAVIGLQEDKECRDIWKKIGVPQNRLKGHGARDNFWEMGPSGPCGPC
TEIHYIHPDGSRTEIWNLVFIQCNREVDGSVTALRHHHVDTGLGLERLAALLQGVPSNYD
TDLFRPLIKTIEKSAKVPVYEGRFGEGAELDTSYRRLADHARLLAICLADGALPSTNLNV
KQIMRKSFKLRELIAKEREAKLIIEQEKENYTKLRADLAKKWKNLAKRYPEVEALSDIEI
SGFALGYEEFKETMTKINSKVIPGDLVFKMYDTHGFQEDVIERIAKLNNMEIDKKEFWKL
LTNHKLRHKTAFKEQTSKNGIKFDKAVEKLAESGIRHTNDLPKYDFMYSDNKVTFRPLKT
KLVGILNEDGEWLDFSEPCENRPYYLVTESTNFYCEEGGQAADDGVVQIRENIGFNVHSV
FKIRGIIFHKGEFSLKSADNIYVSIGLDVSMAINREKRLSTMRNHTGVHLLNAAVRKVLP
DSVISQTGSSVTDRGLSLSLSIYGEKLSPRAVEDAQELISSSISANVPILSRTLSTTELS
GEADILTVPGEVYPETGLRMVSCPQPLASKELCCGTHVPSTGELQSLLVTSVRAMGSRSP
TIYALTGSAAMQARELFCRVQKLTEVIELAEPSRVDEEVAIIRHQLKDLCGSSGTPAGDY
HRSLQLMDHLKKTAANRNDAALLDIARTEIDEACSEAQRSGRRFTVHFLRCSYLMRSDAA
RSVLAGRGSADPHNSWGPVMLIGCAGGVVIATAKVPQELVTSSFTADKWLSCILPIFEAT
VLPPTEQFSTLTHAEMSATKVSLINCEQMVQDAMRVAIKYAQAHLKDDERKTDIRHN