New model in OGS2.0 | DPOGS210187  |
---|---|
Genomic Position | scaffold2271:- 3563-12326 |
See gene structure | |
CDS Length | 2940 |
Paired RNAseq reads   | 1788 |
Single RNAseq reads   | 4099 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003259 (0.0) |
Best Drosophila hit   | CG15100 (0.0) |
Best Human hit | methionyl-tRNA synthetase, cytoplasmic (0.0) |
Best NR hit (blastp)   | GK20695 [Drosophila willistoni] (0.0) |
Best NR hit (blastx)   | GK20695 [Drosophila willistoni] (0.0) |
GeneOntology terms    | GO:0004825 methionine-tRNA ligase activity GO:0006431 methionyl-tRNA aminoacylation GO:0005524 ATP binding GO:0005737 cytoplasm GO:0005875 microtubule associated complex |
InterPro families    | IPR014758 Methionyl-tRNA synthetase IPR015413 Aminoacyl-tRNA synthetase, class I (M) IPR000738 WHEP-TRS IPR001412 Aminoacyl-tRNA synthetase, class I, conserved site IPR010987 Glutathione S-transferase, C-terminal-like IPR014729 Rossmann-like alpha/beta/alpha sandwich fold IPR009068 S15/NS1, RNA-binding IPR017933 Glutathione S-transferase/chloride channel, C-terminal IPR009080 Aminoacyl-tRNA synthetase, class 1a, anticodon-binding |
Orthology group | MCL12139 |
Nucleotide sequence:
ATGAAGATTTATACTAACGAGAACAACACGGCTACATTGAAACTGCTTATAGCGGCGAAG
TTAGCTGGGAAAGACGTTGAATTGCTGAAGGGAACTCACGAAGAATCACCAGGTCCAGCA
GTATTACCGCGTCTGGAAGTCCACGAGGAGCTCAGCTTCTTTAGCAGTAACGCAGCAGTT
CAGTATCTATTCCCCTCGTTCAGTCTACAGATGTTAGAATGGGAAGCGACTCGTCTTTAT
CCCGTCGTGTCGACCGTCCTGACCTCTAAGACCGTTTCATCCGAGTTGAAGGAAGCTTTA
AACACCTCGTTACTGATTGCCGACAATCTTCTGGCTAAACATCAGTACATACTTGGGGAC
AAACTGAGTCCTGTAGATGTATCAATATTCAGCACACTATATCCATTGTGCTGCACAGAT
CTCAAGGATACCTATCTTAAAGAGTACAGTCATGTTCTCCGATGGTCCGGAGATATCGGG
AACTCTGAGGCCGTCCAGGAAGCTGTAAAACAGTGGGGCGGATCCCCTAACAGTCCACCA
TCAGCCTCATCGCTGCTGGGTACTCCACAAGTCGTCATACAAACACCGACCGGATCCCCC
GATGAAGTGCCAGAGAAGCTGTCCGCCGAGGAGTTGGAAATGGCGAGAGACAACTTCCTC
AACGGAATCAACAAGCTGCAGCCGCCGTTAAAGAGAGAAGGAGTCGTCCTACCGGATAAG
GATCGCAGGAACGTCCTGATAACCTCCGCCCTGCCGTACGTCAACAACGTGCCTCACCTC
GGCAACATTATAGGCTGCGTCCTCTCCGCGGATATATTCTCCAGGTATTGTCGTCTGTGT
GGCTTCAACACGCTGTTCGTGTGCGGCACGGACGAGTACGGCACGGCCACTGAGACGAAG
GCGTTGGAGGAAGGCGTCACACCGCGTCAGATATGTGATAAGTATTTCGCTATCCACGAC
GCTGTGTATCGCTGGTTCAACATAGACTTTGATTACTTCGGAAGGACCAGCACCGAGCAA
CAGACAAGGATAGCCCAGGACCTGTTCAAGAAACTGAACGCCAACGGCTTCGTCAGCAAG
CAGACGGTGGAGCAGTTGTACTGTGAGAAGTGTGACAGATTCCTCGCTGACAGGTTCGTG
GAGGGTACCTGTCCCCACCCCGGCTGTTTGTACGACAACGCCCGCGGGGACCAGTGCGAT
AAGTGCGGGAAGCTCATCAACGCTGTCGAGCTCCGCGAGGCGAGGTGCAAGGTGTGCTCC
AGCTCGCCCGCCGTCAGGAACAGCGACCAGCTGTTCATAGAACTACCTCAGTTGGAGCCC
TCGCTCCGTTCGTGGGCGTCGCGGGCGGAGGCCGGGTGGTCTGGTCCAGCTCGCGCCGTG
CTCCGGGCCTGGATGAGGGACAAGCTGAGATGTAGGGCCGTCACTAGAGATCTCAAGTGG
GGTGTCCCTGTGCCTATAACCGGCTTTGAGAATAAAGTATTCTACGTGTGGTTCGACGCA
CCGATCGGCTACCTCAGTATAACGGAGTGCGCGACCGGGAACTACGAGAAGTGGTGGAAA
CGGTCGCCGGACTACGACGTGAAGCTCTACCAGTTCATGGCCAAGGACAACGTTCCGTTC
CACGTGATAATGTTCCCAGCTACGGTCATCGGGGTCAACGAGGGTCACCTGCTGGTGGAC
CACATCTACGCCACAGAATATCTGAACTATGAAGACACTAAGTTCTCCAAGTCCCGCGGC
GTGGGCGTGTTCGGGACGGACGCTCGGGACACGGGCATACCGTCCGACGTGTGGCGCTTC
TACCTGGCCATGATCAGGCCAGAGACCTCCGACTCCAGCTTCAGCTGGGCGGACCTCGCC
ACCAGGAACAACTCGGAGCTGCTCAACAACCTGGGTAACTTCTGCCACCGGAGCCTGAGC
TTCTGTTACAGCTCGTTCTCCGCAGCCGTTCCCGACACGCAGCTCACGCCCACGGACCTG
GAGATCATAGCCGGAGTCAACAGGGACGTGGTCGCGTATGTCCAGCACCTGGAGCGAGGT
CGGCTGCGGGACGCGCTCCGCCACGTGCTGCGAGTGTCCCGCGCCGGCAACCTGTACATG
CAGGACACGCAGCCCTGGGCGCTGCTGAAGGGCGGGACACAGGACAGGGTGAAGGCTGCA
ACAACGATAGGTGTCTGTTGTGAGCTGGTGGCTCTGCTGGCAGCCCTGCTGGCCCCGTAC
ATGCCCGACACCAGCAAACGGCTCTGCACACAACTGAACATAGACCAGAGCGAGCTAAGG
ATCAATCCGACGGAGCCCTGTATGGTGAGGTTCCTGGGGCCGGGACACACGATCAACAAG
CCGGAGCCGCTCTTCACCAAGATAGAGCAGCAGACGGTCGACGAGTTGCGGAGGAAGTAC
GCCGGTACACAGGCGGACAGGCGGAAGTCGAACGGAGACTGCAAGAAGCTGAGTGCCGCT
GAGTTAGAGGCGGCTATATCCGCTCAGGGTGAAAAAGTTAGAAAATTGAAATCGTCTACA
AAGGACAAGGCGGTTTGGCAACCGGAAGTAGACGTACTGCTGGCTCTGAAGAAACAACTC
ACCCTCGCGCACACACACGCCGACCAGCAGACGGGCAGCGCGGCGGAGCTGGAGAGAGCC
GTCGCTGAACAAGGCGATAAAGTGAGAAAACTGAAGGCATCGACGAAAGATAAAACCGTT
TGGCAGCCGGAGGTCAATAAACTGTTGGCGCTGAAAAAACAACTCGCGGAACACACGGAC
AGACAGACGGGGAACCACTCCCCGGGCAGCGTGGAGCAGCTGGAGAAGGCTATAGCTGAA
CAGGGAGATAAGGTCAGGAAGCTGAAAGCATCCACAAAGGACAAGTCAGTTTGGCAGCCA
GAGGTCAACGTACTCTTGGACCTAAAAAAACAGCTGACAGCATTACAAGCCAATAAATAA
Protein sequence:
MKIYTNENNTATLKLLIAAKLAGKDVELLKGTHEESPGPAVLPRLEVHEELSFFSSNAAV
QYLFPSFSLQMLEWEATRLYPVVSTVLTSKTVSSELKEALNTSLLIADNLLAKHQYILGD
KLSPVDVSIFSTLYPLCCTDLKDTYLKEYSHVLRWSGDIGNSEAVQEAVKQWGGSPNSPP
SASSLLGTPQVVIQTPTGSPDEVPEKLSAEELEMARDNFLNGINKLQPPLKREGVVLPDK
DRRNVLITSALPYVNNVPHLGNIIGCVLSADIFSRYCRLCGFNTLFVCGTDEYGTATETK
ALEEGVTPRQICDKYFAIHDAVYRWFNIDFDYFGRTSTEQQTRIAQDLFKKLNANGFVSK
QTVEQLYCEKCDRFLADRFVEGTCPHPGCLYDNARGDQCDKCGKLINAVELREARCKVCS
SSPAVRNSDQLFIELPQLEPSLRSWASRAEAGWSGPARAVLRAWMRDKLRCRAVTRDLKW
GVPVPITGFENKVFYVWFDAPIGYLSITECATGNYEKWWKRSPDYDVKLYQFMAKDNVPF
HVIMFPATVIGVNEGHLLVDHIYATEYLNYEDTKFSKSRGVGVFGTDARDTGIPSDVWRF
YLAMIRPETSDSSFSWADLATRNNSELLNNLGNFCHRSLSFCYSSFSAAVPDTQLTPTDL
EIIAGVNRDVVAYVQHLERGRLRDALRHVLRVSRAGNLYMQDTQPWALLKGGTQDRVKAA
TTIGVCCELVALLAALLAPYMPDTSKRLCTQLNIDQSELRINPTEPCMVRFLGPGHTINK
PEPLFTKIEQQTVDELRRKYAGTQADRRKSNGDCKKLSAAELEAAISAQGEKVRKLKSST
KDKAVWQPEVDVLLALKKQLTLAHTHADQQTGSAAELERAVAEQGDKVRKLKASTKDKTV
WQPEVNKLLALKKQLAEHTDRQTGNHSPGSVEQLEKAIAEQGDKVRKLKASTKDKSVWQP
EVNVLLDLKKQLTALQANK