DPGLEAN07501 in OGS1.0

New model in OGS2.0DPOGS210187 
Genomic Positionscaffold2271:- 3563-12326
See gene structure
CDS Length2940
Paired RNAseq reads  1788
Single RNAseq reads  4099
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003259 (0.0)
Best Drosophila hit  CG15100 (0.0)
Best Human hitmethionyl-tRNA synthetase, cytoplasmic (0.0)
Best NR hit (blastp)  GK20695 [Drosophila willistoni] (0.0)
Best NR hit (blastx)  GK20695 [Drosophila willistoni] (0.0)
GeneOntology terms



  
GO:0004825 methionine-tRNA ligase activity
GO:0006431 methionyl-tRNA aminoacylation
GO:0005524 ATP binding
GO:0005737 cytoplasm
GO:0005875 microtubule associated complex
InterPro families







  
IPR014758 Methionyl-tRNA synthetase
IPR015413 Aminoacyl-tRNA synthetase, class I (M)
IPR000738 WHEP-TRS
IPR001412 Aminoacyl-tRNA synthetase, class I, conserved site
IPR010987 Glutathione S-transferase, C-terminal-like
IPR014729 Rossmann-like alpha/beta/alpha sandwich fold
IPR009068 S15/NS1, RNA-binding
IPR017933 Glutathione S-transferase/chloride channel, C-terminal
IPR009080 Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
Orthology groupMCL12139

Nucleotide sequence:

ATGAAGATTTATACTAACGAGAACAACACGGCTACATTGAAACTGCTTATAGCGGCGAAG
TTAGCTGGGAAAGACGTTGAATTGCTGAAGGGAACTCACGAAGAATCACCAGGTCCAGCA
GTATTACCGCGTCTGGAAGTCCACGAGGAGCTCAGCTTCTTTAGCAGTAACGCAGCAGTT
CAGTATCTATTCCCCTCGTTCAGTCTACAGATGTTAGAATGGGAAGCGACTCGTCTTTAT
CCCGTCGTGTCGACCGTCCTGACCTCTAAGACCGTTTCATCCGAGTTGAAGGAAGCTTTA
AACACCTCGTTACTGATTGCCGACAATCTTCTGGCTAAACATCAGTACATACTTGGGGAC
AAACTGAGTCCTGTAGATGTATCAATATTCAGCACACTATATCCATTGTGCTGCACAGAT
CTCAAGGATACCTATCTTAAAGAGTACAGTCATGTTCTCCGATGGTCCGGAGATATCGGG
AACTCTGAGGCCGTCCAGGAAGCTGTAAAACAGTGGGGCGGATCCCCTAACAGTCCACCA
TCAGCCTCATCGCTGCTGGGTACTCCACAAGTCGTCATACAAACACCGACCGGATCCCCC
GATGAAGTGCCAGAGAAGCTGTCCGCCGAGGAGTTGGAAATGGCGAGAGACAACTTCCTC
AACGGAATCAACAAGCTGCAGCCGCCGTTAAAGAGAGAAGGAGTCGTCCTACCGGATAAG
GATCGCAGGAACGTCCTGATAACCTCCGCCCTGCCGTACGTCAACAACGTGCCTCACCTC
GGCAACATTATAGGCTGCGTCCTCTCCGCGGATATATTCTCCAGGTATTGTCGTCTGTGT
GGCTTCAACACGCTGTTCGTGTGCGGCACGGACGAGTACGGCACGGCCACTGAGACGAAG
GCGTTGGAGGAAGGCGTCACACCGCGTCAGATATGTGATAAGTATTTCGCTATCCACGAC
GCTGTGTATCGCTGGTTCAACATAGACTTTGATTACTTCGGAAGGACCAGCACCGAGCAA
CAGACAAGGATAGCCCAGGACCTGTTCAAGAAACTGAACGCCAACGGCTTCGTCAGCAAG
CAGACGGTGGAGCAGTTGTACTGTGAGAAGTGTGACAGATTCCTCGCTGACAGGTTCGTG
GAGGGTACCTGTCCCCACCCCGGCTGTTTGTACGACAACGCCCGCGGGGACCAGTGCGAT
AAGTGCGGGAAGCTCATCAACGCTGTCGAGCTCCGCGAGGCGAGGTGCAAGGTGTGCTCC
AGCTCGCCCGCCGTCAGGAACAGCGACCAGCTGTTCATAGAACTACCTCAGTTGGAGCCC
TCGCTCCGTTCGTGGGCGTCGCGGGCGGAGGCCGGGTGGTCTGGTCCAGCTCGCGCCGTG
CTCCGGGCCTGGATGAGGGACAAGCTGAGATGTAGGGCCGTCACTAGAGATCTCAAGTGG
GGTGTCCCTGTGCCTATAACCGGCTTTGAGAATAAAGTATTCTACGTGTGGTTCGACGCA
CCGATCGGCTACCTCAGTATAACGGAGTGCGCGACCGGGAACTACGAGAAGTGGTGGAAA
CGGTCGCCGGACTACGACGTGAAGCTCTACCAGTTCATGGCCAAGGACAACGTTCCGTTC
CACGTGATAATGTTCCCAGCTACGGTCATCGGGGTCAACGAGGGTCACCTGCTGGTGGAC
CACATCTACGCCACAGAATATCTGAACTATGAAGACACTAAGTTCTCCAAGTCCCGCGGC
GTGGGCGTGTTCGGGACGGACGCTCGGGACACGGGCATACCGTCCGACGTGTGGCGCTTC
TACCTGGCCATGATCAGGCCAGAGACCTCCGACTCCAGCTTCAGCTGGGCGGACCTCGCC
ACCAGGAACAACTCGGAGCTGCTCAACAACCTGGGTAACTTCTGCCACCGGAGCCTGAGC
TTCTGTTACAGCTCGTTCTCCGCAGCCGTTCCCGACACGCAGCTCACGCCCACGGACCTG
GAGATCATAGCCGGAGTCAACAGGGACGTGGTCGCGTATGTCCAGCACCTGGAGCGAGGT
CGGCTGCGGGACGCGCTCCGCCACGTGCTGCGAGTGTCCCGCGCCGGCAACCTGTACATG
CAGGACACGCAGCCCTGGGCGCTGCTGAAGGGCGGGACACAGGACAGGGTGAAGGCTGCA
ACAACGATAGGTGTCTGTTGTGAGCTGGTGGCTCTGCTGGCAGCCCTGCTGGCCCCGTAC
ATGCCCGACACCAGCAAACGGCTCTGCACACAACTGAACATAGACCAGAGCGAGCTAAGG
ATCAATCCGACGGAGCCCTGTATGGTGAGGTTCCTGGGGCCGGGACACACGATCAACAAG
CCGGAGCCGCTCTTCACCAAGATAGAGCAGCAGACGGTCGACGAGTTGCGGAGGAAGTAC
GCCGGTACACAGGCGGACAGGCGGAAGTCGAACGGAGACTGCAAGAAGCTGAGTGCCGCT
GAGTTAGAGGCGGCTATATCCGCTCAGGGTGAAAAAGTTAGAAAATTGAAATCGTCTACA
AAGGACAAGGCGGTTTGGCAACCGGAAGTAGACGTACTGCTGGCTCTGAAGAAACAACTC
ACCCTCGCGCACACACACGCCGACCAGCAGACGGGCAGCGCGGCGGAGCTGGAGAGAGCC
GTCGCTGAACAAGGCGATAAAGTGAGAAAACTGAAGGCATCGACGAAAGATAAAACCGTT
TGGCAGCCGGAGGTCAATAAACTGTTGGCGCTGAAAAAACAACTCGCGGAACACACGGAC
AGACAGACGGGGAACCACTCCCCGGGCAGCGTGGAGCAGCTGGAGAAGGCTATAGCTGAA
CAGGGAGATAAGGTCAGGAAGCTGAAAGCATCCACAAAGGACAAGTCAGTTTGGCAGCCA
GAGGTCAACGTACTCTTGGACCTAAAAAAACAGCTGACAGCATTACAAGCCAATAAATAA

Protein sequence:

MKIYTNENNTATLKLLIAAKLAGKDVELLKGTHEESPGPAVLPRLEVHEELSFFSSNAAV
QYLFPSFSLQMLEWEATRLYPVVSTVLTSKTVSSELKEALNTSLLIADNLLAKHQYILGD
KLSPVDVSIFSTLYPLCCTDLKDTYLKEYSHVLRWSGDIGNSEAVQEAVKQWGGSPNSPP
SASSLLGTPQVVIQTPTGSPDEVPEKLSAEELEMARDNFLNGINKLQPPLKREGVVLPDK
DRRNVLITSALPYVNNVPHLGNIIGCVLSADIFSRYCRLCGFNTLFVCGTDEYGTATETK
ALEEGVTPRQICDKYFAIHDAVYRWFNIDFDYFGRTSTEQQTRIAQDLFKKLNANGFVSK
QTVEQLYCEKCDRFLADRFVEGTCPHPGCLYDNARGDQCDKCGKLINAVELREARCKVCS
SSPAVRNSDQLFIELPQLEPSLRSWASRAEAGWSGPARAVLRAWMRDKLRCRAVTRDLKW
GVPVPITGFENKVFYVWFDAPIGYLSITECATGNYEKWWKRSPDYDVKLYQFMAKDNVPF
HVIMFPATVIGVNEGHLLVDHIYATEYLNYEDTKFSKSRGVGVFGTDARDTGIPSDVWRF
YLAMIRPETSDSSFSWADLATRNNSELLNNLGNFCHRSLSFCYSSFSAAVPDTQLTPTDL
EIIAGVNRDVVAYVQHLERGRLRDALRHVLRVSRAGNLYMQDTQPWALLKGGTQDRVKAA
TTIGVCCELVALLAALLAPYMPDTSKRLCTQLNIDQSELRINPTEPCMVRFLGPGHTINK
PEPLFTKIEQQTVDELRRKYAGTQADRRKSNGDCKKLSAAELEAAISAQGEKVRKLKSST
KDKAVWQPEVDVLLALKKQLTLAHTHADQQTGSAAELERAVAEQGDKVRKLKASTKDKTV
WQPEVNKLLALKKQLAEHTDRQTGNHSPGSVEQLEKAIAEQGDKVRKLKASTKDKSVWQP
EVNVLLDLKKQLTALQANK