DPGLEAN00244 in OGS1.0

New model in OGS2.0DPOGS214446 
Genomic Positionscaffold882:+ 3458-6593
See gene structure
CDS Length1287
Paired RNAseq reads  120
Single RNAseq reads  329
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009612 (2e-75)
Best Drosophila hit  Prolyl-tRNA synthetase (8e-82)
Best Human hitprobable prolyl-tRNA synthetase, mitochondrial precursor (5e-82)
Best NR hit (blastp)  PREDICTED: similar to prolyl-tRNA synthetase [Tribolium castaneum] (2e-101)
Best NR hit (blastx)  PREDICTED: similar to prolyl-tRNA synthetase [Tribolium castaneum] (3e-90)
GeneOntology terms






  
GO:0005737 cytoplasm
GO:0016874 ligase activity
GO:0005524 ATP binding
GO:0005759 mitochondrial matrix
GO:0000166 nucleotide binding
GO:0004827 proline-tRNA ligase activity
GO:0005739 mitochondrion
GO:0006433 prolyl-tRNA aminoacylation
InterPro families


  
IPR002316 Prolyl-tRNA synthetase, class IIa
IPR004154 Anticodon-binding
IPR002314 Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain
IPR006195 Aminoacyl-tRNA synthetase, class II
Orthology groupMCL12931

Nucleotide sequence:

ATGAAATTATTATCTAAAATATTCCAACCTGTGATCACAATACCTAAGGGTGCGAAGATA
AAGAACACGGAAATAACATGTAAAAGTCAGAAACTCTTGTTAGAATGCGGTCTGGTCCGT
CCAACGAGCACCGGTTTCTTCACCCTGCTACCGTTGGCAAGACGAGCTCTCACCAAATTA
GAAAACATTGTACACCGCTGCTTAGAAGACGTCGGTGCTCAACAGATATCACTACCTTGT
CTCACTTCCAGCAGGCTATGGGAAGCGAGCGGACGTTTAGACAGAGTTGGCTCCGAGTTG
TTAAAAGTAGAAGATAGACACAACAAGAAGTATATATTAAGTCCGACTCACGAGGAGGCC
ATCGCCGACTTGTTGTCCGATGTAGCTCCGTTGTCACACAAACAGTTACCGTTCATACTG
TACCAGATTGGTAACAAGTATCGTGACGAGCTCCGTCCTAAGCACGGTCTGCTGAGGTCG
AGGGAGTTCCTCATGATGGACGCCTACAGTGTACACACGGACACGGACAGCGCGCTCTGT
ACATACGACACACTCACACACGCGTACAGGAACGTGTTCAGAGAACTGCGGCTGCCGGTG
AGGAGAGTGGAGGCTCCGTCGGGTGACATGGGAGGCACTCTCTCCCACGAGTGGCAGCTG
CCAGCTCCCTCTGGCGAGGACTGTCTGTCTGTGTGTCCGTCTTGCTCACACACCACCTTA
CTGGAGGAGGGGAAGGAGGGCAGAAAATGTGTCGCGTGTGGCAGAGAGACGGAGATATGT
AGCAGTATTGAGGTTGGTCACACGTTCGTCCTCGGTGACAGGTACAGCGCCCCCATCGTG
ATGGCCTGCTATGGTATAGGACTCACGAGGCTGCTTGCCGCTAGTGTGGAGCTCCTCTCA
TCCGAGCGTTCCCTGAGGTGGCCGCACGCTCTGGCGCCCTACAAGGCCATAGTTATAGGA
CCTAAGGAAGGTTCTAAGGAGTGGGTACATCATGACAGTCCTCGGTTGGAGCAGCTCGGG
GCTCAGGTGGAGGCTGTAGCTGGTGACGTGGTTTTGGACGACAGACATCACCTCACCATA
GGGAAGAGATTGCTTCAGGCTGATAAAACTGGCTATCCATACATCATAGTGTGCGGGCGC
TCCGCCCTGGAGTCTCCGCCGCGGTATGAACTGCATCGAGACCAAGGCGAAGTCCTAACT
CTGCCGCTAAACGAACTATTAGCATTCATTAAAGATGATAACAAAGAACGAGATTTAAAG
TTTAAAAGAGAAAGCGAATATATATAA

Protein sequence:

MKLLSKIFQPVITIPKGAKIKNTEITCKSQKLLLECGLVRPTSTGFFTLLPLARRALTKL
ENIVHRCLEDVGAQQISLPCLTSSRLWEASGRLDRVGSELLKVEDRHNKKYILSPTHEEA
IADLLSDVAPLSHKQLPFILYQIGNKYRDELRPKHGLLRSREFLMMDAYSVHTDTDSALC
TYDTLTHAYRNVFRELRLPVRRVEAPSGDMGGTLSHEWQLPAPSGEDCLSVCPSCSHTTL
LEEGKEGRKCVACGRETEICSSIEVGHTFVLGDRYSAPIVMACYGIGLTRLLAASVELLS
SERSLRWPHALAPYKAIVIGPKEGSKEWVHHDSPRLEQLGAQVEAVAGDVVLDDRHHLTI
GKRLLQADKTGYPYIIVCGRSALESPPRYELHRDQGEVLTLPLNELLAFIKDDNKERDLK
FKRESEYI