DPGLEAN04726 in OGS1.0

New model in OGS2.0DPOGS203277 
Genomic Positionscaffold3627:- 8334-17735
See gene structure
CDS Length1965
Paired RNAseq reads  972
Single RNAseq reads  2895
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011005 (1e-20)
Best Drosophila hit  CG4699, isoform J (2e-23)
Best Human hithypothetical protein LOC284058 isoform 2 (2e-07)
Best NR hit (blastp)  PREDICTED: similar to CG4699-PA, isoform A [Apis mellifera] (1e-37)
Best NR hit (blastx)  AGAP003755-PA [Anopheles gambiae str. PEST] (2e-30)
GeneOntology terms


  
GO:0005524 ATP binding
GO:0004812 aminoacyl-tRNA ligase activity
GO:0005737 cytoplasm
GO:0006418 tRNA aminoacylation for protein translation
InterPro families  ND
Orthology groupMCL16368

Nucleotide sequence:

ATGGCCCCGGCGCTAACTGTGACACCCCAATATAACCACTACGACGCCGTCGAAGAAACT
GAGAAGCTATCCCACCGAAGACGGGCCCTCGCGTCCGTCGATAACGTTTCTATACCCATG
GAGGAAGACGATATGGGGCTTCAGAACCAACCCAGCCTGGCTAAGGACTCTCAGGAAATG
GAACAGATCCTAGTAGATTTAGGTAACAATGACCTTGTTCAGGCTGATTTACTACAAGCG
ATCAAAACCTTGGAAAGTGGTGGCGACTCCCTGTCTACAGGCGACCCAGATGGTATGTTT
CCCTTAAGTGGCTTCGATCTAGCTGACACAGTCGATTCAGAGAGCGATGCTACCGAGGAT
AAGATAAGATTAATACAAGCGCGTTTAGAACGCAGGTGTGCGTTTCTATTGAGACGGTTG
AGAATATTGCAAGCGAGAGCAATCGGCAAAAGAATATCAGAGGAAGCATCACAGACTTTC
GAAAAATGTGCAAAGGGCGCACGGAGAGATGGCGGTGGAAGGCCAATGGGGCTGAAGGCT
CTACTAAAAAGGATAGAGACAACAGCCGCTTTACAAGCGAGTGCTGCATCACGTTCTGTG
GTCGGTCCTAAGTATTACCGTGCGGGGACTTCGAAAGGTGATGCCTCAAGATCTGCCTCT
ATTGGAATACCTTCAGGGACTCTGACTGGTTTGGAAGATTCGGCTGGTGCGCTGAGATCA
CACTTATCTGTAGTAAAACATCAATTGGATTCTGATGCAACAGCTTCGTCTTCTGGAGCC
GAGAGCAACGATGAGGCAGTCACCTACAACAACACTCACCAACAACCAATGCCCATGAAA
GCTATTTATATAAATACGGTAGTCCGTGAAGCTAAAGGTCCGGTAGAATTCGAAGTTAGT
GGCGAATCCCAGTTCGAAGACACGTGTTCCAGGGTCAGACCTCTCAGGAGGGACACGTTT
AATAAGAGGAAGTTGCTGCAGATGCACAACTTACACATAGCAACCAATAAGGCCGCGAAA
CCATCAGATATCAATTGTCGTTGTGTGGGTAGTTCGTGTGCGGTGTGTACGGGACGGTTC
GAGGCCACACAACCGGCAGCTCCGAGCGGAATGCTACCACCAGCAGCGCGTCGGGCATTA
GTCGATCCTTCGCATCACCCTGTGCTCAGCGACGTTAATGACCTTCGTCCGTCTGTTCAT
CTATCGGCGTTAACGTCACGCTCGTGGTTCAGATCGCGAGTTACAAAGTCGTGGCGCGGA
GCACACACCGGTACAGCACCGAGAGCGCCGCCAGCACCGCCCCCCAGACACAGGCGGCTA
ACAACTAGTATGGGTCGAGGTCGGTCGAGTACTGAGAACCGCTTGTGGCGGCGGCAGTCT
TACGACATTGACAACATCGTTATACCCCAGAGCGTCGCCGCCAGCACTCGTCCGGAGATA
CTCACCTATAAAGAAATTATTACACCAAAATGGAGAGTCATGGAAATACCAGAAACACCG
CTCAACAACGGTGTGTCCAAGTCTAATAGGATGTCCATAGAGAGTGATGACGAGGATATA
TCAGAGGCAGCGGTGCAGGCTCGTCACACACGGGCCGAGACACGAGAACGTAACAGATAT
CTCCGGAAGAGAAGATCTAGGAGACGTAACACTGAAGAAGAGAATAATGATCCCATACCG
GAAGTAGTGGTTCGACAGCCTACACCGCCTCTACAGGAGACAGTGCCACCTTACTCACCG
AGGCAGTTCCCTCTCAAAGACGATCTCTACCAGGATATGTTATCCAAAATGCCAGAAGGT
TATCGACCCATCAGCCCCGATTTAGATCCGGATATAACCATGGAAGAAGACACGAGTTCC
CTATCACCGTTGTCACCTTTGAACTTTGATGGTGATGATCCCGATGATGCCGAATGGAAT
CCCAGCAATGAGAAGTCAGATAAAAGAAGAAGTACGCTAAGATAA

Protein sequence:

MAPALTVTPQYNHYDAVEETEKLSHRRRALASVDNVSIPMEEDDMGLQNQPSLAKDSQEM
EQILVDLGNNDLVQADLLQAIKTLESGGDSLSTGDPDGMFPLSGFDLADTVDSESDATED
KIRLIQARLERRCAFLLRRLRILQARAIGKRISEEASQTFEKCAKGARRDGGGRPMGLKA
LLKRIETTAALQASAASRSVVGPKYYRAGTSKGDASRSASIGIPSGTLTGLEDSAGALRS
HLSVVKHQLDSDATASSSGAESNDEAVTYNNTHQQPMPMKAIYINTVVREAKGPVEFEVS
GESQFEDTCSRVRPLRRDTFNKRKLLQMHNLHIATNKAAKPSDINCRCVGSSCAVCTGRF
EATQPAAPSGMLPPAARRALVDPSHHPVLSDVNDLRPSVHLSALTSRSWFRSRVTKSWRG
AHTGTAPRAPPAPPPRHRRLTTSMGRGRSSTENRLWRRQSYDIDNIVIPQSVAASTRPEI
LTYKEIITPKWRVMEIPETPLNNGVSKSNRMSIESDDEDISEAAVQARHTRAETRERNRY
LRKRRSRRRNTEEENNDPIPEVVVRQPTPPLQETVPPYSPRQFPLKDDLYQDMLSKMPEG
YRPISPDLDPDITMEEDTSSLSPLSPLNFDGDDPDDAEWNPSNEKSDKRRSTLR