New model in OGS2.0 | DPOGS203277  |
---|---|
Genomic Position | scaffold3627:8334-17735 (- strand) |
CDS Length | 1965 |
Paired RNAseq reads   | 972 |
Single RNAseq reads   | 2895 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011005 (1e-20) |
Best Drosophila hit   | CG4699, isoform J (2e-23) |
Best Human hit | hypothetical protein LOC284058 isoform 2 (2e-07) |
Best NR hit (blastp)   | PREDICTED: similar to CG4699-PA, isoform A [Apis mellifera] (1e-37) |
Best NR hit (blastx)   | AGAP003755-PA [Anopheles gambiae str. PEST] (2e-30) |
GeneOntology terms    | GO:0005524 ATP binding; GO:0004812 aminoacyl-tRNA ligase activity; GO:0005737 cytoplasm; GO:0006418 tRNA aminoacylation for protein translation |
InterPro families   | ND |
Orthology group | MCL16368 |
Nucleotide sequence:
ATGGCCCCGGCGCTAACTGTGACACCCCAATATAACCACTACGACGCCGTCGAAGAAACT
GAGAAGCTATCCCACCGAAGACGGGCCCTCGCGTCCGTCGATAACGTTTCTATACCCATG
GAGGAAGACGATATGGGGCTTCAGAACCAACCCAGCCTGGCTAAGGACTCTCAGGAAATG
GAACAGATCCTAGTAGATTTAGGTAACAATGACCTTGTTCAGGCTGATTTACTACAAGCG
ATCAAAACCTTGGAAAGTGGTGGCGACTCCCTGTCTACAGGCGACCCAGATGGTATGTTT
CCCTTAAGTGGCTTCGATCTAGCTGACACAGTCGATTCAGAGAGCGATGCTACCGAGGAT
AAGATAAGATTAATACAAGCGCGTTTAGAACGCAGGTGTGCGTTTCTATTGAGACGGTTG
AGAATATTGCAAGCGAGAGCAATCGGCAAAAGAATATCAGAGGAAGCATCACAGACTTTC
GAAAAATGTGCAAAGGGCGCACGGAGAGATGGCGGTGGAAGGCCAATGGGGCTGAAGGCT
CTACTAAAAAGGATAGAGACAACAGCCGCTTTACAAGCGAGTGCTGCATCACGTTCTGTG
GTCGGTCCTAAGTATTACCGTGCGGGGACTTCGAAAGGTGATGCCTCAAGATCTGCCTCT
ATTGGAATACCTTCAGGGACTCTGACTGGTTTGGAAGATTCGGCTGGTGCGCTGAGATCA
CACTTATCTGTAGTAAAACATCAATTGGATTCTGATGCAACAGCTTCGTCTTCTGGAGCC
GAGAGCAACGATGAGGCAGTCACCTACAACAACACTCACCAACAACCAATGCCCATGAAA
GCTATTTATATAAATACGGTAGTCCGTGAAGCTAAAGGTCCGGTAGAATTCGAAGTTAGT
GGCGAATCCCAGTTCGAAGACACGTGTTCCAGGGTCAGACCTCTCAGGAGGGACACGTTT
AATAAGAGGAAGTTGCTGCAGATGCACAACTTACACATAGCAACCAATAAGGCCGCGAAA
CCATCAGATATCAATTGTCGTTGTGTGGGTAGTTCGTGTGCGGTGTGTACGGGACGGTTC
GAGGCCACACAACCGGCAGCTCCGAGCGGAATGCTACCACCAGCAGCGCGTCGGGCATTA
GTCGATCCTTCGCATCACCCTGTGCTCAGCGACGTTAATGACCTTCGTCCGTCTGTTCAT
CTATCGGCGTTAACGTCACGCTCGTGGTTCAGATCGCGAGTTACAAAGTCGTGGCGCGGA
GCACACACCGGTACAGCACCGAGAGCGCCGCCAGCACCGCCCCCCAGACACAGGCGGCTA
ACAACTAGTATGGGTCGAGGTCGGTCGAGTACTGAGAACCGCTTGTGGCGGCGGCAGTCT
TACGACATTGACAACATCGTTATACCCCAGAGCGTCGCCGCCAGCACTCGTCCGGAGATA
CTCACCTATAAAGAAATTATTACACCAAAATGGAGAGTCATGGAAATACCAGAAACACCG
CTCAACAACGGTGTGTCCAAGTCTAATAGGATGTCCATAGAGAGTGATGACGAGGATATA
TCAGAGGCAGCGGTGCAGGCTCGTCACACACGGGCCGAGACACGAGAACGTAACAGATAT
CTCCGGAAGAGAAGATCTAGGAGACGTAACACTGAAGAAGAGAATAATGATCCCATACCG
GAAGTAGTGGTTCGACAGCCTACACCGCCTCTACAGGAGACAGTGCCACCTTACTCACCG
AGGCAGTTCCCTCTCAAAGACGATCTCTACCAGGATATGTTATCCAAAATGCCAGAAGGT
TATCGACCCATCAGCCCCGATTTAGATCCGGATATAACCATGGAAGAAGACACGAGTTCC
CTATCACCGTTGTCACCTTTGAACTTTGATGGTGATGATCCCGATGATGCCGAATGGAAT
CCCAGCAATGAGAAGTCAGATAAAAGAAGAAGTACGCTAAGATAA
Protein sequence:
MAPALTVTPQYNHYDAVEETEKLSHRRRALASVDNVSIPMEEDDMGLQNQPSLAKDSQEM
EQILVDLGNNDLVQADLLQAIKTLESGGDSLSTGDPDGMFPLSGFDLADTVDSESDATED
KIRLIQARLERRCAFLLRRLRILQARAIGKRISEEASQTFEKCAKGARRDGGGRPMGLKA
LLKRIETTAALQASAASRSVVGPKYYRAGTSKGDASRSASIGIPSGTLTGLEDSAGALRS
HLSVVKHQLDSDATASSSGAESNDEAVTYNNTHQQPMPMKAIYINTVVREAKGPVEFEVS
GESQFEDTCSRVRPLRRDTFNKRKLLQMHNLHIATNKAAKPSDINCRCVGSSCAVCTGRF
EATQPAAPSGMLPPAARRALVDPSHHPVLSDVNDLRPSVHLSALTSRSWFRSRVTKSWRG
AHTGTAPRAPPAPPPRHRRLTTSMGRGRSSTENRLWRRQSYDIDNIVIPQSVAASTRPEI
LTYKEIITPKWRVMEIPETPLNNGVSKSNRMSIESDDEDISEAAVQARHTRAETRERNRY
LRKRRSRRRNTEEENNDPIPEVVVRQPTPPLQETVPPYSPRQFPLKDDLYQDMLSKMPEG
YRPISPDLDPDITMEEDTSSLSPLSPLNFDGDDPDDAEWNPSNEKSDKRRSTLR
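
The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model; the helper names `translate` and `expected_protein_length` are hypothetical and not part of the database. It translates the first and last lines of the nucleotide sequence, which happen to be in frame, and derives the protein length implied by a CDS length of 1965.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCCCCGGCGCTAACTGTGACACCCCAATATAACCACTACGACGCCGTCGAAGAAACT"
last_line = "CCCAGCAATGAGAAGTCAGATAAAAGAAGAAGTACGCTAAGATAA"

print(translate(first_line))          # MAPALTVTPQYNHYDAVEET
print(translate(last_line))           # PSNEKSDKRRSTLR*
print(expected_protein_length(1965))  # 654
```

The translated fragments match the start and end of the protein sequence above, and the listed protein is 654 residues long, consistent with a 1965-nt CDS that ends in a TAA stop codon.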