DPGLEAN19918 in OGS1.0

New model in OGS2.0  DPOGS212197
Genomic Position  scaffold197:- 151245-155811
CDS Length  1779
Paired RNAseq reads  882
Single RNAseq reads  2595
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000991 (5e-134)
Best Drosophila hit  Histidyl-tRNA synthetase, isoform A (2e-165)
Best Human hit  histidyl-tRNA synthetase, cytoplasmic (1e-159)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC008324 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC008324 [Tribolium castaneum] (0.0)
Gene Ontology terms
GO:0004821 histidine-tRNA ligase activity
GO:0005737 cytoplasm
GO:0000166 nucleotide binding
GO:0006427 histidyl-tRNA aminoacylation
GO:0005524 ATP binding
InterPro families
IPR004516 Histidyl-tRNA synthetase, class IIa
IPR002314 Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain
IPR004154 Anticodon-binding
IPR000738 WHEP-TRS
IPR009068 S15/NS1, RNA-binding
IPR015807 Histidyl-tRNA synthetase, class IIa, subgroup
IPR006195 Aminoacyl-tRNA synthetase, class II
Orthology group  MCL11208

Nucleotide sequence:

ATGGCAGAACACAATGAAATATTGTTGAAAAAAATTCAGGAGCAAGGAGATTTAGTGAGA
AAGCTGAAAGCAGAAAAGGAATCAACCGAGAAGGTTATAAATGAATATTCTATTCAAGAT
CTCGGAGAATTTGATAAATATATTTCCCAGTACAGTTATATCAATGGGTATACGCCTAGT
AGATTAGATTTCGAAGTTTATAATAATCTAAAAAATGTAGATTTAAAGAGATATTCGTAC
GTTAAGCGATGGTGGTGCCATATGAGGAGTTTCAGCAACTCTGAAATAACGCAACTTCCT
TTTATGAAACCTCCAGATGCTGTAAAATTAATTTTAAATTCCAACCAGAACCATGACCAA
AAGATTAAAGAAGAAGTAGCTAAGTTATTGGCTTTAAAAGCTCAGCTCTCCAATGATGCC
CCTCCACAAAAGTTTGTACTCAAGACTCCTAAAGGTACGAGGGACTACAACCCTCAACAG
ATGGCGATAAGGAATAGCGTTTTGCAAAAAATTATATCAGTATTCAAGAAACATGGTGCA
GAATGTATCGACACCCCCGTCTTCGAGTTAAAGGAAGTGTTAACTGGCTTTATGCCGCTT
CTGAGTTTCAAGATTTATGATCTGAAAGACCAAGGCGGAGAGATACTTTCATTGAGGTAT
GACCTCACCGTCCCACTCGCAAGATATCTGGCTATGAATAAGATTAATAACTTGAAGAGG
TATCACATCGCTAAAGTGTACAGGAGAGACAACCCGGCCATGACGAGGGGTAGATATAGG
GAGTTTTATCAATGTGATTTTGATATAGCCGGCCAGTTTGACCCCATGGTGCCGGATGCG
GAATGTCTTAAAGTAGTCACGGAGATATTGGACTCTTTGGACATTGGCAAGTACATGCTG
AAGGTGAATCACAGATGTCTACTGGACGGCATGTTTGAAGCTTGCGGTGTACCAGCAGAG
CAGTTCCGCTCTACATGCTCTACTATTGATAAACTCGATAAGTCACCATGGGAAGAGGTG
CGGACGGAAATGATCAGTGAGAAGGGCGTTACACCGGAAGCAGCGGATCGCATCGGCGAA
TACGTCAGGCTGAACGGAAGTACGGAACTCGTGGATACATTGCTTCAAGATGAAAAACTA
TCGAAGTCTAAAAGCGCTGTAGAGGGTTTACAAGGGATAAAATTGCTGCTAGAGTATTGT
GAACTCTACGGCATTAAGGATAAGGTGCTGTTCGATCTGAGCCTCGCCAGAGGCTTGGAT
TACTACACTGGCATCATATATGAAGCTGTACTGACCGAACCAATCAAGATCGGTGGTGAA
GAGCAAAGTGTGGGCTCGATAGCTGGCGGGGGCAGATATGATAACCTCGTTCCATGTGTG
GGTATCAGTGTGGGTGTGGAGCGTGTGTTCTCAGTGCTGGAGGCTCGCCTGGCGGCCGGG
GAGCTGAGCGTGCGCCCCTCGGAGGTGGATGTGTATGTAGCGTCCGCTCAGAAAGATTTC
CTAACCACGAGAATGAGGATATGCAATGAGTTGTGGGGCGCTGGCATTAAGGCCGAGCAG
CCATACAAGAAGAATCCAAAAATGCTAAATCAATTGCAACACTGCGAAGAGAATGGTATA
CCGCTGGCTGTGATACTGGGGGAGTCGGAATTAAAACGTGGATTGGTCAAAATAAGAAAC
ATAGCTACCAGACAGGAAGATGAGGTGCCGAGAGAGAAGCTCGTTGAGGAACTGAAGAAT
AGAATAAGCATGTTGCATGTCAATGTAAACGGACTCTAG

Protein sequence:

MAEHNEILLKKIQEQGDLVRKLKAEKESTEKVINEYSIQDLGEFDKYISQYSYINGYTPS
RLDFEVYNNLKNVDLKRYSYVKRWWCHMRSFSNSEITQLPFMKPPDAVKLILNSNQNHDQ
KIKEEVAKLLALKAQLSNDAPPQKFVLKTPKGTRDYNPQQMAIRNSVLQKIISVFKKHGA
ECIDTPVFELKEVLTGFMPLLSFKIYDLKDQGGEILSLRYDLTVPLARYLAMNKINNLKR
YHIAKVYRRDNPAMTRGRYREFYQCDFDIAGQFDPMVPDAECLKVVTEILDSLDIGKYML
KVNHRCLLDGMFEACGVPAEQFRSTCSTIDKLDKSPWEEVRTEMISEKGVTPEAADRIGE
YVRLNGSTELVDTLLQDEKLSKSKSAVEGLQGIKLLLEYCELYGIKDKVLFDLSLARGLD
YYTGIIYEAVLTEPIKIGGEEQSVGSIAGGGRYDNLVPCVGISVGVERVFSVLEARLAAG
ELSVRPSEVDVYVASAQKDFLTTRMRICNELWGAGIKAEQPYKKNPKMLNQLQHCEENGI
PLAVILGESELKRGLVKIRNIATRQEDEVPREKLVEELKNRISMLHVNVNGL
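
Consistency check (illustrative):

The nucleotide and protein listings above can be cross-checked against each other and against the stated CDS length. The sketch below is a minimal illustration, not part of the database record. It assumes the standard nuclear genetic code (NCBI translation table 1) applies, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. Only the first 30 and last 12 nucleotides of the CDS are translated, and the protein length of 592 residues is a count taken from the listing rather than a field in the record.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the nucleotide sequence listed above.
cds_start = "ATGGCAGAACACAATGAAATATTGTTGAAA"
cds_end = "AACGGACTCTAG"

print(translate(cds_start))  # MAEHNEILLK, the first ten residues of the protein
print(translate(cds_end))    # NGL*, the last three residues plus the stop codon

# CDS length from the record; protein length counted from the listing.
cds_length = 1779
protein_length = 592
print(cds_length == 3 * (protein_length + 1))  # True: 592 codons plus one stop
```

The same arithmetic explains the genomic span: the coordinates 151245-155811 cover 4567 bp, longer than the 1779 bp CDS, which is consistent with a gene model containing introns.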