Monarch geneset OGS2.0

DPOGS212197
TranscriptDPOGS212197-TA1722 bp
ProteinDPOGS212197-PA573 aa
Genomic positionDPSCF300323 - 261155-265721
RNAseq coverage570x (Rank: top 22%)
Annotation
HeliconiusHMEL0068334e-16064.07% 
BombyxBGIBMGA000991-TA7e-16553.38% 
DrosophilaAats-his-PB0.055.52% 
EBI UniRef50UniRef50_E2BID50.052.88%Histidyl-tRNA synthetase, cytoplasmic n=16 Tax=Opisthokonta RepID=E2BID5_HARSA
NCBI RefSeqXP_974715.20.066.95%PREDICTED: similar to Histidyl-tRNA synthetase CG6335-PA [Tribolium castaneum]
NCBI nr blastpgi|2700061570.059.16%hypothetical protein TcasGA2_TC008324 [Tribolium castaneum]
NCBI nr blastxgi|2700061570.058.44%hypothetical protein TcasGA2_TC008324 [Tribolium castaneum]
Group
Gene OntologyGO:00055247.3e-244ATP binding
GO:00048217.3e-244histidine-tRNA ligase activity
GO:00001667.3e-244nucleotide binding
GO:00064277.3e-244histidyl-tRNA aminoacylation
GO:00057377.3e-244cytoplasm
GO:00048122.7e-22aminoacyl-tRNA ligase activity
GO:00064181.1e-17tRNA aminoacylation for protein translation
KEGG pathwaytca:6635820.0 
 K01892 (HARS, hisS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[103-562] IPR0045167.3e-244Histidyl-tRNA synthetase, class IIa
[467-564] IPR0041542.7e-22Anticodon-binding
[144-303] IPR0023141.1e-17Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain
Orthology groupMCL10960 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212197-TA
ATGGCAGAACACAATGAAATATTGTTGAAAAAAATTCAGGAGCAAGGAGATTTAGTGAGAAAGCTGAAAGCAGAAAAGGAATCAACCGAGAAGTACAGTTATATCAATGGGTATACGCCTAGTAGATTAGATTTCGAAGTTTATAATAATCTAAAAAATGTAGATTTAAAGAGATATTCGTACGTTAAGCGATGGTGGTGCCATATGAGGAGTTTCAGCAACTCTGAAATAACGCAACTTCCTTTTATGAAACCTCCAGATGCTGTAAAATTAATTTTAAATTCCAACCAGAACCATGACCAAAAGATTAAAGAAGAAGTAGCTAAGTTATTGGCTTTAAAAGCTCAGCTCTCCAATGATGCCCCTCCACAAAAGTTTGTACTCAAGACTCCTAAAGGTACGAGGGACTACAACCCTCAACAGATGGCGATAAGGAATAGCGTTTTGCAAAAAATTATATCAGTATTCAAGAAACATGGTGCAGAATGTATCGACACCCCCGTCTTCGAGTTAAAGGAAGTGTTAACTGGCTTTATGCCGCTTCTGAGTTTCAAGATTTATGATCTGAAAGACCAAGGCGGAGAGATACTTTCATTGAGGTATGACCTCACCGTCCCACTCGCAAGATATCTGGCTATGAATAAGATTAATAACTTGAAGAGGTATCACATCGCTAAAGTGTACAGGAGAGACAACCCGGCCATGACGAGGGGTAGATATAGGGAGTTTTATCAATGTGATTTTGATATAGCCGGCCAGTTTGACCCCATGGTGCCGGATGCGGAATGTCTTAAAGTAGTCACGGAGATATTGGACTCTTTGGACATTGGCAAGTACATGCTGAAGGTGAATCACAGATGTCTACTGGACGGCATGTTTGAAGCTTGCGGTGTACCAGCAGAGCAGTTCCGCTCTACATGCTCTACTATTGATAAACTCGATAAGTCACCATGGGAAGAGGTGCGGACGGAAATGATCAGTGAGAAGGGCGTTACACCGGAAGCAGCGGATCGCATCGGCGAATACGTCAGGCTGAACGGAAGTACGGAACTCGTGGATACATTGCTTCAAGATGAAAAACTATCGAAGTCTAAAAGCGCTGTAGAGGGTTTACAAGGGATAAAATTGCTGCTAGAGTATTGTGAACTCTACGGCATTAAGGATAAGGTGCTGTTCGATCTGAGCCTCGCCAGAGGCTTGGATTACTACACTGGCATCATATATGAAGCTGTACTGACCGAACCAATCAAGATCGGTGGTGAAGAGCAAAGTGTGGGCTCGATAGCTGGCGGGGGCAGATATGATAACCTCGTTCCATGTGTGGGTATCAGTGTGGGTGTGGAGCGTGTGTTCTCAGTGCTGGAGGCTCGCCTGGCGGCCGGGGAGCTGAGCGTGCGCCCCTCGGAGGTGGATGTGTATGTAGCGTCCGCTCAGAAAGATTTCCTAACCACGAGAATGAGGATATGCAATGAGTTGTGGGGCGCTGGCATTAAGGCCGAGCAGCCATACAAGAAGAATCCAAAAATGCTAAATCAATTGCAACACTGCGAAGAGAATGGTATACCGCTGGCTGTGATACTGGGGGAGTCGGAATTAAAACGTGGATTGGTCAAAATAAGAAACATAGCTACCAGACAGGAAGATGAGGTGCCGAGAGAGAAGCTCGTTGAGGAACTGAAGAATAGAATAAGCATGTTGCATGTCAATGTAAACGGACTCTAG

Protein sequence:

>DPOGS212197-PA
MAEHNEILLKKIQEQGDLVRKLKAEKESTEKYSYINGYTPSRLDFEVYNNLKNVDLKRYSYVKRWWCHMRSFSNSEITQLPFMKPPDAVKLILNSNQNHDQKIKEEVAKLLALKAQLSNDAPPQKFVLKTPKGTRDYNPQQMAIRNSVLQKIISVFKKHGAECIDTPVFELKEVLTGFMPLLSFKIYDLKDQGGEILSLRYDLTVPLARYLAMNKINNLKRYHIAKVYRRDNPAMTRGRYREFYQCDFDIAGQFDPMVPDAECLKVVTEILDSLDIGKYMLKVNHRCLLDGMFEACGVPAEQFRSTCSTIDKLDKSPWEEVRTEMISEKGVTPEAADRIGEYVRLNGSTELVDTLLQDEKLSKSKSAVEGLQGIKLLLEYCELYGIKDKVLFDLSLARGLDYYTGIIYEAVLTEPIKIGGEEQSVGSIAGGGRYDNLVPCVGISVGVERVFSVLEARLAAGELSVRPSEVDVYVASAQKDFLTTRMRICNELWGAGIKAEQPYKKNPKMLNQLQHCEENGIPLAVILGESELKRGLVKIRNIATRQEDEVPREKLVEELKNRISMLHVNVNGL-