Monarch geneset OGS2.0

DPOGS211867
Transcript: DPOGS211867-TA (2169 bp)
Protein: DPOGS211867-PA (722 aa)
Genomic position: DPSCF300011 - 1047192-1051802
RNAseq coverage: 751x (Rank: top 17%)
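The lengths and coordinates listed for this gene can be cross-checked with a few lines of Python. This is a minimal sketch and not part of the original record: the helper name `parse_position` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
def parse_position(text):
    """Split a 'SCAFFOLD - START-END' string into (scaffold, start, end)."""
    scaffold, _, coords = text.partition(" - ")
    start, end = (int(x) for x in coords.split("-"))
    return scaffold, start, end


scaffold, start, end = parse_position("DPSCF300011 - 1047192-1051802")

# Assumption: coordinates are 1-based and inclusive at both ends.
genomic_span = end - start + 1

transcript_bp = 2169
protein_aa = 722

# A complete coding sequence has one codon per residue plus one stop codon.
cds_is_consistent = transcript_bp == 3 * (protein_aa + 1)

print(scaffold, genomic_span, cds_is_consistent)
```

Under these assumptions the locus spans 4611 bp on the scaffold, and 2169 bp equals 3 x (722 + 1), which is consistent with the transcript being a coding sequence that ends in a stop codon.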
Annotation
Heliconius      HMEL021651        9e-167  74.05%
Bombyx          BGIBMGA001238-TA  0.0     71.72%
Drosophila      Aats-cys-PA       0.0     59.19%
EBI UniRef50    UniRef50_Q7KN90   0.0     59.19%  Cysteine--tRNA ligase, cytoplasmic n=42 Tax=Eukaryota RepID=SYCC_DROME
NCBI RefSeq     XP_001648945.1    0.0     63.06%  cysteinyl-tRNA synthetase [Aedes aegypti]
NCBI nr blastp  gi|157105607      0.0     63.06%  cysteinyl-tRNA synthetase [Aedes aegypti]
NCBI nr blastx  gi|157105607      0.0     63.06%  cysteinyl-tRNA synthetase [Aedes aegypti]
Group
Gene Ontology   GO:0004817  1.4e-143  cysteine-tRNA ligase activity
                GO:0006423  1.4e-143  cysteinyl-tRNA aminoacylation
                GO:0005524  1.4e-143  ATP binding
                GO:0000166  1.4e-143  nucleotide binding
                GO:0005737  1.4e-143  cytoplasm
                GO:0006418  6.5e-14   tRNA aminoacylation for protein translation
                GO:0004812  6.5e-14   aminoacyl-tRNA ligase activity
KEGG pathway    aag:AaeL_AAEL004345  0.0
                K01883 (CARS, cysS) maps-> Aminoacyl-tRNA biosynthesis
InterPro domain [3-721]    IPR015803  0        Cysteinyl-tRNA synthetase, class Ia
                [199-431]  IPR014729  1.6e-73  Rossmann-like alpha/beta/alpha sandwich fold
                [458-624]  IPR009080  6.5e-14  Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
Orthology group MCL12983 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211867-TA
ATGGCTAAAAGAACACAACCACCTTGGTCTTTGCCTTCCGGTGAAGAAAAAAGACCAGTTTTAAATCTTTATAATAGTCTGACCCGACAGAAAGAGGAATTTGTTTGTGGTCATGGCAATCGAGTGAACTGGTATAGCTGCGGACCAACTGTGTACGACGCCTCGCACATGGGACACGCGAGGTCATACATATCATTTGATATTCTGAGGCGAGTGTTGACCACTTACTTTGGGTATGATGTGTTGTATGCAATGAATATAACCGACATTGATGATAAGATTATAAAGAGAGCGAGACAGAACTACCTGTATGAGAATTATGTCAAAGACACGAAAACATTGGACCAGATCGTGGACGATGCAACCGCAGTCATAGACTTCTATGAAGGTATCGTCAGAGACGCCAATGATCCCGATAAGAAGAACACCTTGCAGAAGATGCTGGACAGTGTTGCTTCAGCCGTAAAAACTTTAAAATCAGCACTTCAAGAAAGTAAATCTGATAAAATCGAAGAAGCAAAAAGTGATCTCTTAAAATCAGCTAAAGATCCCATATCAGTCTGGTTGGATGAGAAGTACGGTGCGACCGTCACGGAAAATGGGATATTTCAAACACTGCCCCGATATTGGGAAAATGAGTTCCACAAAGACATGAAAGCTCTCAATGTCCTGCCCCCCGATGTGTTGACCAGAGTCAGTGAGTACATTCCACAGATAATAACCTTCATCCAAAGGATCATTGACAATGGCCTGGCTTACGAGTCCAACGGTTCTGTGTATTTCAATGTGAGCGAGTTTGATGGCAAAGATAAGCATTACTACGCCAGACTCGTCCCCGAGGCCTACGGGGACAACAAATCTTTGCAAGAAGGGGAAGGTGACTTAACCGATAGCACAGCCGAGAAGCGCTCACTCAATGATTTCGCTCTCTGGAAGCAGAGCAAGACCGGCGAGCCGTCGTGGGAGTCCCCGTGGGGTCGCGGCCGGCCCGGCTGGCACATAGAGTGCTCCGCCATGGCCTCGGACGTGTTCGGAGACAACCTCGACATCCACACCGGCGGAGTGGACCTCAAGTTCCCCCATCACGACAACGAGCTGGCTCAGAGCGAGGCTCACTTCGACAAGCCGGGCTGGGTGAACTACTTCCTCCACACGGGCCATCTCACGATAGCCGGCTGTAAGATGTCCAAGTCCTTGAAGAACTTCATCACCATAAAGCAGGCTCTGGAGCAGCACTCGGCGCGGCAGCTGCGGCTGGCCTTCCTGATGCACGGCTGGAGGGACACCCTGGACTACTCGCACAACACCATGGACATGGCGCTCCAGGCTGAGAAGCTGTTCAATGAGTTCTTCCTGACGGTGAAGGATGCCCTCCGCAGTCTCCCGGAGTGCGAGGGGCCGTGGAGCGAGCCCGAGCGGCGCCTGGCCGGGGCGCTCACCGCGGCCAGGGAGCACGTGCACCAGGCGCTATGCGATAATGTGGACACTCGTTCAGCGCTGGACGCGCTCCGTGAGCTGGTGAGCGCTTGCCACGTGTACCTGGGCGGAGTCGCCGAGGTTCCCCGGAGCGCCCCGCTACTTCGAGCCTCCGCCCTCTACGTCACCGATATACTCCACGTGTTTGGAGTCATCGAGGGCCCCCGTGGGCTCATAGGGTTCCCGGCGGCTGGGGCCGGAGAAGTGGCTCTGGAGGAGGCCGTGCTGCCGTACCTGGAGGTGCTGGGTTCGTTCCGGGCGGACGTGCGGGGGGCGGCGCGGCGGGCGGGGGCGGGGGAGGTGCTGACTCTGTGTGACCGCCTGCGAGACGAGCTGCTGCCCGAGCTGGGCGTGCGGCTCGAGGACAAGCCCGATAGGACGGTGGTGAAGCTGGTCAACAAGGACGAGCTGATCCGGGAGAGAGAGGAGAGGAAGAGGCAGGAGGAAGAGAGGCAGAGGAAGAAGAGGGAGCTGGCGGACGCCCAGAGGGCCAAGGACGAGCAGAAGAAGATACCGCCCGAGGA
GATGTTCCGCAGAGAAGCGGGGAAGTACTCGCAGTTTGACGAACAGGGCCTGCCCACGCACGACCACGAGGGCAAGGAGCTGAGCAAGGGTCTCATCAAGAAGCTGCAGAAGTTGCAGCAGCTCCAGGAGAAGAAGTACAAGGAACACCTGGCGAGCCTCGGGAACTGA

Protein sequence:

>DPOGS211867-PA
MAKRTQPPWSLPSGEEKRPVLNLYNSLTRQKEEFVCGHGNRVNWYSCGPTVYDASHMGHARSYISFDILRRVLTTYFGYDVLYAMNITDIDDKIIKRARQNYLYENYVKDTKTLDQIVDDATAVIDFYEGIVRDANDPDKKNTLQKMLDSVASAVKTLKSALQESKSDKIEEAKSDLLKSAKDPISVWLDEKYGATVTENGIFQTLPRYWENEFHKDMKALNVLPPDVLTRVSEYIPQIITFIQRIIDNGLAYESNGSVYFNVSEFDGKDKHYYARLVPEAYGDNKSLQEGEGDLTDSTAEKRSLNDFALWKQSKTGEPSWESPWGRGRPGWHIECSAMASDVFGDNLDIHTGGVDLKFPHHDNELAQSEAHFDKPGWVNYFLHTGHLTIAGCKMSKSLKNFITIKQALEQHSARQLRLAFLMHGWRDTLDYSHNTMDMALQAEKLFNEFFLTVKDALRSLPECEGPWSEPERRLAGALTAAREHVHQALCDNVDTRSALDALRELVSACHVYLGGVAEVPRSAPLLRASALYVTDILHVFGVIEGPRGLIGFPAAGAGEVALEEAVLPYLEVLGSFRADVRGAARRAGAGEVLTLCDRLRDELLPELGVRLEDKPDRTVVKLVNKDELIREREERKRQEEERQRKKRELADAQRAKDEQKKIPPEEMFRREAGKYSQFDEQGLPTHDHEGKELSKGLIKKLQKLQQLQEKKYKEHLASLGN-
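
The protein record appears to be the conceptual translation of the transcript, with the stop codon written as "-". A minimal Python sketch for checking this is given below. It assumes the standard genetic code, and the function name `translate` and its `stop_symbol` parameter are hypothetical, not part of the original record. Only the first and last few codons of the transcript are translated here; the full sequences can be pasted in the same way.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G
# at the first, second and third position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (standard code assumed).

    Stop codons are written as `stop_symbol` to match this record's style.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First ten and last four codons of DPOGS211867-TA, copied from the record.
print(translate("ATGGCTAAAAGAACACAACCACCTTGGTCT"))
print(translate("CTCGGGAACTGA"))
```

The two fragments translate to MAKRTQPPWS and LGN-, which match the start and end of DPOGS211867-PA.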