Monarch geneset OGS2.0

DPOGS214514
TranscriptDPOGS214514-TA1479 bp
ProteinDPOGS214514-PA492 aa
Genomic positionDPSCF300287 - 408325-410294
RNAseq coverage129x (Rank: top 56%)
Annotation
HeliconiusHMEL0178390.075.78% 
BombyxBGIBMGA010964-TA0.067.36% 
DrosophilaCG8257-PA2e-11742.88% 
EBI UniRef50UniRef50_D6W8J32e-12746.17%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W8J3_TRICA
NCBI RefSeqXP_973716.21e-13047.12%PREDICTED: similar to cysteinyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastpgi|1892343502e-12947.12%PREDICTED: similar to cysteinyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastxgi|1892343501e-12646.69%PREDICTED: similar to cysteinyl-tRNA synthetase [Tribolium castaneum]
Group
Gene OntologyGO:00048177.7e-203cysteine-tRNA ligase activity
GO:00064237.7e-203cysteinyl-tRNA aminoacylation
GO:00055247.7e-203ATP binding
GO:00001667.7e-203nucleotide binding
GO:00057377.7e-203cytoplasm
GO:00064183.5e-19tRNA aminoacylation for protein translation
GO:00048123.5e-19aminoacyl-tRNA ligase activity
KEGG pathwaytca:6625333e-130 
 K01883 (CARS, cysS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[1-477] IPR0158037.7e-203Cysteinyl-tRNA synthetase, class Ia
[170-305] IPR0147293.3e-33Rossmann-like alpha/beta/alpha sandwich fold
[319-480] IPR0090803.5e-19Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
Orthology groupMCL13996 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214514-TA
ATGCCGAATGGGAATCCTATTGGAGCGTACGTATACAATTGTGTTGCTGAACAAAGAGTTCCTATAATATTAAATGATCCGACAATCGCTACATGGTACTCCTGTGGACCCACTGTATATGATTCAGCGCATATTGGACACGCATCCTGTTACGTAAAGTTAGACATTATTCAGAGAATATTAAAATCATTTTTTAATATCAAACTGGTCACGGCTATGGGAATTACTGATATTGATGATAAAATAATAAAAAAAAGTCAGGAGACTAAAACTGATTTTCGCACTGTAGCTAAGAAATATGAACATGAATTTTGGATTGATATGTCGAGTTTGAATGTTGAAAAGCCAATGATTATAACCAGGGTGTCTGAACATATAGATACTATAGGACAATTCGTTCATAGCCTATTAGAATCCGGTATGGCGTATGCTGGCAAGGATGGTTCAATCTATTTTGATACCTCCAAGTTTCCAGATTATGGAAAGTTACAAAAGGTTCAAGATATCAGAGAACCAAGTAATGAATATAAGAGAAATAAAATGGATTTCGCCCTATGGAAAGGTTATAAGCCCGGTGAACCTTCATGGCCTATGAAATTTGGTGACGGGAGACCAGGCTGGCACATTGAATGCTCAGCCATGGTCAGTAAAGTATTCGGAACCCAATTAGACTTCCATGCAGGTGGAATTGACTTGAGATTCCCACACCATGAGAATGAAGAAGCCCAAGCCTGTGCTTTCCACAACACACGACAGTGGGCGAACTATTGGCTACATGTCGGCCATCTTAATCTAAAGGAAACAAAAATGTCCAAATCGCTTAAGAACACAATTTTAATACCAGATATACTAGAAAAGTATAGCGCTGATGCATTTCGAATGGCATGTCTCATGTCTAATTACCGATATCCAATGGAATATAGCGATGACATAATGGACACTGCATCTGATATCTTGAATAAATTTAAATTCTTTTTGAAAGATGTCGAGGGTTATGCAAATAGGAATTCTGTTAGATGTGGGGACTACAATGACAAGCTATTAGAGGAGTTACAGAGAGTTGAGGAGAGTAATATGGAAGCTATGAGAAATGACTTTGATACGGCATCATGCATTAATTCATTAATGAATCTGGTATCTTTAGTGAATAAAATTATTAAAGCGGATACAGGAGATTACACTCCTGTACCTGTGATTCTCATAGCCGAGTACATCGCCTTTGTTTTGAAAAGATTCGGTTTGAAGCTGACAGACGAGAGAAGTGATATATCTAGTCCCTTGTTAGATACATTAGTAGAATTCAGACATACAGTAAGAAAGAAGGCTTTGAGCGACAAAGATAAAACTCTTTTGAATGCTTGCGATGTTGTTCGTGATAAGTTGAAGACAATGAAAGTCCAAATAAATGACAGCAAAGAGACTTCTTCATGGGTCGTTAATGAAATGACATTTATAAACATTTACGCTAATCGCGTCTAG

Protein sequence:

>DPOGS214514-PA
MPNGNPIGAYVYNCVAEQRVPIILNDPTIATWYSCGPTVYDSAHIGHASCYVKLDIIQRILKSFFNIKLVTAMGITDIDDKIIKKSQETKTDFRTVAKKYEHEFWIDMSSLNVEKPMIITRVSEHIDTIGQFVHSLLESGMAYAGKDGSIYFDTSKFPDYGKLQKVQDIREPSNEYKRNKMDFALWKGYKPGEPSWPMKFGDGRPGWHIECSAMVSKVFGTQLDFHAGGIDLRFPHHENEEAQACAFHNTRQWANYWLHVGHLNLKETKMSKSLKNTILIPDILEKYSADAFRMACLMSNYRYPMEYSDDIMDTASDILNKFKFFLKDVEGYANRNSVRCGDYNDKLLEELQRVEESNMEAMRNDFDTASCINSLMNLVSLVNKIIKADTGDYTPVPVILIAEYIAFVLKRFGLKLTDERSDISSPLLDTLVEFRHTVRKKALSDKDKTLLNACDVVRDKLKTMKVQINDSKETSSWVVNEMTFINIYANRV-