Monarch geneset OGS2.0

DPOGS214446
TranscriptDPOGS214446-TA1287 bp
ProteinDPOGS214446-PA428 aa
Genomic positionDPSCF300441 - 76784-79919
RNAseq coverage82x (Rank: top 64%)
Annotation
HeliconiusHMEL0044312e-16363.84% 
BombyxBGIBMGA009612-TA1e-8068.81% 
DrosophilaAats-pro-PA5e-9541.38% 
EBI UniRef50UniRef50_Q7QAP88e-9845.60%AGAP003589-PA n=2 Tax=Culicidae RepID=Q7QAP8_ANOGA
NCBI RefSeqXP_971952.11e-10243.46%PREDICTED: similar to prolyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastpgi|910780763e-10143.46%PREDICTED: similar to prolyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastxgi|910780761e-9743.46%PREDICTED: similar to prolyl-tRNA synthetase [Tribolium castaneum]
Group
Gene OntologyGO:00055244.2e-35ATP binding
GO:00064184.2e-35tRNA aminoacylation for protein translation
GO:00001664.2e-35nucleotide binding
GO:00048124.2e-35aminoacyl-tRNA ligase activity
GO:00057374.2e-35cytoplasm
GO:00064338.9e-14prolyl-tRNA aminoacylation
GO:00048278.9e-14proline-tRNA ligase activity
KEGG pathwaytca:6606464e-102 
 K01881 (PARS, proS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[59-220] IPR0023144.2e-35Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain
[303-416] IPR0041541.1e-15Anticodon-binding
[75-93] IPR0023168.9e-14Prolyl-tRNA synthetase, class IIa
Orthology groupMCL12355 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214446-TA
ATGAAATTATTATCTAAAATATTCCAACCTGTGATCACAATACCTAAGGGTGCGAAGATAAAGAACACGGAAATAACATGTAAAAGTCAGAAACTCTTGTTAGAATGCGGTCTGGTCCGTCCAACGAGCACCGGTTTCTTCACCCTGCTACCGTTGGCAAGACGAGCTCTCACCAAATTAGAAAACATTGTACACCGCTGCTTAGAAGACGTCGGTGCTCAACAGATATCACTACCTTGTCTCACTTCCAGCAGGCTATGGGAAGCGAGCGGACGTTTAGACAGAGTTGGCTCCGAGTTGTTAAAAGTAGAAGATAGACACAACAAGAAGTATATATTAAGTCCGACTCACGAGGAGGCCATCGCCGACTTGTTGTCCGATGTAGCTCCGTTGTCACACAAACAGTTACCGTTCATACTGTACCAGATTGGTAACAAGTATCGTGACGAGCTCCGTCCTAAGCACGGTCTGCTGAGGTCGAGGGAGTTCCTCATGATGGACGCCTACAGTGTACACACGGACACGGACAGCGCGCTCTGTACATACGACACACTCACACACGCGTACAGGAACGTGTTCAGAGAACTGCGGCTGCCGGTGAGGAGAGTGGAGGCTCCGTCGGGTGACATGGGAGGCACTCTCTCCCACGAGTGGCAGCTGCCAGCTCCCTCTGGCGAGGACTGTCTGTCTGTGTGTCCGTCTTGCTCACACACCACCTTACTGGAGGAGGGGAAGGAGGGCAGAAAATGTGTCGCGTGTGGCAGAGAGACGGAGATATGTAGCAGTATTGAGGTTGGTCACACGTTCGTCCTCGGTGACAGGTACAGCGCCCCCATCGTGATGGCCTGCTATGGTATAGGACTCACGAGGCTGCTTGCCGCTAGTGTGGAGCTCCTCTCATCCGAGCGTTCCCTGAGGTGGCCGCACGCTCTGGCGCCCTACAAGGCCATAGTTATAGGACCTAAGGAAGGTTCTAAGGAGTGGGTACATCATGACAGTCCTCGGTTGGAGCAGCTCGGGGCTCAGGTGGAGGCTGTAGCTGGTGACGTGGTTTTGGACGACAGACATCACCTCACCATAGGGAAGAGATTGCTTCAGGCTGATAAAACTGGCTATCCATACATCATAGTGTGCGGGCGCTCCGCCCTGGAGTCTCCGCCGCGGTATGAACTGCATCGAGACCAAGGCGAAGTCCTAACTCTGCCGCTAAACGAACTATTAGCATTCATTAAAGATGATAACAAAGAACGAGATTTAAAGTTTAAAAGAGAAAGCGAATATATATAA

Protein sequence:

>DPOGS214446-PA
MKLLSKIFQPVITIPKGAKIKNTEITCKSQKLLLECGLVRPTSTGFFTLLPLARRALTKLENIVHRCLEDVGAQQISLPCLTSSRLWEASGRLDRVGSELLKVEDRHNKKYILSPTHEEAIADLLSDVAPLSHKQLPFILYQIGNKYRDELRPKHGLLRSREFLMMDAYSVHTDTDSALCTYDTLTHAYRNVFRELRLPVRRVEAPSGDMGGTLSHEWQLPAPSGEDCLSVCPSCSHTTLLEEGKEGRKCVACGRETEICSSIEVGHTFVLGDRYSAPIVMACYGIGLTRLLAASVELLSSERSLRWPHALAPYKAIVIGPKEGSKEWVHHDSPRLEQLGAQVEAVAGDVVLDDRHHLTIGKRLLQADKTGYPYIIVCGRSALESPPRYELHRDQGEVLTLPLNELLAFIKDDNKERDLKFKRESEYI-