Monarch geneset OGS2.0

DPOGS200603
TranscriptDPOGS200603-TA1287 bp
ProteinDPOGS200603-PA428 aa
Genomic positionDPSCF300076 - 457391-460259
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0010042e-12554.94% 
Bombyx% 
DrosophilaCG8097-PA8e-3227.78% 
EBI UniRef50UniRef50_C3PPE76e-12455.18%DNA sequence from clone AEHM-27I5 (Fragment) n=1 Tax=Heliconius melpomene RepID=C3PPE7_9NEOP
NCBI RefSeqXP_001651586.19e-5536.26%hypothetical protein AaeL_AAEL005905 [Aedes aegypti]
NCBI nr blastpgi|2294874032e-12355.18%unnamed protein product [Heliconius melpomene]
NCBI nr blastxgi|2294874031e-12555.18%unnamed protein product [Heliconius melpomene]
Group
Gene OntologyGO:00048141.1e-19arginine-tRNA ligase activity
GO:00055241.1e-19ATP binding
GO:00064201.1e-19arginyl-tRNA aminoacylation
GO:00064183.6e-18tRNA aminoacylation for protein translation
GO:00001663.6e-18nucleotide binding
GO:00048123.6e-18aminoacyl-tRNA ligase activity
GO:00057373.6e-18cytoplasm
KEGG pathway 
InterPro domain[277-412] IPR0089091.1e-19DALR anticodon binding
[271-412] IPR0090803.6e-18Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
Orthology groupMCL14553 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200603-TA
ATGTTACAAGATGATATAAGTGTTTTTATAAGTAATTTATACAAGTATCTAGTTGGAAAGGACGATTATGTGCATGGACTTTTAATAAAAAAACACTCCGGCAACTTGGTAAATTTGGGTGATTTGAGTTTTCCTAACACTGTGAAATCATGGCAAGAGCTATTGAACGCTGAAGAGTTACAAAATAATTCAGACAAAACATTCTTGAAATTTATTGACAAAAATGTTTCGGACGTTATTCAAGCGTCTCAAAATTGGGAAATGTCGGTGGTGAAAGCTACGGAGTTAAATAACCGAATTTATTTGTTTTTGGATAGAACTAAAGCTATTAGCATGGGTTTACAGTCCGCATTGAAAAATAATAACTTACTTTTACATAACCTAAATAAAGTTACAAATTTAGTAAAAGTAGATCCAAAGTGCGTTGAAGATAGTAGTGTTACATATTTGCGTTTAAAATATTTAACAGAAGTAGTTAAGAACTTATGTTTGGTAAATGATATTGATGAAGATCATAATATCTTAGTGACAAGCCGACATAATGATAATGACAGGACAGTGTTCTGCGGGACTGTCTTAAATTCAAAAACTGGATTGAAAGAGAATTTAACTACAGCCGAAGAATATATTAAACTAAGACAAGATGAGATGACATTAATTGCTCAACATAAGTATGGTGTGAGAGTTTCTACTGATGTTAAATGGAGGGAATTCATAGCACATTTAGGGGAGTCGGCCGTAGCATTTGAGATGTTACAAACAAGATCATCCAGCCCTGTGAAGGTACACTTCGATGCTTCTGGTGGCTCCAGTAAAGGCGCAGCATTCATTTTGTACAATTGTGCTAGACTGGAAACCATGATTCGTACGTTCAACGAGAGGGTTGCGGACGGCAGCTATCCAGATTTACCGAGCCTCTATAATGTAGACTTATCGCTACTTACCGATGAGGACGAATGGAGTATTATATTCACCTATATAATGGCCGTGGCGTCTCTTATAAGAAACACTGTAGATATGAATGGTGTATGTGAATTCCGACCTCACCTCATTTGTAACTTTCTGAGCGGAATGGTCAAAGTATTCAGTCAGTATTACAGACGTGTCAGGATCCTAACGGAACCAAGAAAACATCTACTGCCTATAATGTTTGCGAGAATTCATACGCTTATAATATTAAACGATACGTTAAAAGTGTGTTTAAAAATATTAAACATTAAAAGCGTTTCACAAATGTATACAGTATTTATTTTGTCTTCCGGCACTCCTTTATATTTGATAAAATGA

Protein sequence:

>DPOGS200603-PA
MLQDDISVFISNLYKYLVGKDDYVHGLLIKKHSGNLVNLGDLSFPNTVKSWQELLNAEELQNNSDKTFLKFIDKNVSDVIQASQNWEMSVVKATELNNRIYLFLDRTKAISMGLQSALKNNNLLLHNLNKVTNLVKVDPKCVEDSSVTYLRLKYLTEVVKNLCLVNDIDEDHNILVTSRHNDNDRTVFCGTVLNSKTGLKENLTTAEEYIKLRQDEMTLIAQHKYGVRVSTDVKWREFIAHLGESAVAFEMLQTRSSSPVKVHFDASGGSSKGAAFILYNCARLETMIRTFNERVADGSYPDLPSLYNVDLSLLTDEDEWSIIFTYIMAVASLIRNTVDMNGVCEFRPHLICNFLSGMVKVFSQYYRRVRILTEPRKHLLPIMFARIHTLIILNDTLKVCLKILNIKSVSQMYTVFILSSGTPLYLIK-