Monarch geneset OGS2.0

DPOGS216071
TranscriptDPOGS216071-TA1533 bp
ProteinDPOGS216071-PA510 aa
Genomic positionDPSCF300067 + 373080-376584
RNAseq coverage286x (Rank: top 38%)
Annotation
HeliconiusHMEL0089330.062.70% 
BombyxBGIBMGA008870-TA9e-14655.65% 
DrosophilaAte1-PB2e-8937.98% 
EBI UniRef50UniRef50_O952601e-11845.59%Arginyl-tRNA--protein transferase 1 n=104 Tax=Coelomata RepID=ATE1_HUMAN
NCBI RefSeqXP_395484.25e-12044.72%PREDICTED: similar to Arginyl-tRNA--protein transferase 1 (R-transferase 1) (Arginyltransferase 1) (Arginine-tRNA--protein transferase 1) [Apis mellifera]
NCBI nr blastpgi|3545025166e-12246.73%PREDICTED: arginyl-tRNA--protein transferase 1-like isoform 1 [Cricetulus griseus]
NCBI nr blastxgi|3545025169e-12246.92%PREDICTED: arginyl-tRNA--protein transferase 1-like isoform 1 [Cricetulus griseus]
Group
Gene OntologyGO:00165985.5e-155protein arginylation
GO:00040575.5e-155arginyltransferase activity
KEGG pathway 
InterPro domain[1-511] IPR0171375.5e-155Arginine-tRNA-protein transferase 1, eukaryotic
[280-422] IPR0074722.2e-42Arginine-tRNA-protein transferase, C-terminal
[10-94] IPR0074718.3e-26Arginine-tRNA-protein transferase, N-terminal
[334-438] IPR0161818.7e-08Acyl-CoA N-acyltransferase
Orthology groupMCL13837 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216071-TA
ATGAATCATAGTTTCATTAAATATTATTCTGAGCATGAGGGCTACAAATGTGGGTACTGTAAACGTCCAGATACAAATTACAGTCACGTTATGTGGGCGCATGCAATGACGGTTACTGACTACCAGGATTTAATAGATAGAGGTTGGAGGAGATCTGGAAAACAATGCTACAAGCCAACATTAGAAGTCATCTGCTGTCCCATGTATACCATACGATGTAGGGCATTAGAATTTAAGGCTAGTAAATCTCAAAAGAAAGTTTTGAAGAGTTTCAATAAGTTTTTAATCGGAGAAGAAATAAGTGATATAAGTGCACAAGAAAGTCGAGAAGATGTTGCAATGGAACAAGTTGAAGGGCAGGAGCAGTTTCTAGAATCTAAAAGGCCACATGAAGATGTAAACATTGCCGGAATGGATATTCCATTTATAGAAGAAGCAGATGACTCAAGAAAACTTGAACTATCTGAAATAGAGACAAAAGATGACAGTCATATGCAGAAATTTGATTCTCATCAAGATTTACAGCAAGCGAGTTCATCAACAGCCAGTAGTTCATGCCTGTTGGGAAACACATCAAAAGAAAAATCTAACAAAGTGACCGGTGCAGATCCCACAAAAGCTCCATGCAAAAAAGCGAAACAGGCAAGGCGAGAGAGAATGTTAGAAAAGCTCCAAAGGAAGGGTATCAATGTTACCACTTTAGATAATACCGGCAAAAATACACCAAAAACCATTGAAGACATTATTAATGAACTACCAGATAATGTCAAGAGTAAACTTGAGATAAAATTGGTGAGAACAGAACCACCGAGTCCAGAGTGGCTGGCTACGAAATCAGAAAGTCATGAAGTTTATGTGAAATACCAAACTATTGTTCATGGAGATAAACCTGAGAAATGTACTGAACCCAAGTTCCATGATTTTTTGGTCCACAGTCCATTACTGGAAGAATATTCCGAAGTGGGTCCCCCATGTGGATATGGTTCATTCCACCAACAGTATTGGCTGGACGGAAAGATTATAGCCGTTGGCGTTATAGACATACTGCCAAAATGTATATCGTCCGTATACTTTTTTTATGATCCCCAATATTTATGCCTGAGTTTAGGAACTTATGGAGCTTTAAGAGAAATAGCATTCACAAGACAGTTACAAAAGATTTGTCCTAATCTGAAATATTACAACATGGGATTCTACATACATACTTGTACTAAGATGAGATACAAGGGAAAGTTCCACCCATCGGACCTATTGTGCCCTGAGACTTTCAAGTGGTTTCCCATCAAGGAATGTATAGCAAAGTTGGAAATATCAAAGTATTCAAGATTTGATCCTGATCTAGATGGTGTGGATGAAAATTATCCCACAGATAATGACGTGAACAATATAAAAGTTTTATCAAACGGGCAAGTGCAAATTTACAAAGTTTTCAAACGGAAAGCAGGAAGGAAGTACAATGAAGAATTTGAAGTTCTTGAATATGCGAGGCTGGTTGGAGGCAAGACCGCTAGAAGTATTATAATGGTCATGTAG

Protein sequence:

>DPOGS216071-PA
MNHSFIKYYSEHEGYKCGYCKRPDTNYSHVMWAHAMTVTDYQDLIDRGWRRSGKQCYKPTLEVICCPMYTIRCRALEFKASKSQKKVLKSFNKFLIGEEISDISAQESREDVAMEQVEGQEQFLESKRPHEDVNIAGMDIPFIEEADDSRKLELSEIETKDDSHMQKFDSHQDLQQASSSTASSSCLLGNTSKEKSNKVTGADPTKAPCKKAKQARRERMLEKLQRKGINVTTLDNTGKNTPKTIEDIINELPDNVKSKLEIKLVRTEPPSPEWLATKSESHEVYVKYQTIVHGDKPEKCTEPKFHDFLVHSPLLEEYSEVGPPCGYGSFHQQYWLDGKIIAVGVIDILPKCISSVYFFYDPQYLCLSLGTYGALREIAFTRQLQKICPNLKYYNMGFYIHTCTKMRYKGKFHPSDLLCPETFKWFPIKECIAKLEISKYSRFDPDLDGVDENYPTDNDVNNIKVLSNGQVQIYKVFKRKAGRKYNEEFEVLEYARLVGGKTARSIIMVM-