Monarch geneset OGS2.0

DPOGS203822
TranscriptDPOGS203822-TA1413 bp
ProteinDPOGS203822-PA470 aa
Genomic positionDPSCF300010 + 2274225-2275637
RNAseq coverage1049x (Rank: top 12%)
Annotation
HeliconiusHMEL0133350.088.51% 
BombyxBGIBMGA003725-TA0.082.59% 
DrosophilaNmt-PA3e-17565.74% 
EBI UniRef50UniRef50_O605512e-17365.53%Glycylpeptide N-tetradecanoyltransferase 2 n=132 Tax=root RepID=NMT2_HUMAN
NCBI RefSeqXP_001600387.10.070.66%PREDICTED: similar to N-myristoyltransferase, putative [Nasonia vitripennis]
NCBI nr blastpgi|3071805810.071.55%Glycylpeptide N-tetradecanoyltransferase 2 [Camponotus floridanus]
NCBI nr blastxgi|3320225470.069.70%Glycylpeptide N-tetradecanoyltransferase 1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00064998.3e-257N-terminal protein myristoylation
GO:00043798.3e-257glycylpeptide N-tetradecanoyltransferase activity
KEGG pathway 
InterPro domain[67-470] IPR0009030Myristoyl-CoA:protein N-myristoyltransferase
[258-469] IPR0161812.2e-103Acyl-CoA N-acyltransferase
[282-469] IPR0226775.7e-83Myristoyl-CoA:protein N-myristoyltransferase, C-terminal
[109-268] IPR0226763.4e-78Myristoyl-CoA:protein N-myristoyltransferase, N-terminal
Orthology groupMCL11343 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203822-TA
ATGGAGGATAATAAAATGAGTCAATCCGAAGAGTTTCAAAAATTAAATGAAGAAAAAAATCCCAAAAAGAAAAATAAGAATAAAAAAAAGCGTATCATTGGAGATGGTGATAATGGTGGTGCTCAGTCAGCGGATGGATTAGCAATTTCTAACTTAAAAGATATTCAACAAAACATATCTTTAAAAGATCTTAAAACGGCTATGGAAGTCCTTAATTTGCAGCAAAAACCGGCAAAAACAACCGAGGAAGCTCTGCATAAGTCGTATCAATTTTGGTCCACTCAACCAGTACCTAAAATGTATGAAAAAGTTATAACCAACGAACCTATTGAACCACCCAAGTCCACCGATGAAATCCGTTCTGAGCCTTATTCATTGCCTGAGGGTTTTCATTGGGATACTCTTAATCTAAACGAACCTCTGGTTTTAAAAGAGCTATACACTTTATTAAACGAAAATTATGTAGAAGACGATGACTGTATGTTCCGATTTGACTATCAAACTGATTTCTTAAAATGGGCTCTTCAGCCTCCTGGTTGGAGAATGGAATGGCATTGTGGAGTGCGTGTAGTGAAATCTGGTAGATTAGTTGGTTTTATATCTGCAATACCAGCAACACTACGAATTTATGAAAAAATACAAACTGTTGTTGAAATTAACTTTTTATGTGTCCATAAGAAACTACGTGCTAAACGTGTCGCTCCGGTGTTGATAAGAGAAATCACTCGTAGAGTTAATTTGACAGGAATATTTCAAGGTGTTTACACGGCCGGTATAGTTTTGCCAACACCCATTGCTACTTGTCGATATTGGCATAGATCTTTAAATCCTAAAAAACTAATTGATGTTAAATTCAGTCACTTATCTAGAAATATGACAATGCAAAGAACCCTTAAATTATTTAAGTTGCCAGATACACCAAAAACATCAGGTTTTAGAAAAATGGAAGTTAAAGATTCTGATAAAGTTGTTAAACTTTTGAATGATTACTTACAGAAATTTGATTTAGTGCCAATATTCTCAGAAGAGGAATTCAAACATTGGTTTACACCGCAGGCGGGTATTATTGATAGTTATGTTGTGGAAGGTTCTGACGGAAGTATCACAGATTTTGTGAGTTATTATACATTACCGTCCACTGTTGTCTACCACCCAGTTCATAAAACCATAAAAGCTGCTTACTCATTTTACAATGTCTCTACTAAAACGCCATGGGTGGAATTAATGTTAGATGCATTGATTACGGCCAAGAACTCTGGATTTGATGTGTTTAATGCTTTAGACCTAATGGAAAATAAAGAGTTCCTAGAGCCTCTCAAATTTGGTATTGGAGATGGAAATCTACAATACTACTTGTATAATTGGAGATGTCCAAGCATTACATCAAATAAAATTGGTTTGGTTCTACAATAG

Protein sequence:

>DPOGS203822-PA
MEDNKMSQSEEFQKLNEEKNPKKKNKNKKKRIIGDGDNGGAQSADGLAISNLKDIQQNISLKDLKTAMEVLNLQQKPAKTTEEALHKSYQFWSTQPVPKMYEKVITNEPIEPPKSTDEIRSEPYSLPEGFHWDTLNLNEPLVLKELYTLLNENYVEDDDCMFRFDYQTDFLKWALQPPGWRMEWHCGVRVVKSGRLVGFISAIPATLRIYEKIQTVVEINFLCVHKKLRAKRVAPVLIREITRRVNLTGIFQGVYTAGIVLPTPIATCRYWHRSLNPKKLIDVKFSHLSRNMTMQRTLKLFKLPDTPKTSGFRKMEVKDSDKVVKLLNDYLQKFDLVPIFSEEEFKHWFTPQAGIIDSYVVEGSDGSITDFVSYYTLPSTVVYHPVHKTIKAAYSFYNVSTKTPWVELMLDALITAKNSGFDVFNALDLMENKEFLEPLKFGIGDGNLQYYLYNWRCPSITSNKIGLVLQ-