Monarch geneset OGS2.0

DPOGS204336
TranscriptDPOGS204336-TA1377 bp
ProteinDPOGS204336-PA458 aa
Genomic positionDPSCF300142 - 6887-13139
RNAseq coverage256x (Rank: top 41%)
Annotation
HeliconiusHMEL0053921e-5167.44% 
BombyxBGIBMGA013336-TA2e-4331.30% 
DrosophilaDgkepsilon-PA7e-8438.88% 
EBI UniRef50UniRef50_E0VQ752e-8539.00%Diacylglycerol kinase epsilon, putative n=6 Tax=Neoptera RepID=E0VQ75_PEDHC
NCBI RefSeqXP_002428269.14e-8639.00%Diacylglycerol kinase epsilon, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420152187e-8539.00%Diacylglycerol kinase epsilon, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420152184e-8438.96%Diacylglycerol kinase epsilon, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00041431e-34diacylglycerol kinase activity
GO:00072051e-34activation of protein kinase C activity by G-protein coupled receptor protein signaling pathway
KEGG pathwayphu:Phum_PHUM3727401e-85 
 K00901 (E2.7.1.107, DGK, dgkA)maps-> Phosphatidylinositol signaling system
    Glycerolipid metabolism
    Glycerophospholipid metabolism
InterPro domain[267-435] IPR0007561e-34Diacylglycerol kinase, accessory domain
[125-246] IPR0012062.1e-29Diacylglycerol kinase, catalytic domain
Orthology groupMCL15288 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204336-TA
ATGTTATCCCTTGTGGGCCTTTTCTGTGAATGCTGCGGCATAAGTGCATGTAAAAAATGTCATCGTATAATTGATAAAAAGCAAAGATGTAAACAGATGTGGTGGCCGGCAGAGAAACCATTCTATCATCTTTGGGTTAATGTTGGTTCTGTTACTAGAGAGAATCCTGACCACATAGAAGAGGGGGATAATTTAAAAAAATTCTTCTGTTCCTGGTGCCAAAGAGTAAGGCTCAGTCAGGAAAATGAATTATCAAATACCGAGAGATGTGACTTCCAAAAATATAAAGATATCATTATTCCACCGATGAATGTAAATATGGATAGAGGAAAAATTATTAAAATAGAACCTGTACCTGATGACGATTGGGAACCATTCATAATTTTTGCAAATAGGAAATCAGGAAGTAATAGAAGTGACGAGGTTCTGTCACTCTTTAGAGGTCTTTTGAATCCATTGCAGATTATAGATATTGGATCAATGCCACCTGAGAAAGCAGTGAAGTGGCTTCCAGAGCGATGTCGTATCATCGTTGCTGGAGGTGATGGTACAGTGGCGTGGGTGTTAAATACTTTACACACAGTACCACATATTAAAGCGTCAGTTGGTATTCTACCTACTGGCACTGGTAACGATCTGTCTCGAGCCCTCGGCTGGGGAGGCGGCTGTTCTGACTTAGACGCATCCGCTATTATTATATCTATGAAGCAAGCAGAAGTACAAATACTTGACAGATGGAAAGTTTCTATTGGTCCGTTGTCCCGTGGGCTGCGATCACGTGGTCGTGTGTTGTTCGCTCACAACTACGTGAGCGTGGGTGTGGACGCGCAGGTCGCGCTGGACTTCCATCGTGCTCGTGCTCACATCCTCAAAAGATGCGCAAGCAGATACATAAACTATTTAGCGTACGCGCTGCTTGGTGTTGGTCGCGCTCTTGACGATGGTGGCTGCGGAGGGCTGGAGCGTCGGTTGCGTGTGAGGATAGCTCGTGAACACGGAGAGGGTCAAGAGGCTCGTGGCGGCCATGGAAACTTAAACACGCTTGACTTGCCGCCATTACAGGCGCTAGTGTTACTCAATATACCCTCTTGGGGCGCCGGCGTAGATCTATGGAGTTTAGGAAACGAAGAAGACGTCGGTGAACAGTTCATGGATGATAGGAAATTAGAAGTGGTTGGTATATCTTCGTCGTTCCACATCGCGAGGCTTCAGTGTGGTTTGGCAGAGCCCTACCGCTTTGCGCAGACAAGTTATGTAGAGATGAGTCTGGAGGGATGTGTGGCGATGCAGGTCGACGGGGAGCCGTGGATGCAGGGCCCCGCAACAATCCGCCTCGAGCCAGCCGGCCAGTCTTGCATGTTACGTAACAATGTTTAG

Protein sequence:

>DPOGS204336-PA
MLSLVGLFCECCGISACKKCHRIIDKKQRCKQMWWPAEKPFYHLWVNVGSVTRENPDHIEEGDNLKKFFCSWCQRVRLSQENELSNTERCDFQKYKDIIIPPMNVNMDRGKIIKIEPVPDDDWEPFIIFANRKSGSNRSDEVLSLFRGLLNPLQIIDIGSMPPEKAVKWLPERCRIIVAGGDGTVAWVLNTLHTVPHIKASVGILPTGTGNDLSRALGWGGGCSDLDASAIIISMKQAEVQILDRWKVSIGPLSRGLRSRGRVLFAHNYVSVGVDAQVALDFHRARAHILKRCASRYINYLAYALLGVGRALDDGGCGGLERRLRVRIAREHGEGQEARGGHGNLNTLDLPPLQALVLLNIPSWGAGVDLWSLGNEEDVGEQFMDDRKLEVVGISSSFHIARLQCGLAEPYRFAQTSYVEMSLEGCVAMQVDGEPWMQGPATIRLEPAGQSCMLRNNV-