Monarch geneset OGS2.0

DPOGS200388
Transcript: DPOGS200388-TA (1620 bp)
Protein: DPOGS200388-PA (539 aa)
Genomic position: DPSCF300511 + 4405-9422
RNAseq coverage: 362x (Rank: top 33%)
Annotation
Heliconius: HMEL003137, E-value 0.0, 77.93%
Bombyx: BGIBMGA013299-TA, E-value 0.0, 62.57%
Drosophila: CG7995-PD, E-value 4e-123, 56.04%
EBI UniRef50: UniRef50_UPI0000D55908, E-value 5e-125, 56.93%, UPI0000D55908 related cluster n=1 Tax=unknown RepID=UPI0000D55908
NCBI RefSeq: NP_001166873.1, E-value 0.0, 81.22%, glycerol kinase-like protein [Bombyx mori]
NCBI nr blastp: gi|290565764, E-value 0.0, 81.22%, glycerol kinase-like protein [Bombyx mori]
NCBI nr blastx: gi|290565764, E-value 0.0, 81.22%, glycerol kinase-like protein [Bombyx mori]
Group:
Gene Ontology:
    GO:0006072, E-value 5.5e-166, glycerol-3-phosphate metabolic process
    GO:0004370, E-value 5.5e-166, glycerol kinase activity
    GO:0016773, E-value 5.5e-166, phosphotransferase activity, alcohol group as acceptor
    GO:0005975, E-value 5.5e-166, carbohydrate metabolic process
KEGG pathway: ame:409261, E-value 5e-150
    K00864 (E2.7.1.30, glpK) maps to:
    Plant-pathogen interaction
    Glycerolipid metabolism
    PPAR signaling pathway
InterPro domain:
    [1-389] IPR005999, E-value 5.5e-166, Glycerol kinase
    [1-389] IPR000577, E-value 5.5e-166, Carbohydrate kinase, FGGY
    [123-316] IPR018485, E-value 1.1e-59, Carbohydrate kinase, FGGY, C-terminal
    [1-114] IPR018484, E-value 6.6e-24, Carbohydrate kinase, FGGY, N-terminal
Orthology group: MCL10134, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200388-TA
ATGAGATGGCTTAAAGATAACGTGAAGACAGTGAGACGAGCTAGTAGAGACAAAAGATTGTTGTTCGGTACCGTTGATAGTTGGATTGTGTGGAATTTAACCGGCGGACCAAAAGGGGGTATTCATATAACTGACGTCACGAACGCTTCAAGAACGTTGCTCATGAATCTGGAAACCTTGAATTGGGATCCAATATTATGTAGGTTATTCGGTATACACAGAGATTCGTTGCCACAAATTCGTAGCAGCGCTGAAGTGTATGGCGTCGTGCAGGACGGATCTGTATTGGATGGTATACAAATATCTGGGATCTTGGGCAACCAACAAGCAGCTTTGGTCGGGCAGAGCTGTCTACTGGCTGGTCAAGCGAAAAATACTTATAGAGGGGGATGTTTCCTTTTATACAACACCGGTCCGAGGAGGGTTAATTCCTCGCATGGATTGCTAACGACTGTGGCCTACAAGTTGGGACCAAACGCACCAGCTATATACGCATTAGAAGGGTCCATAGCGGTCGCGGGGGGTGCTATGAAATTCCTCCGGGACAATCTCAAGTTAATCAAAGATGTAGCCCAGGACACAGAACATATAGCGGGCCAAGTATTCTCTACTGGGGACGTTTATTTCGTACCAGCCTTCAATGGATTGTATGCGCCATATTGGAGAAAGGACGCCAGAGGTATTATCTGTGGTTTGACAGCATTCACGACTAAGAATCATATAATAAGAGCTGCTTTGGAGGCTGTCTGTTTTCAGACGAGAGATATATTGGAGGCTATGAATAAAGACTGTGGCATGCCGCTCTCCAAGTTACATGTGGATGGGAAAATGACTTCTAACGACTTATTGATGCAACTACAAGCGGATCTCATCGGTATACCCGTTTTGCGTTCCCATACTTGGGACATGTCAGCGCTGGGCGTAGCGGTGGCTGCGGGCTCCGCGGCCGGCGTTTGGAGCGCCGAGCGGTGGAGGGGGCAGGCGACCCCGGCCGACACATTCCTCCCTACCACCACTGATGACGTGATTTCTTCGGATGATAACGGTGGCATTACGGGTCTACTCCCACTGTTTGAAGACGATCGAGATGCCAGATACACGAAATGGAAAATGGCAGTCCAACGATCACTCGGCTGGGCGACAACTAAGAAATCATTCGCGATGACAGGTCAGAAAAGAAAAAATAGCATTTTTCAGAGCATAGATACAACAAGTCCACCTAATAGTAGAAAAAATTCTGAGGATATAGAAAACTTAACGGACGTAATGCCGGATGATAAACATATAGAGGAAGTAATACAATCGTCTGCTAGGAAGTTTTCTGTATTCGTGCCACAGGAGATACAAAGAGACGAGAGGACTATATTAGACGATATAGAATATTATATGAATAAAAAGGAATGCGACTATGGCATGTGTGAGTGTTGCCAGAAGAGAAAATTGCACACAGATTGGAAAACTATAGAGGAAATTATTGAAAACGATCCCGAAATCATATTTGATGACGACAAACCCATCGTTGTGCCAAAGAATTCGCAAGATGACGTCACACGCAGCCCCATGGTTTATCCAGATTTAATCGATATTTGCAAACAAATCGATGAGGCGAATCTCAACTGA

Protein sequence:

>DPOGS200388-PA
MRWLKDNVKTVRRASRDKRLLFGTVDSWIVWNLTGGPKGGIHITDVTNASRTLLMNLETLNWDPILCRLFGIHRDSLPQIRSSAEVYGVVQDGSVLDGIQISGILGNQQAALVGQSCLLAGQAKNTYRGGCFLLYNTGPRRVNSSHGLLTTVAYKLGPNAPAIYALEGSIAVAGGAMKFLRDNLKLIKDVAQDTEHIAGQVFSTGDVYFVPAFNGLYAPYWRKDARGIICGLTAFTTKNHIIRAALEAVCFQTRDILEAMNKDCGMPLSKLHVDGKMTSNDLLMQLQADLIGIPVLRSHTWDMSALGVAVAAGSAAGVWSAERWRGQATPADTFLPTTTDDVISSDDNGGITGLLPLFEDDRDARYTKWKMAVQRSLGWATTKKSFAMTGQKRKNSIFQSIDTTSPPNSRKNSEDIENLTDVMPDDKHIEEVIQSSARKFSVFVPQEIQRDERTILDDIEYYMNKKECDYGMCECCQKRKLHTDWKTIEEIIENDPEIIFDDDKPIVVPKNSQDDVTRSPMVYPDLIDICKQIDEANLN-
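
The transcript and protein records can be checked against each other: 1620 bp is 540 codons, i.e. 539 residues plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code and assuming the trailing `-` in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_char: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_char` (assumed here to be '-',
    matching the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_char if aa == "*" else aa)
    return "".join(residues)


# First ten and last five codons of DPOGS200388-TA, copied from the record.
print(translate("ATGAGATGGCTTAAAGATAACGTGAAGACA"))
print(translate("GCGAATCTCAACTGA"))
```

Passing the full DPOGS200388-TA sequence to `translate` and comparing the result with the DPOGS200388-PA record would extend the same check to the whole gene.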