Monarch geneset OGS2.0

DPOGS206789
TranscriptDPOGS206789-TA1899 bp
ProteinDPOGS206789-PA632 aa
Genomic positionDPSCF300001 - 5015371-5025184
RNAseq coverage272x (Rank: top 40%)
Annotation
HeliconiusHMEL0078020.083.24% 
BombyxBGIBMGA013299-TA2e-12651.88% 
DrosophilaGyk-PA3e-16564.42% 
EBI UniRef50UniRef50_Q7SZS62e-15059.91%Gk2-prov protein n=14 Tax=Coelomata RepID=Q7SZS6_XENLA
NCBI RefSeqNP_001108335.10.088.60%glycerol kinase [Bombyx mori]
NCBI nr blastpgi|1688234020.088.60%glycerol kinase [Bombyx mori]
NCBI nr blastxgi|1688234020.088.60%glycerol kinase [Bombyx mori]
Group
Gene OntologyGO:00060721.9e-222glycerol-3-phosphate metabolic process
GO:00043701.9e-222glycerol kinase activity
GO:00167731.9e-222phosphotransferase activity, alcohol group as acceptor
GO:00059751.9e-222carbohydrate metabolic process
KEGG pathwayaag:AaeL_AAEL0048530.0 
 K00864 (E2.7.1.30, glpK)maps-> Plant-pathogen interaction
    Glycerolipid metabolism
    PPAR signaling pathway
InterPro domain[9-632] IPR0059991.9e-222Glycerol kinase
[9-632] IPR0005771.9e-222Carbohydrate kinase, FGGY
[12-268] IPR0184849.7e-68Carbohydrate kinase, FGGY, N-terminal
[277-422] IPR0184853.6e-42Carbohydrate kinase, FGGY, C-terminal
Orthology groupMCL10134 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206789-TA
ATGTCTTTTGGATTTGGAAAGTTCGGACCATTAGTTGGGGCTATTGACGAGGGCACCTCAAGTGCGCGTTTTATTATTTTTAAAGCTAACTCTTGTGAAGTGGTGGCATCATACCAAAAGGAGCTAGAACAATATTTTCCACAGGAAGGATGGGTTGAACAGGATCCCAATGCTATTCTAGAAGTTGTTACCATATGTATTACAAAAGCTGTAGAGCAATTAGTATCATTAGGCGGGAAACCCCAGGACATAATAGCTGTTGGTGTTACAAATCAACGGGAAACTACAATTGTGTGGGATAAATACACTGGAAGGCCATTACACAATGCCATAGTGTGGCTTGACATGAGAACTTCATCTACCATTGAAAAGCTTCTTGACACAGTGCCTAACAAGACTAGGAATAAGAATTATTTGAAGCCGTTGTGTGGTCTCCCATTGTCACCGTACTTCAGTGCCGTCAAGCTGAGATGGCTAAGCGACAATGTTGACGTAGTGAAACAAGCTATGAAAAAAGGGACATGCTATTTTGGTACCGTTGATTCGTGGATTATATGGAATTTAACAGGCGGTCCAAACGGCGGCAAGCACGTCACTGATGTATCGAATGCCTCGCGAACAATGCTCATGAATATAGAAAATTTAAAATGGGATCCTTTGCTGATTAAGTTCTTTGAAGTAACAAAATCTGTTTTGCCAGAAATCAAATCTAGTTCTGAGATATACGGCCGTGTGGCAGATGGTCCTCTCAAAGGCGTACCTATATCTGGTTGTTTGGGGGATCAGCAAGCAGCTCTAGTGGGACAGATGTGTCTGCAGAAGGGTCAGGCAAAGGCTACGTACGGTACTGGCTGCTTTGTCCTATACAATACAGGAGATATTCGCGTGAACTCCAGCAGGGGGTTGCTCACAACGGTTGCATACCAGCTGGGTAGCTATAATCCACCTTGCTACGCTCTAGAGGGCTCTGTAGCTGTAGCTGGGGCCGCTTTAGGTTGGCTAAAAGACAACATAGGCATGATAGAGAACGCCAAGAAATCACAGGAAATTGCTGAGAAAGCCACAGATAATGGCAGTGTCGTTTTTGTACCGGCTTTTAGTGGGTTATACGCCCCATATTGGAGACAAGATGCTAGAGGCGTTATCTGTGGTATAACAGAAGATACAAACTCAAACCACATCGTCAAAGCTTCTCTAGAAGCGGTTTGTTTCCAAGTCCGCGACATACTAGACGCTATGAATGAAGACTGCGGCATTCCCTTACAATTGTTGAAGTTCGGCATCAATTGCCAAAAATTTATTGGTTGGAGTTGTCAAATGACTTACAACTGCAGCGTAGACTGCGGTTGGATCGTTAACTGCAAACAGAGATTGCTAGAAACTGGCAATGTACCTACGACAGAAAACAAAATATCACAGGCTAAGACATCGGAAGATAAAGCAAACAAAGAAGCAGATGAATCGGATTCAGTTGCCACAGAAATTATTAATTTACCTTCGTCTGCCACAGTAGCTGTTCCATCTTCAGCTAAAACATCGATTGATAAAGTCTTAGATACTTCAGCCAACAAGACAGCGTTCGCAGCTGGTCCAGATTCAGCAATCACAACGACTCCAGCCGCTAAACCAGTTGATGGGGGTATGACGTCGAATGATCTTGTAATGCAGATGCAGGCTGATCTTATTGGCATAAACGTTATAAAGGCCGGTTTCACAGAGAGCACAGCGTTGGGGGCAGCCCTGGTCGCGTTCTGGGGCGTCGAGAATAACAAAGCTGCAACCATAGCTATGACCAGTGGAGTCACTTACGTACCACAGATAAGTGAAGATGAGAGGGACATGAGATACAAACAGTGGAAAATGGCGGTCGAAAGATCATTGGGTTGGGAACAGAATTAA

Protein sequence:

>DPOGS206789-PA
MSFGFGKFGPLVGAIDEGTSSARFIIFKANSCEVVASYQKELEQYFPQEGWVEQDPNAILEVVTICITKAVEQLVSLGGKPQDIIAVGVTNQRETTIVWDKYTGRPLHNAIVWLDMRTSSTIEKLLDTVPNKTRNKNYLKPLCGLPLSPYFSAVKLRWLSDNVDVVKQAMKKGTCYFGTVDSWIIWNLTGGPNGGKHVTDVSNASRTMLMNIENLKWDPLLIKFFEVTKSVLPEIKSSSEIYGRVADGPLKGVPISGCLGDQQAALVGQMCLQKGQAKATYGTGCFVLYNTGDIRVNSSRGLLTTVAYQLGSYNPPCYALEGSVAVAGAALGWLKDNIGMIENAKKSQEIAEKATDNGSVVFVPAFSGLYAPYWRQDARGVICGITEDTNSNHIVKASLEAVCFQVRDILDAMNEDCGIPLQLLKFGINCQKFIGWSCQMTYNCSVDCGWIVNCKQRLLETGNVPTTENKISQAKTSEDKANKEADESDSVATEIINLPSSATVAVPSSAKTSIDKVLDTSANKTAFAAGPDSAITTTPAAKPVDGGMTSNDLVMQMQADLIGINVIKAGFTESTALGAALVAFWGVENNKAATIAMTSGVTYVPQISEDERDMRYKQWKMAVERSLGWEQN-