Monarch geneset OGS2.0

DPOGS208407
TranscriptDPOGS208407-TA1050 bp
ProteinDPOGS208407-PA349 aa
Genomic positionDPSCF300241 + 79158-84148
RNAseq coverage99x (Rank: top 61%)
Annotation
HeliconiusHMEL0050488e-8983.52% 
BombyxBGIBMGA004062-TA2e-13369.28% 
DrosophilaCG1271-PA5e-6440.36% 
EBI UniRef50UniRef50_E0W2J67e-8646.18%Glycerol kinase, putative n=11 Tax=Neoptera RepID=E0W2J6_PEDHC
NCBI RefSeqXP_002432590.11e-8646.18%glycerol kinase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420243482e-8546.18%glycerol kinase, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420243489e-8446.18%glycerol kinase, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00167731.7e-122phosphotransferase activity, alcohol group as acceptor
GO:00059751.7e-122carbohydrate metabolic process
KEGG pathwaytnp:Tnap_13826e-48 
 K00864 (E2.7.1.30, glpK)maps-> Plant-pathogen interaction
    Glycerolipid metabolism
    PPAR signaling pathway
InterPro domain[1-333] IPR0005771.7e-122Carbohydrate kinase, FGGY
[82-273] IPR0184855e-20Carbohydrate kinase, FGGY, C-terminal
[1-72] IPR0184843.3e-07Carbohydrate kinase, FGGY, N-terminal
Orthology groupMCL12351 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208407-TA
ATGACGGATGTTTCATCAGCGTCAGCGACGGGTTTCTTCGACCCTTACACCATGCAGTGGGCTGGGTGGGCGATGGCCATATTCGGTATACCCATGGAAGCCCTGCCGGAGGTAGTGGACACAGCCGGGGAACACTTCACCAGCACCGCCCCCGATATATGGGGGCATGCGATACCCATACGGAGCTGTATAGCTGATCAGACGGCTTCAATGTGGGGCTCGTGCTGCTTCAGCCCTGGTGACGTGAAGCTGACAATGGGTACTGGAACGTTCTTAAACATTCATACGGGAGCGTCACCTCACACCTCCTTGACCGGCCTATACCCTGTGGTTGGCTGGCGTATGGGAGACGAGCTGGTGTTCTCCGCTGAAGGCGCCAATAATGACACCGCCAGTATCATAAAATGGGCACAGAATTTGGGTCTTTTTGACAATCCTCAAGAAACAGCTGACATAGCGATGTCTGTGCCGGATTCAGACGGAGTGTTCTTCATACCGGCGTTTAGTGGTTTGGGTCCTCCCTACAACGACTGTATTATCTCATCAGCAATACTAATTTATTATCGAATACCAAATCGACGTACTCGTCGGACAGCATTATTTAATACACTGCCGATGAGCACAAAATATGGAAGCAACTGCTTAATGCCAGTATTTTCAAATACATTATGTTTTGACTTAATAATGTTATTATTTAGATTGGATGGCGGCGTCTCCAACAATGACTTCTTATCCCAGCTGGTTGCGGATCTGACCGGACTTACAGTGGAGCGTCCCGTACAGGTCGAGATGTCTTCATTAGGATGCGCCCACATTGTAGGATTACAGCTAGGTATCTTCACATCAAAGGAACAGCTGAAGTCACTCCGGAAGGTTGGTAAATTGTTCACTCCTAGGGCTCATGTGAAAAAATCCTACGAACCGATCATAGAGAAATGGGAAGATGCCGTCAAAAGGATGTGCGGTTGGTACAACAATGACAGAACAACACAGAGTAACACGCAGAACAATTTGAAGGTAAAACTAAAAAAACAGGGCAAATCCAAATAG

Protein sequence:

>DPOGS208407-PA
MTDVSSASATGFFDPYTMQWAGWAMAIFGIPMEALPEVVDTAGEHFTSTAPDIWGHAIPIRSCIADQTASMWGSCCFSPGDVKLTMGTGTFLNIHTGASPHTSLTGLYPVVGWRMGDELVFSAEGANNDTASIIKWAQNLGLFDNPQETADIAMSVPDSDGVFFIPAFSGLGPPYNDCIISSAILIYYRIPNRRTRRTALFNTLPMSTKYGSNCLMPVFSNTLCFDLIMLLFRLDGGVSNNDFLSQLVADLTGLTVERPVQVEMSSLGCAHIVGLQLGIFTSKEQLKSLRKVGKLFTPRAHVKKSYEPIIEKWEDAVKRMCGWYNNDRTTQSNTQNNLKVKLKKQGKSK-