DPGLEAN04019 in OGS1.0

New model in OGS2.0DPOGS215041 
Genomic Positionscaffold2418:- 8285-14922
See gene structure
CDS Length1167
Paired RNAseq reads  294
Single RNAseq reads  786
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005670 (2e-30)
Best Drosophila hit  ND
Best Human hitgalactokinase (2e-65)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_90953 [Branchiostoma floridae] (9e-106)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_90953 [Branchiostoma floridae] (2e-86)
GeneOntology terms









  
GO:0016301 kinase activity
GO:0046835 carbohydrate phosphorylation
GO:0005524 ATP binding
GO:0005737 cytoplasm
GO:0016310 phosphorylation
GO:0004335 galactokinase activity
GO:0006012 galactose metabolic process
GO:0016773 phosphotransferase activity, alcohol group as acceptor
GO:0008152 metabolic process
GO:0000166 nucleotide binding
GO:0016740 transferase activity
InterPro families






  
IPR019539 Galactokinase galactose-binding domain
IPR013750 GHMP kinase, C-terminal
IPR006204 GHMP kinase
IPR019741 Galactokinase, conserved site
IPR000705 Galactokinase
IPR020568 Ribosomal protein S5 domain 2-type fold
IPR014721 Ribosomal protein S5 domain 2-type fold, subgroup
IPR006206 Mevalonate/galactokinase
Orthology groupMCL15654

Nucleotide sequence:

ATGTCGGAAGCGGTCCCTAAAGGTGAAGAAGTTCTGCTTAAAGAGGCAGTCTCTAAGTTC
GTCTCCACATACAATCGCAAGCCGCTAGCGGCAGCTGCTGCCCCTGGTAGGGTCAATCTC
ATTGGAGAGCACGTTGACTATTGTGAAGGATTCGTGCTACCTGTGGCGTTACCATTTTTC
ACTGTAGTGGTGGGTGCATACAATTCCACTGATGAGTGCATGGTGCTGTCAATCCTTGCC
AGTGGACAAGAGGTCCAGACCAGCTTCTCTTCTACAGAATCATCTTCTTTGCAACCCGGG
GAGCCAGGGTGGGCTAACTATGTAAAGGGTGTGCTGGCCAATTTTCCAGAAAAGGTGAAA
GGTCTCGATGCGGTTATTGTATCAGACGTGCCAATGGGGTCCGGCGTCTCCAGCAGCGCT
TCATTGGAAGTAGCATTCTTCACGTTCCTTGAGGACCTCACTAAGATCACGGTTGATCCA
GTCAAAAAAGCTCAGTTGTGTCAGAAAGCTGAGCATGATTTTCCCGGAATGCCGTGCGGT
ATTATGGACCAGTTCATAGTGACTCTTGGAAAAAAAGATCACGCATTACTAATAGATTGC
AGGTCATTGGAGTCCAAACAGGTGCCAATGAAGTGTTCAGACGTCGTGCTGTTGGTTGTG
AATTCTAGTGTGAAGCATCAGCTAACCGGAAGCGAATACCCTCAGAGACGAGCGCAGTGT
CAGCAAGCGGCTGATGAATTGGGGAAACCCTCTTTAAGGAGCGCCACCATTCAAGATCTT
TCAAAACTGAAATGCGAGGAATTAGTTCTGAAACGTGCTAAGCATGTGGTCGAAGAGATC
ACCCGGACCGAGTTAGTCGCACAGCTTTTAGAGAGGAAAGATTATAAGGAGGTAGGGCGA
CTGTTCTATCAGTCCCACGAGTCCCTGAGCAAGCTGATGGAGGTTTCCTGTCCCGAGTTA
GACCAACTGGTTGATATCATGAGGTCATCGGACGGAGTGTTCGGCGCCAGAATGACGGGC
GGCGGCTTCGGGGGATGCGTCATAGCCTTAATAAAGAAGGAATGCTTGGCGTCTTTAAAG
AGCAAGGTCCGGTCGGAGTACAAAGGTAACCCAGTGTTCTTTGAGTGCGAGCCGAGTGAC
GGAGCGAGAATATTAAAGATAGGATAA

Protein sequence:

MSEAVPKGEEVLLKEAVSKFVSTYNRKPLAAAAAPGRVNLIGEHVDYCEGFVLPVALPFF
TVVVGAYNSTDECMVLSILASGQEVQTSFSSTESSSLQPGEPGWANYVKGVLANFPEKVK
GLDAVIVSDVPMGSGVSSSASLEVAFFTFLEDLTKITVDPVKKAQLCQKAEHDFPGMPCG
IMDQFIVTLGKKDHALLIDCRSLESKQVPMKCSDVVLLVVNSSVKHQLTGSEYPQRRAQC
QQAADELGKPSLRSATIQDLSKLKCEELVLKRAKHVVEEITRTELVAQLLERKDYKEVGR
LFYQSHESLSKLMEVSCPELDQLVDIMRSSDGVFGARMTGGGFGGCVIALIKKECLASLK
SKVRSEYKGNPVFFECEPSDGARILKIG