New model in OGS2.0 | DPOGS215041  |
---|---|
Genomic Position | scaffold2418:- 8285-14922 |
See gene structure | |
CDS Length | 1167 |
Paired RNAseq reads   | 294 |
Single RNAseq reads   | 786 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005670 (2e-30) |
Best Drosophila hit   | ND |
Best Human hit | galactokinase (2e-65) |
Best NR hit (blastp)   | hypothetical protein BRAFLDRAFT_90953 [Branchiostoma floridae] (9e-106) |
Best NR hit (blastx)   | hypothetical protein BRAFLDRAFT_90953 [Branchiostoma floridae] (2e-86) |
GeneOntology terms    | GO:0016301 kinase activity GO:0046835 carbohydrate phosphorylation GO:0005524 ATP binding GO:0005737 cytoplasm GO:0016310 phosphorylation GO:0004335 galactokinase activity GO:0006012 galactose metabolic process GO:0016773 phosphotransferase activity, alcohol group as acceptor GO:0008152 metabolic process GO:0000166 nucleotide binding GO:0016740 transferase activity |
InterPro families    | IPR019539 Galactokinase galactose-binding domain IPR013750 GHMP kinase, C-terminal IPR006204 GHMP kinase IPR019741 Galactokinase, conserved site IPR000705 Galactokinase IPR020568 Ribosomal protein S5 domain 2-type fold IPR014721 Ribosomal protein S5 domain 2-type fold, subgroup IPR006206 Mevalonate/galactokinase |
Orthology group | MCL15654 |
Nucleotide sequence:
ATGTCGGAAGCGGTCCCTAAAGGTGAAGAAGTTCTGCTTAAAGAGGCAGTCTCTAAGTTC
GTCTCCACATACAATCGCAAGCCGCTAGCGGCAGCTGCTGCCCCTGGTAGGGTCAATCTC
ATTGGAGAGCACGTTGACTATTGTGAAGGATTCGTGCTACCTGTGGCGTTACCATTTTTC
ACTGTAGTGGTGGGTGCATACAATTCCACTGATGAGTGCATGGTGCTGTCAATCCTTGCC
AGTGGACAAGAGGTCCAGACCAGCTTCTCTTCTACAGAATCATCTTCTTTGCAACCCGGG
GAGCCAGGGTGGGCTAACTATGTAAAGGGTGTGCTGGCCAATTTTCCAGAAAAGGTGAAA
GGTCTCGATGCGGTTATTGTATCAGACGTGCCAATGGGGTCCGGCGTCTCCAGCAGCGCT
TCATTGGAAGTAGCATTCTTCACGTTCCTTGAGGACCTCACTAAGATCACGGTTGATCCA
GTCAAAAAAGCTCAGTTGTGTCAGAAAGCTGAGCATGATTTTCCCGGAATGCCGTGCGGT
ATTATGGACCAGTTCATAGTGACTCTTGGAAAAAAAGATCACGCATTACTAATAGATTGC
AGGTCATTGGAGTCCAAACAGGTGCCAATGAAGTGTTCAGACGTCGTGCTGTTGGTTGTG
AATTCTAGTGTGAAGCATCAGCTAACCGGAAGCGAATACCCTCAGAGACGAGCGCAGTGT
CAGCAAGCGGCTGATGAATTGGGGAAACCCTCTTTAAGGAGCGCCACCATTCAAGATCTT
TCAAAACTGAAATGCGAGGAATTAGTTCTGAAACGTGCTAAGCATGTGGTCGAAGAGATC
ACCCGGACCGAGTTAGTCGCACAGCTTTTAGAGAGGAAAGATTATAAGGAGGTAGGGCGA
CTGTTCTATCAGTCCCACGAGTCCCTGAGCAAGCTGATGGAGGTTTCCTGTCCCGAGTTA
GACCAACTGGTTGATATCATGAGGTCATCGGACGGAGTGTTCGGCGCCAGAATGACGGGC
GGCGGCTTCGGGGGATGCGTCATAGCCTTAATAAAGAAGGAATGCTTGGCGTCTTTAAAG
AGCAAGGTCCGGTCGGAGTACAAAGGTAACCCAGTGTTCTTTGAGTGCGAGCCGAGTGAC
GGAGCGAGAATATTAAAGATAGGATAA
Protein sequence:
MSEAVPKGEEVLLKEAVSKFVSTYNRKPLAAAAAPGRVNLIGEHVDYCEGFVLPVALPFF
TVVVGAYNSTDECMVLSILASGQEVQTSFSSTESSSLQPGEPGWANYVKGVLANFPEKVK
GLDAVIVSDVPMGSGVSSSASLEVAFFTFLEDLTKITVDPVKKAQLCQKAEHDFPGMPCG
IMDQFIVTLGKKDHALLIDCRSLESKQVPMKCSDVVLLVVNSSVKHQLTGSEYPQRRAQC
QQAADELGKPSLRSATIQDLSKLKCEELVLKRAKHVVEEITRTELVAQLLERKDYKEVGR
LFYQSHESLSKLMEVSCPELDQLVDIMRSSDGVFGARMTGGGFGGCVIALIKKECLASLK
SKVRSEYKGNPVFFECEPSDGARILKIG