DPGLEAN13217 in OGS1.0

New model in OGS2.0DPOGS204601 
Genomic Positionscaffold2948:- 8619-13624
See gene structure
CDS Length1041
Paired RNAseq reads  173
Single RNAseq reads  514
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001613 (9e-134)
Best Drosophila hit  CG9232 (4e-117)
Best Human hitgalactose-1-phosphate uridylyltransferase (2e-117)
Best NR hit (blastp)  PREDICTED: similar to ENSANGP00000017622 [Nasonia vitripennis] (4e-130)
Best NR hit (blastx)  PREDICTED: similar to ENSANGP00000017622 [Nasonia vitripennis] (2e-132)
GeneOntology terms








  
GO:0008270 zinc ion binding
GO:0005975 carbohydrate metabolic process
GO:0046872 metal ion binding
GO:0006012 galactose metabolic process
GO:0005829 cytosol
GO:0008108 UDP-glucose:hexose-1-phosphate uridylyltransferase activity
GO:0005625 soluble fraction
GO:0016779 nucleotidyltransferase activity
GO:0019691 UDP-glucose conversion
GO:0016740 transferase activity
InterPro families




  
IPR019779 Galactose-1-phosphate uridyl transferase, class I His-active site
IPR005849 Galactose-1-phosphate uridyl transferase, N-terminal
IPR005850 Galactose-1-phosphate uridyl transferase, C-terminal
IPR001937 Galactose-1-phosphate uridyl transferase, class I
IPR011151 Histidine triad motif
IPR011146 Histidine triad-like motif
Orthology groupMCL13681

Nucleotide sequence:

ATGGAATTCAATGTCACAGAACATCAGCACGTCCGCTACAATCCCTTAAAAGACCAGTGG
GTTTTGGTCTCACCACATCGCTGCAAACGTCCCTGGAGCGGGCAGACGGAACCAGAACCG
GAAGAACTCTCCGATGAGAAGAACCCATTGAAAGCTGGTGCAGTCAGAGCTAATGGACAG
AAAAATCCGAATTACACTTCCACGTACGTGTTCCCAAACGACTTCCCGGCTCTCCTGGAA
CGTGTCCCGGAACCGCCGCCGTCAGAACACCCTCTGTTCCAGATGTCCCAGGCGAAGGGA
ACTTGCAGGGTGATGTGCTTTCATCCGGATTCCAAAATGACGATATCGCTGATGACCGTG
GACGAAATACTGAGCGTCATCGAAGAATGGATACGACAAACCCAGGAGTTGGGTCGACGT
TACACCTGGGTGCAGGTCTTTGAGAACAAGGGCTCCGTCATGGGCTGCTCCAACCCTCAC
CCCCACTGTCAGATATGGGCCTCCAGCTATTTACCCGACGAGGGTAAAATTAAGGACAGG
TGTCAGAAAGAGTACTTCATCAAAAATGCTAGGCCGATGTTGATGGAGTACTTGGAGCAA
GAGCTGATGAGGAAGGAACGTATAGTCCTCGAGAACCAGTCTTGGGTGACCCTCGTCCCG
TACTGGGCTGTATGGCCGTACGAGACCTTACTTCTGCCGAAGCAGCACGTTCAGAGGATC
ACAGACCTGGACGAGGTTCAGAAGCAGGACCTGGCTATCATGATGAAAGAGCTGAACACC
AAATATGATAACTTATTCCAATGCAACTTCCCCTACAGTATGGGCTGGCATGGGGCTCCC
ACGGGTCCATCCGCTAAACCCGGGGACTCCCCGCACTGGGTGTTCCACGGCATCTATCTA
CCGCCACTCCTGAGATCGGCTAGTGTCAAAAAATTCATGGTGGGCTACGAACTGCTCGCA
CAACCACAAAGAGATTTAACACCCGAGCAAGCAGCGGAAAAACTAAGAGGATGCAGTCTA
GTACACTACAAATATGTGTAG

Protein sequence:

MEFNVTEHQHVRYNPLKDQWVLVSPHRCKRPWSGQTEPEPEELSDEKNPLKAGAVRANGQ
KNPNYTSTYVFPNDFPALLERVPEPPPSEHPLFQMSQAKGTCRVMCFHPDSKMTISLMTV
DEILSVIEEWIRQTQELGRRYTWVQVFENKGSVMGCSNPHPHCQIWASSYLPDEGKIKDR
CQKEYFIKNARPMLMEYLEQELMRKERIVLENQSWVTLVPYWAVWPYETLLLPKQHVQRI
TDLDEVQKQDLAIMMKELNTKYDNLFQCNFPYSMGWHGAPTGPSAKPGDSPHWVFHGIYL
PPLLRSASVKKFMVGYELLAQPQRDLTPEQAAEKLRGCSLVHYKYV