DPGLEAN15156 in OGS1.0

New model in OGS2.0DPOGS207165 
Genomic Positionscaffold7:- 977725-982597
See gene structure
CDS Length1551
Paired RNAseq reads  513
Single RNAseq reads  1249
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000621 (2e-66)
Best Drosophila hit  CG31915 (2e-111)
Best Human hitprocollagen galactosyltransferase 2 precursor (6e-91)
Best NR hit (blastp)  PREDICTED: similar to CG31915-PA [Apis mellifera] (1e-126)
Best NR hit (blastx)  PREDICTED: similar to Glycosyltransferase 25 family member [Tribolium castaneum] (2e-129)
GeneOntology terms

  
GO:0008475 procollagen-lysine 5-dioxygenase activity
GO:0005575 cellular_component
GO:0009103 lipopolysaccharide biosynthetic process
InterPro families  IPR002654 Glycosyl transferase, family 25
Orthology groupMCL10759

Nucleotide sequence:

ATGAATTTGGACTATCCGAAAGATAGGATATTCTTATGGTTCCGCAGTGACTACAACAGC
GATCATTCAGTTGATGTTTTGCGTGATTTTGTTAATAAATTCGGAACCTTATACAACAGA
GTACATCTTTCATATAACACGTCTAAACAAAAGTTCGACGATGAATTGTCACCGACCCAT
TGGAGCCATAGTCGATTTATGCATTTGATTAAGTGGAGGGAAATGGGAATTAAATTTGCG
AAGCGACAATGGGCCGATTATGTATTTATGCTGGACGCAGATGTGTTTCTCACGAACCCC
CAGACGCTACGGCATCTCATCCAAAAACAACTCCGTGTGGTGGCGCCAATGCTCGTCTCA
GATCGATATTACTCCAACTTCTGGTTGTCCGTTGACGACGACTTCAACTATCGTCTAAAT
CACGAGGATGAATTCTATCCATTATATGAGTACAACGAATTGTACATGGGATGTCATATA
GTTCCAGTGATATACGGGGCGGTACTAATGGATCTTCGATCAAAGAAATCGGACTATATA
ACCTATGATCCCTACAAAATAGTCGATTACTTGGGCCCGCTGCAGGACCACATTATATTT
GCCGTGAACGCCATGAGGAACAATATATCGCTACACATTTGCAACGACGATTTCTTCGGT
TACATCACCCGGCCAATAAAAGAAGGCGAACCACTTGAAAGGGATGTATTGCATCTCACC
AACCTGAAGCTGTCAGCGATTGCACGCAGCAAGCCGCTTCAATACCACTACAAGCTACAG
CGTTTTGTGTACTACCCTCCTTCTCTGGATTACCAAGTCGACAAGATTTATATGATCAAC
CTCGAACGAAGACCCGATAAAAGGAAGCTGATGGAACAGAGCTTCAAGGAATTGGGCATG
AATGTTACACGTGTTGAAGCTGTCGACGGCAAGAGCCTGGATCCGAAAAAACTTCAAAAT
ATGAACGTTACCTTGATGCCTGGATATGAAGATGCCTACTATAAACGCCCTATGACCTAC
GGCGAGATCGGATGTTTCTTGAGCCATTACAAGATTTGGGTCGAGGTTGCGGAGAGAAAC
TACAACAGAGTGTTGATTTTGGAAGACGACGTTAATTTCTTGCCTTATTTCAAGGAAAAC
TATGATACGATTATATGGGAGTCTTCTGTTCTGAAACATGATTTTATCTACCTCGGCCGC
AAAATTATGATGGATAAGGTTGAGATTAGAATGACGACGCATCTCACTAAGCCGCTGTAC
TCTTACTGGACCATTGGTTACATTATAACAAAATTGGGTGCTGAGAAACTGATCGAGGCT
AAACCTTTGAGCAAATTGTTACCAGTCGATGAGTTCCTACCTATTATGTTCGATCAACAT
CCAGATAAAAAATATAAAGAATTCTTCCCCAATCGGAATCTGAACGCGCTGGCTGCGAGT
CCGTCTATCATTTCACCGACACACTACACGGGCATGCCGGGATACATCAGTGACACTGAA
GATTCTATCCCTCTGTCTACTGAGCCCTGCACTGATAATCCAGAACTTTAA

Protein sequence:

MNLDYPKDRIFLWFRSDYNSDHSVDVLRDFVNKFGTLYNRVHLSYNTSKQKFDDELSPTH
WSHSRFMHLIKWREMGIKFAKRQWADYVFMLDADVFLTNPQTLRHLIQKQLRVVAPMLVS
DRYYSNFWLSVDDDFNYRLNHEDEFYPLYEYNELYMGCHIVPVIYGAVLMDLRSKKSDYI
TYDPYKIVDYLGPLQDHIIFAVNAMRNNISLHICNDDFFGYITRPIKEGEPLERDVLHLT
NLKLSAIARSKPLQYHYKLQRFVYYPPSLDYQVDKIYMINLERRPDKRKLMEQSFKELGM
NVTRVEAVDGKSLDPKKLQNMNVTLMPGYEDAYYKRPMTYGEIGCFLSHYKIWVEVAERN
YNRVLILEDDVNFLPYFKENYDTIIWESSVLKHDFIYLGRKIMMDKVEIRMTTHLTKPLY
SYWTIGYIITKLGAEKLIEAKPLSKLLPVDEFLPIMFDQHPDKKYKEFFPNRNLNALAAS
PSIISPTHYTGMPGYISDTEDSIPLSTEPCTDNPEL