DPGLEAN11113 in OGS1.0

New model in OGS2.0DPOGS204233 
Genomic Positionscaffold1204:+ 73466-77387
See gene structure
CDS Length2106
Paired RNAseq reads  703
Single RNAseq reads  1896
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007512 (2e-151)
Best Drosophila hit  UDP-glucose-glycoprotein glucosyltransferase (2e-95)
Best Human hitUDP-glucose:glycoprotein glucosyltransferase 1 precursor (6e-83)
Best NR hit (blastp)  PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum] (4e-148)
Best NR hit (blastx)  PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum] (4e-131)
GeneOntology terms




  
GO:0003980 UDP-glucose:glycoprotein glucosyltransferase activity
GO:0005783 endoplasmic reticulum
GO:0006486 protein amino acid glycosylation
GO:0005635 nuclear envelope
GO:0005737 cytoplasm
GO:0005791 rough endoplasmic reticulum
InterPro families  IPR009448 UDP-glucose:Glycoprotein Glucosyltransferase
Orthology groupMCL11265

Nucleotide sequence:

ATGAAAGCTACAATTTTGGGCATTATTTTAACTGTAGTAATTTCAATCTTTGGTAATGTT
GTTGCCAATGGTGACTTACCGAAACGAGAGGAAAGAAAGTCTAAAGGAGTTACTACATTT
ATTAGCGCCAAATGGGAAGCGACGCCAATTGTGCTAGAACTGGCGGAATATTTGTCTGCT
GAAAGTTCTGATTTATTTTGGTCTTATTTTGATGGCATTATTTCACTTAAATCAAGCTTA
GAGTCTTTGGAGACAGATAAACAAGTTTATGATGCTTGCATTGGAGTAGCAAGTACATTA
TTAGCTCCGGCACAGCTCCGTATGGCTAAGCTAGCCTTGTCCATGCATTTGACTTCACCA
GCAGTCCGCATGTTTGATCAGATTGCTACACAAAACGGTGCAAAAGAGTTGCCCTGTGAA
ACATTTGTGGCAATTGCATCAAGAAAAGTTTGTGATAATGATATCCTTAGGGACATTCTT
AAATCTACAGTCAAATTTGATCCAGAGGAGCATAGAATTGAAACATACCTATTAGACCAC
TCATATCCCAGCAGTGACAATAGAAGCCTCACAGCTATTCTATATGGAGAGCTGGGAAAC
TCTGACTTTTCAGCCAAACATAAAATATTATCTGGCTACGCTGATAAAGGTGTTATTAAC
TATGTGGTCAGATGGAACATAAAATCTAGAGGCAAGCCAAAACTTCGTCTGTCTGGATAT
GGAATTGAATTGCAATTGAAGAGTACAGAATACAAAAGTCAAGATGATACCACTCCTAAG
GAGACTGTAGATGATGCAGGAGTGCCCTCAGAAGAAGAAGACGAAAATGATCCCCAGAAC
CAAATAGATGGATTCAATTTTGGAAGACTAAAGAATTTATTTCCGGCACTTCGCACACCT
CTCGAGCGTTTCCGAAGACATCTCTCTGAAATGAGTGAAGAAATAGAGCCCCTTAAAGTA
TGGCAGATGCAAGCTCTGAGTATGCAGGCCGCTGCTGCTGTGATGGATGCACACGACGCG
GGCGGAGATGAGGCTCTTAAAGTGTTGATATCTCTAGCACAGAACTTCCCCATGCAGACT
AAATCGTTGATCCATGTGAATGTGCCCCGATCCTTCCGCGATGAAGTCCTGTACAATCAA
GACGTTTGGTCGTCATCTCTAGGGCTCCGGCCTGCGGAACCCTTGTTGCTCGTATCCGGG
GCTCAGTACGATGCTGACGAGGTCGACCTTATGGCCCTCTTAGCAGCGCTCAGAGAAGAC
ATAGGACCTATGAATACTCTGCATGCTTTGGGTCTGAACAGGAAGCTCATCAACAAGCTT
CTATCACTTGAACTCGGTGAGTCTTTCACTTGGGAAGAGTATGGCTTAGACATCCGTGAC
ACAGCCATCACCTGGCTCAACGATCTAGAGACAGACGATAGATACAGACGATGGCCATCT
TCATACATGGAACTCCTACGACCCACATATCCTGGTATGCTGAGGAACTTAAGAAGAAAT
ATATATAATTACGTGATAGTGATAGACCCAACATCACCGTCGTCCGCGCCCCCTTTAAAG
CTGGGTGAGACATTACTGAAACATGCTACGCCTGTACGAGTGGGCTTGGTACTGGCACCG
GGACGCGACTCCGCTCTGGGCACCGCACTAAGAAGCGCCTTCAACTATGTAGCACAGGAG
AGGAATTCTAACAAGGAGGCCTTCTATTTCCTTACACAGGTTCTCAATTCTCTTCAAGAA
GATGCTCTGAGTGTGGATCATATAAAAAAGTATCTGAAAAAGTATGCCAGTTCGAGCGCA
AATCTCGATGAAATCATTTCAGAGGAATCTGAATTCAACTTCGGACACCAACTGGCTGAG
GAGTTCGTGTCGAAGCTGGGAACTAATAAATTCCCTCAAGTGATAGTGAATGGCGTTCCT
CTGTACGATGAGGGCTCTGGTGCGTTGTCTTCGGTGGAACTGCTCGAGGAGGCGCTAGTG
ACGGCACTGTCGCGTCACACGGCGCGTCTACAGCGAGCCGTGTTTAGAGGGAACCTCGCA
GACTCCGACGACGCCGTAGAGTATCTCATGAAGCAGCCGCATATTGTGTCCAGGTTTGCC
GTTTAG

Protein sequence:

MKATILGIILTVVISIFGNVVANGDLPKREERKSKGVTTFISAKWEATPIVLELAEYLSA
ESSDLFWSYFDGIISLKSSLESLETDKQVYDACIGVASTLLAPAQLRMAKLALSMHLTSP
AVRMFDQIATQNGAKELPCETFVAIASRKVCDNDILRDILKSTVKFDPEEHRIETYLLDH
SYPSSDNRSLTAILYGELGNSDFSAKHKILSGYADKGVINYVVRWNIKSRGKPKLRLSGY
GIELQLKSTEYKSQDDTTPKETVDDAGVPSEEEDENDPQNQIDGFNFGRLKNLFPALRTP
LERFRRHLSEMSEEIEPLKVWQMQALSMQAAAAVMDAHDAGGDEALKVLISLAQNFPMQT
KSLIHVNVPRSFRDEVLYNQDVWSSSLGLRPAEPLLLVSGAQYDADEVDLMALLAALRED
IGPMNTLHALGLNRKLINKLLSLELGESFTWEEYGLDIRDTAITWLNDLETDDRYRRWPS
SYMELLRPTYPGMLRNLRRNIYNYVIVIDPTSPSSAPPLKLGETLLKHATPVRVGLVLAP
GRDSALGTALRSAFNYVAQERNSNKEAFYFLTQVLNSLQEDALSVDHIKKYLKKYASSSA
NLDEIISEESEFNFGHQLAEEFVSKLGTNKFPQVIVNGVPLYDEGSGALSSVELLEEALV
TALSRHTARLQRAVFRGNLADSDDAVEYLMKQPHIVSRFAV