New model in OGS2.0 | DPOGS204233  |
---|---|
Genomic Position | scaffold1204:+ 73466-77387 |
See gene structure | |
CDS Length | 2106 |
Paired RNAseq reads   | 703 |
Single RNAseq reads   | 1896 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007512 (2e-151) |
Best Drosophila hit   | UDP-glucose-glycoprotein glucosyltransferase (2e-95) |
Best Human hit | UDP-glucose:glycoprotein glucosyltransferase 1 precursor (6e-83) |
Best NR hit (blastp)   | PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum] (4e-148) |
Best NR hit (blastx)   | PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum] (4e-131) |
GeneOntology terms    | GO:0003980 UDP-glucose:glycoprotein glucosyltransferase activity GO:0005783 endoplasmic reticulum GO:0006486 protein amino acid glycosylation GO:0005635 nuclear envelope GO:0005737 cytoplasm GO:0005791 rough endoplasmic reticulum |
InterPro families   | IPR009448 UDP-glucose:Glycoprotein Glucosyltransferase |
Orthology group | MCL11265 |
Nucleotide sequence:
ATGAAAGCTACAATTTTGGGCATTATTTTAACTGTAGTAATTTCAATCTTTGGTAATGTT
GTTGCCAATGGTGACTTACCGAAACGAGAGGAAAGAAAGTCTAAAGGAGTTACTACATTT
ATTAGCGCCAAATGGGAAGCGACGCCAATTGTGCTAGAACTGGCGGAATATTTGTCTGCT
GAAAGTTCTGATTTATTTTGGTCTTATTTTGATGGCATTATTTCACTTAAATCAAGCTTA
GAGTCTTTGGAGACAGATAAACAAGTTTATGATGCTTGCATTGGAGTAGCAAGTACATTA
TTAGCTCCGGCACAGCTCCGTATGGCTAAGCTAGCCTTGTCCATGCATTTGACTTCACCA
GCAGTCCGCATGTTTGATCAGATTGCTACACAAAACGGTGCAAAAGAGTTGCCCTGTGAA
ACATTTGTGGCAATTGCATCAAGAAAAGTTTGTGATAATGATATCCTTAGGGACATTCTT
AAATCTACAGTCAAATTTGATCCAGAGGAGCATAGAATTGAAACATACCTATTAGACCAC
TCATATCCCAGCAGTGACAATAGAAGCCTCACAGCTATTCTATATGGAGAGCTGGGAAAC
TCTGACTTTTCAGCCAAACATAAAATATTATCTGGCTACGCTGATAAAGGTGTTATTAAC
TATGTGGTCAGATGGAACATAAAATCTAGAGGCAAGCCAAAACTTCGTCTGTCTGGATAT
GGAATTGAATTGCAATTGAAGAGTACAGAATACAAAAGTCAAGATGATACCACTCCTAAG
GAGACTGTAGATGATGCAGGAGTGCCCTCAGAAGAAGAAGACGAAAATGATCCCCAGAAC
CAAATAGATGGATTCAATTTTGGAAGACTAAAGAATTTATTTCCGGCACTTCGCACACCT
CTCGAGCGTTTCCGAAGACATCTCTCTGAAATGAGTGAAGAAATAGAGCCCCTTAAAGTA
TGGCAGATGCAAGCTCTGAGTATGCAGGCCGCTGCTGCTGTGATGGATGCACACGACGCG
GGCGGAGATGAGGCTCTTAAAGTGTTGATATCTCTAGCACAGAACTTCCCCATGCAGACT
AAATCGTTGATCCATGTGAATGTGCCCCGATCCTTCCGCGATGAAGTCCTGTACAATCAA
GACGTTTGGTCGTCATCTCTAGGGCTCCGGCCTGCGGAACCCTTGTTGCTCGTATCCGGG
GCTCAGTACGATGCTGACGAGGTCGACCTTATGGCCCTCTTAGCAGCGCTCAGAGAAGAC
ATAGGACCTATGAATACTCTGCATGCTTTGGGTCTGAACAGGAAGCTCATCAACAAGCTT
CTATCACTTGAACTCGGTGAGTCTTTCACTTGGGAAGAGTATGGCTTAGACATCCGTGAC
ACAGCCATCACCTGGCTCAACGATCTAGAGACAGACGATAGATACAGACGATGGCCATCT
TCATACATGGAACTCCTACGACCCACATATCCTGGTATGCTGAGGAACTTAAGAAGAAAT
ATATATAATTACGTGATAGTGATAGACCCAACATCACCGTCGTCCGCGCCCCCTTTAAAG
CTGGGTGAGACATTACTGAAACATGCTACGCCTGTACGAGTGGGCTTGGTACTGGCACCG
GGACGCGACTCCGCTCTGGGCACCGCACTAAGAAGCGCCTTCAACTATGTAGCACAGGAG
AGGAATTCTAACAAGGAGGCCTTCTATTTCCTTACACAGGTTCTCAATTCTCTTCAAGAA
GATGCTCTGAGTGTGGATCATATAAAAAAGTATCTGAAAAAGTATGCCAGTTCGAGCGCA
AATCTCGATGAAATCATTTCAGAGGAATCTGAATTCAACTTCGGACACCAACTGGCTGAG
GAGTTCGTGTCGAAGCTGGGAACTAATAAATTCCCTCAAGTGATAGTGAATGGCGTTCCT
CTGTACGATGAGGGCTCTGGTGCGTTGTCTTCGGTGGAACTGCTCGAGGAGGCGCTAGTG
ACGGCACTGTCGCGTCACACGGCGCGTCTACAGCGAGCCGTGTTTAGAGGGAACCTCGCA
GACTCCGACGACGCCGTAGAGTATCTCATGAAGCAGCCGCATATTGTGTCCAGGTTTGCC
GTTTAG
Protein sequence:
MKATILGIILTVVISIFGNVVANGDLPKREERKSKGVTTFISAKWEATPIVLELAEYLSA
ESSDLFWSYFDGIISLKSSLESLETDKQVYDACIGVASTLLAPAQLRMAKLALSMHLTSP
AVRMFDQIATQNGAKELPCETFVAIASRKVCDNDILRDILKSTVKFDPEEHRIETYLLDH
SYPSSDNRSLTAILYGELGNSDFSAKHKILSGYADKGVINYVVRWNIKSRGKPKLRLSGY
GIELQLKSTEYKSQDDTTPKETVDDAGVPSEEEDENDPQNQIDGFNFGRLKNLFPALRTP
LERFRRHLSEMSEEIEPLKVWQMQALSMQAAAAVMDAHDAGGDEALKVLISLAQNFPMQT
KSLIHVNVPRSFRDEVLYNQDVWSSSLGLRPAEPLLLVSGAQYDADEVDLMALLAALRED
IGPMNTLHALGLNRKLINKLLSLELGESFTWEEYGLDIRDTAITWLNDLETDDRYRRWPS
SYMELLRPTYPGMLRNLRRNIYNYVIVIDPTSPSSAPPLKLGETLLKHATPVRVGLVLAP
GRDSALGTALRSAFNYVAQERNSNKEAFYFLTQVLNSLQEDALSVDHIKKYLKKYASSSA
NLDEIISEESEFNFGHQLAEEFVSKLGTNKFPQVIVNGVPLYDEGSGALSSVELLEEALV
TALSRHTARLQRAVFRGNLADSDDAVEYLMKQPHIVSRFAV