DPGLEAN22287 in OGS1.0

New model in OGS2.0  DPOGS204233
Genomic Position  scaffold777:+ 245-6876
CDS Length  2142
Paired RNAseq reads  1594
Single RNAseq reads  3962
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007512 (0.0)
Best Drosophila hit  UDP-glucose-glycoprotein glucosyltransferase (0.0)
Best Human hit  UDP-glucose:glycoprotein glucosyltransferase 1 precursor (0.0)
Best NR hit (blastp)  PREDICTED: similar to UDP-glucose glycoprotein:glucosyltransferase [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC013545 [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0003980 UDP-glucose:glycoprotein glucosyltransferase activity
GO:0005783 endoplasmic reticulum
GO:0006486 protein amino acid glycosylation
GO:0005635 nuclear envelope
GO:0005737 cytoplasm
GO:0005791 rough endoplasmic reticulum
InterPro families
IPR009448 UDP-glucose:Glycoprotein Glucosyltransferase
IPR002495 Glycosyl transferase, family 8
Orthology group  MCL11265

Nucleotide sequence:

CCTGAGTTGGTACCGGCGTTGAACAAGTACGAGTGGGTGTTGAAGGCGTCCCGCGTGTTA
TGTGCTCGGAGCCTCAAGCTGCGTGGCTCTGAGCGAGCAGTCATACACAACGCCAGAGTT
ATAGGACCCTTCAACAAAGGAGAGAGCTTCTCCCTAGAAGACTTCGCACTGCTTGAGAGG
TACAGTAACCAAGTGTATGGAGACAAGCTATCCGAATTGTTACACCAGAACAAGAAGCTG
TCAAATAATGTTTTGGACGATGACGATGACATCACTGATATAAGCACAGATAACTATTTG
AAGGTTATATCAGTGCTTGCTTCGCGTAGTCCCCGTGTGCGCACGCCCTTACCGAGCGGA
TTACGAACGGATCATTCTGTTATAGAACTACCTCCTTTGTATGAGGACGAAGCGGCCGTT
GAAATAGTAGCCGTGGTGGACCCGGCGTCGGCGGCCGCTCAGCGCCTAGCTCCGCTGCTG
CTGGTGTTGCGACGCGTTGTCAACTGTCGCTTACAGTTGTTCCTCAACCCACAGGACAAG
AATTCTGACATGCCGCTTAAGAGTTTTTACCGCTACGTGTTGGAGCCGGAGCTACAATTC
AATAGCGCGGGTGCGCAGACGGGCGGTGCGATCGCGCGTTTCTCCCGCTTGCCGCACGCT
CCTCTTCTATCGCTGGAGCTGCGCACGCCACCCAATTGGCTGGTCGAGTGCGTGAAGTCT
GTATACGACTTGGATAATATACGCCTGGCCGATGTCGAGTCACTCGTTCACAGTGAGTTC
GAGTTGGAATACCTGCTTGTGGAAGGTCACGCGTGGGATACGTCTCTGGGCACGCCGCCT
CGCGGGTTACAACTCGTGCTGGGCACGAGACACCGACCAGACACAGTTGACACCATCGTG
ATGGCCAACCTCGGCTACTTCCAGCTCAAGGCCAACCCCGGTGCCTGGACGTTGCGTCTC
AGACCCGGCCGCTCTGACGATATTTACGAGATTGTCGGGCACGAAAACACTGACACCCCA
GCCGGCAGTAAAGACATCCAGGTCCTGATGAGTTCATTCCGGAGTCAAGTTATTAAATTG
AGGGTCACTAAGAAGGCGGATAAACAACACCTTGATCTTTTAGCTGAAAATGACGAAAAG
AACGCTGGTGGGATATGGAATTCTATTGCAAGTTCGTTCGGAGGTGGCGAAGAACAAGAA
GCGCAAGACGAGACTATCAACGTGTTCTCAGTAGCATCCGGTCACTTGTACGAACGTTTT
CTACGTATTATGATGCTGTCTGTACTAAAGAACACTAAGTCACCCGTGAAGTTCTGGTTC
TTAAAGAACTATCTCAGCCCCTCACTTAAGGACATCCTTCCATACATGGCGCAAGAGTAC
GGGTTCCAGTACGAGCTGGTACAGTACCAGTGGCCTCGCTGGCTGCAGCGGCAGCGTGAC
AGACAGCGGACCATCTGGGGGTACAAGATACTGTTCCTCGACGTGTTATTCCCATTGGAC
GTCAAGAAGATCATCTTTGTTGATGCTGATCAGATTGTTCGAGCTGATCTAAAGGAACTA
GTAGATTTGGATCTAGGCGGAGCTCCCTATGGATACACCCCGTTCTGTGACAGTAGAAAA
GAAATGGAAGGATTCAGGTTCTGGAAGCAAGGCTACTGGCGGAATCATCTCCAAGGTCGG
AGTTATCACATCAGTGCACTGTACGTGGTGGATCTGAAGCGTTTCAGACGAATCGCTGCC
GGCGACCGACTGAGGGGACAGTACCAGGCGCTCAGCCAGGACCCTAACAGTTTGTCAAAT
CTAGATCAAGATCTTCCCAACAATATGATTCACCAGGTGGCTATAAAGTCTCTGCCCCAG
GAATGGTTGTGGTGTGAGACCTGGTGCGATAATGAATCCAAGAAATACGCCAAGACCATT
GATTTGTGCAACAACCCTATGACGAAGGAGGCCAAGTTGTCAGCAGCTATGCGCATCGTG
CCTGAGTGGAGCGACTATGACAACGAGCTGAGAGCATTGCACGCCCGCGTCAGGCAGGGA
CACTACCAGGACGACACCGAACAGGAAATCGAGACTCATGAACATGAACAAGTCAGCAAA
GAAGATAAAACTGATAAAGCACAGGAACACACTGAGTTATGA

Protein sequence:

PELVPALNKYEWVLKASRVLCARSLKLRGSERAVIHNARVIGPFNKGESFSLEDFALLER
YSNQVYGDKLSELLHQNKKLSNNVLDDDDDITDISTDNYLKVISVLASRSPRVRTPLPSG
LRTDHSVIELPPLYEDEAAVEIVAVVDPASAAAQRLAPLLLVLRRVVNCRLQLFLNPQDK
NSDMPLKSFYRYVLEPELQFNSAGAQTGGAIARFSRLPHAPLLSLELRTPPNWLVECVKS
VYDLDNIRLADVESLVHSEFELEYLLVEGHAWDTSLGTPPRGLQLVLGTRHRPDTVDTIV
MANLGYFQLKANPGAWTLRLRPGRSDDIYEIVGHENTDTPAGSKDIQVLMSSFRSQVIKL
RVTKKADKQHLDLLAENDEKNAGGIWNSIASSFGGGEEQEAQDETINVFSVASGHLYERF
LRIMMLSVLKNTKSPVKFWFLKNYLSPSLKDILPYMAQEYGFQYELVQYQWPRWLQRQRD
RQRTIWGYKILFLDVLFPLDVKKIIFVDADQIVRADLKELVDLDLGGAPYGYTPFCDSRK
EMEGFRFWKQGYWRNHLQGRSYHISALYVVDLKRFRRIAAGDRLRGQYQALSQDPNSLSN
LDQDLPNNMIHQVAIKSLPQEWLWCETWCDNESKKYAKTIDLCNNPMTKEAKLSAAMRIV
PEWSDYDNELRALHARVRQGHYQDDTEQEIETHEHEQVSKEDKTDKAQEHTEL
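
Consistency check (sketch):

The nucleotide and protein sequences above can be cross-checked by translating the CDS in frame 0. The listing below is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` and the variable names are illustrative and are not part of the record. For a complete reading frame, the CDS length should equal three times the protein length plus one stop codon, which agrees with the values here: 3 x (713 + 1) = 2142.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the nucleotide sequence listed in this record.
first_line = "CCTGAGTTGGTACCGGCGTTGAACAAGTACGAGTGGGTGTTGAAGGCGTCCCGCGTGTTA"
print(translate(first_line))  # PELVPALNKYEWVLKASRVL
```

Passing the full nucleotide sequence (line breaks included) to `translate` should reproduce the protein sequence listed above.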