DPGLEAN20277 in OGS1.0

New model in OGS2.0  DPOGS211641
Genomic Position  scaffold1648:+ 4449-6612
CDS Length  2007
Paired RNAseq reads  1094
Single RNAseq reads  2341
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011693 (0.0)
Best Drosophila hit  peptide O-xylosyltransferase (4e-162)
Best Human hit  xylosyltransferase 2 (3e-126)
Best NR hit (blastp)  PREDICTED: similar to protein-O-xylosyltransferase [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to protein-O-xylosyltransferase [Tribolium castaneum] (6e-176)
GeneOntology terms
GO:0030158 protein xylosyltransferase activity
GO:0042732 D-xylose metabolic process
GO:0015012 heparan sulfate proteoglycan biosynthetic process
GO:0050650 chondroitin sulfate proteoglycan biosynthetic process
GO:0030206 chondroitin sulfate biosynthetic process
GO:0008375 acetylglucosaminyltransferase activity
GO:0016020 membrane
InterPro families  IPR003406 Glycosyl transferase, family 14
Orthology group  MCL11660

Nucleotide sequence:

ACAATGAATGTTTATGAGACAGGCCTTGATAAGTTTGTTGCACAGATTCCAAAAACTCCT
GAGAACAAACAAACATCAGTTAGAATTGTATTTCTCTTGACATTAAATGGAAGGGCACTT
AGACAAGTACATAGATTAATTAATTCTTTGTACAGAGAAAATCACTATTTTTATATACAT
GTTGATAAGAGACAAGACTATTTACATCGTAAGTTAACTGTATTAGAGAAGCAATTTCCA
AATATAAAATTAGCTAAAAAACAATATTCTACAATATGGGGTGGAGCCTCTCTTCTTACA
ATGTTATTGACATCCATGAAAGATATTTTGAAGAATGGATGGGAATGGGACTATGTCATT
AATTTAAGCGAAAGTGATTTCCCCATAAAGTCTCTAGAGGAACTTGAAAAATTTCTCTCC
GACAACAAAGGTTATAATTTTGTTAAATCTCATGGACGGGAAGTCCAGAGATTTATTAAG
AAACAGGGCCTCGACAAAACCTTCATAGAATGTGAGACACACATGTGGAGGGTGGGAGAG
AGGAAATTACCAAAAGGTATTGTTATAGATGGAGGAAGCGACTGGATAGCATTGTCACCA
GAGCTCGTGTCTTATGTTGTTGGTGAGCGTGATGAGCTTTTATCTGGCTTGGATGTTATA
TTTGAACACACATTACTACCAGCTGAATCTTATTTTCACACTGTATTAAGGAATTCCCGC
TTCTGTAATACATATGTGGATAACAATTTGCATGTAACAAATTGGAAAAGGAAACTGGGT
TGCAAATGCCAATATAAGCATGTTGTTGATTGGTGTGGTTGTTCTCCTAATGACTTTAAA
ACTGAAGACTGGCCGAGGATTCAGAACACACAGAGTAGACAGTTATTCTTTGCTAGAAAA
TTTGAGCCTATAATCAACCAAGAAATCATCACGAGAGTTGAGCAGTACATAGGATTTAAA
GACCATTATTTAATCCCTAATTTAGAGGCGTACTGGCAAAATATATATGATATAGAAGAT
TTAACAGCCAATACTGATGACACTTTACTCTCGCATGGGGGTAGCATAATTCGCCATAAT
TCAAAGATTTTAGCTCAAGAAAACTGCAATATTGAAATTAAAGAAATCATTGAAATTAAT
TTGTATAAATATGCAGATGTTTACAAAGGTAACCTTATACTGCACAAAGCGACAATCAAT
AACAATATGGAAGTGTTTCTGGAGACTTGGTACAAACCAAAGAAATTTCTCGATTTAGGC
ATTGAAAATCTTGATATGGAATATATAAAAGTATTTAAAGTTAGCTCAGATTATGATCAG
AAGGAAATGTTATTTCGGAATTTGGCAAATATTCTGGGTCCTTGGTCGGAGCCGGTATTG
CTTTATCAGTTCTCTGCATATGTAGATAAAAATATGGGAAACTTGACTCTAGTATGGTTA
GACCCAGCCGGTGTGATTGCGGATATAAATATAATTTCCCGAGATGAAAATAACTTAACC
AGTTTTATCAAACCTCACATCAAAGCACCTTTATTGCCTGGTGTCTGGAAAGTTGGCCTA
TTTGATAACACAAGTACTATTGCCGTTACTAAGTTCCTTATAACTCCTCTGGAATATTTC
TCTGGCAAAGAAATAACCCAACAGGAAGTTGGTTTAATACATGGGGGGTCACAAAATTCT
TATAAATATTTTGATAATTTTATGTTAAAGAGCAACAGCTTTATACCCAGTCGTGAGAAT
AGATTGTTATTAATGGAAATATCCAAGTTAAACACGAGAAGAATTAATAAGGACCTGATA
GAGTGGATTGACAGTCTACATACAGAGTTTTATAATATCTTGGGTTCTTGTGTATTTTTA
GCAGAAAATAAATCATTTGAAAAGATATCTTGTGGGAATTATAAATTTCAGCAATGCATC
CTTTCTGATTGGAGTTCTAAATCACCGGATCCAAAAGGCGTGGTAGGAAAGTTGGATAAA
AATACTGGCCGTTTAAAAAGGATGTGA

Protein sequence:

TMNVYETGLDKFVAQIPKTPENKQTSVRIVFLLTLNGRALRQVHRLINSLYRENHYFYIH
VDKRQDYLHRKLTVLEKQFPNIKLAKKQYSTIWGGASLLTMLLTSMKDILKNGWEWDYVI
NLSESDFPIKSLEELEKFLSDNKGYNFVKSHGREVQRFIKKQGLDKTFIECETHMWRVGE
RKLPKGIVIDGGSDWIALSPELVSYVVGERDELLSGLDVIFEHTLLPAESYFHTVLRNSR
FCNTYVDNNLHVTNWKRKLGCKCQYKHVVDWCGCSPNDFKTEDWPRIQNTQSRQLFFARK
FEPIINQEIITRVEQYIGFKDHYLIPNLEAYWQNIYDIEDLTANTDDTLLSHGGSIIRHN
SKILAQENCNIEIKEIIEINLYKYADVYKGNLILHKATINNNMEVFLETWYKPKKFLDLG
IENLDMEYIKVFKVSSDYDQKEMLFRNLANILGPWSEPVLLYQFSAYVDKNMGNLTLVWL
DPAGVIADINIISRDENNLTSFIKPHIKAPLLPGVWKVGLFDNTSTIAVTKFLITPLEYF
SGKEITQQEVGLIHGGSQNSYKYFDNFMLKSNSFIPSRENRLLLMEISKLNTRRINKDLI
EWIDSLHTEFYNILGSCVFLAENKSFEKISCGNYKFQQCILSDWSSKSPDPKGVVGKLDK
NTGRLKRM
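
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked by translating the CDS in frame 1 with the standard genetic code. The sketch below is not part of the original record. The function name translate and the constants are hypothetical, and it assumes the standard code (NCBI translation table 1) applies to this nuclear gene. Only the first and last lines of the nucleotide sequence are included to keep the example short.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(nucleotides: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(nucleotides.split()).upper()  # drop whitespace and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the nucleotide sequence, copied from the record.
FIRST_LINE = "ACAATGAATGTTTATGAGACAGGCCTTGATAAGTTTGTTGCACAGATTCCAAAAACTCCT"
LAST_LINE = "AATACTGGCCGTTTAAAAAGGATGTGA"

CDS_LENGTH = 2007                        # "CDS Length" field of the record
EXPECTED_RESIDUES = CDS_LENGTH // 3 - 1  # codons minus the terminal stop codon

if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(translate(LAST_LINE))
    print(EXPECTED_RESIDUES)
```

The translated fragments can be compared with the start and end of the protein sequence, and the residue count derived from the CDS length can be compared with the length of the listed protein.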