New model in OGS2.0 | DPOGS211641  |
---|---|
Genomic Position | scaffold1648:+ 4449-6612 |
See gene structure | |
CDS Length | 2007 |
Paired RNAseq reads   | 1094 |
Single RNAseq reads   | 2341 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011693 (0.0) |
Best Drosophila hit   | peptide O-xylosyltransferase (4e-162) |
Best Human hit | xylosyltransferase 2 (3e-126) |
Best NR hit (blastp)   | PREDICTED: similar to protein-O-xylosyltransferase [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to protein-O-xylosyltransferase [Tribolium castaneum] (6e-176) |
GeneOntology terms    | GO:0030158 protein xylosyltransferase activity GO:0042732 D-xylose metabolic process GO:0015012 heparan sulfate proteoglycan biosynthetic process GO:0050650 chondroitin sulfate proteoglycan biosynthetic process GO:0030206 chondroitin sulfate biosynthetic process GO:0008375 acetylglucosaminyltransferase activity GO:0016020 membrane |
InterPro families   | IPR003406 Glycosyl transferase, family 14 |
Orthology group | MCL11660 |
Nucleotide sequence:
ACAATGAATGTTTATGAGACAGGCCTTGATAAGTTTGTTGCACAGATTCCAAAAACTCCT
GAGAACAAACAAACATCAGTTAGAATTGTATTTCTCTTGACATTAAATGGAAGGGCACTT
AGACAAGTACATAGATTAATTAATTCTTTGTACAGAGAAAATCACTATTTTTATATACAT
GTTGATAAGAGACAAGACTATTTACATCGTAAGTTAACTGTATTAGAGAAGCAATTTCCA
AATATAAAATTAGCTAAAAAACAATATTCTACAATATGGGGTGGAGCCTCTCTTCTTACA
ATGTTATTGACATCCATGAAAGATATTTTGAAGAATGGATGGGAATGGGACTATGTCATT
AATTTAAGCGAAAGTGATTTCCCCATAAAGTCTCTAGAGGAACTTGAAAAATTTCTCTCC
GACAACAAAGGTTATAATTTTGTTAAATCTCATGGACGGGAAGTCCAGAGATTTATTAAG
AAACAGGGCCTCGACAAAACCTTCATAGAATGTGAGACACACATGTGGAGGGTGGGAGAG
AGGAAATTACCAAAAGGTATTGTTATAGATGGAGGAAGCGACTGGATAGCATTGTCACCA
GAGCTCGTGTCTTATGTTGTTGGTGAGCGTGATGAGCTTTTATCTGGCTTGGATGTTATA
TTTGAACACACATTACTACCAGCTGAATCTTATTTTCACACTGTATTAAGGAATTCCCGC
TTCTGTAATACATATGTGGATAACAATTTGCATGTAACAAATTGGAAAAGGAAACTGGGT
TGCAAATGCCAATATAAGCATGTTGTTGATTGGTGTGGTTGTTCTCCTAATGACTTTAAA
ACTGAAGACTGGCCGAGGATTCAGAACACACAGAGTAGACAGTTATTCTTTGCTAGAAAA
TTTGAGCCTATAATCAACCAAGAAATCATCACGAGAGTTGAGCAGTACATAGGATTTAAA
GACCATTATTTAATCCCTAATTTAGAGGCGTACTGGCAAAATATATATGATATAGAAGAT
TTAACAGCCAATACTGATGACACTTTACTCTCGCATGGGGGTAGCATAATTCGCCATAAT
TCAAAGATTTTAGCTCAAGAAAACTGCAATATTGAAATTAAAGAAATCATTGAAATTAAT
TTGTATAAATATGCAGATGTTTACAAAGGTAACCTTATACTGCACAAAGCGACAATCAAT
AACAATATGGAAGTGTTTCTGGAGACTTGGTACAAACCAAAGAAATTTCTCGATTTAGGC
ATTGAAAATCTTGATATGGAATATATAAAAGTATTTAAAGTTAGCTCAGATTATGATCAG
AAGGAAATGTTATTTCGGAATTTGGCAAATATTCTGGGTCCTTGGTCGGAGCCGGTATTG
CTTTATCAGTTCTCTGCATATGTAGATAAAAATATGGGAAACTTGACTCTAGTATGGTTA
GACCCAGCCGGTGTGATTGCGGATATAAATATAATTTCCCGAGATGAAAATAACTTAACC
AGTTTTATCAAACCTCACATCAAAGCACCTTTATTGCCTGGTGTCTGGAAAGTTGGCCTA
TTTGATAACACAAGTACTATTGCCGTTACTAAGTTCCTTATAACTCCTCTGGAATATTTC
TCTGGCAAAGAAATAACCCAACAGGAAGTTGGTTTAATACATGGGGGGTCACAAAATTCT
TATAAATATTTTGATAATTTTATGTTAAAGAGCAACAGCTTTATACCCAGTCGTGAGAAT
AGATTGTTATTAATGGAAATATCCAAGTTAAACACGAGAAGAATTAATAAGGACCTGATA
GAGTGGATTGACAGTCTACATACAGAGTTTTATAATATCTTGGGTTCTTGTGTATTTTTA
GCAGAAAATAAATCATTTGAAAAGATATCTTGTGGGAATTATAAATTTCAGCAATGCATC
CTTTCTGATTGGAGTTCTAAATCACCGGATCCAAAAGGCGTGGTAGGAAAGTTGGATAAA
AATACTGGCCGTTTAAAAAGGATGTGA
Protein sequence:
TMNVYETGLDKFVAQIPKTPENKQTSVRIVFLLTLNGRALRQVHRLINSLYRENHYFYIH
VDKRQDYLHRKLTVLEKQFPNIKLAKKQYSTIWGGASLLTMLLTSMKDILKNGWEWDYVI
NLSESDFPIKSLEELEKFLSDNKGYNFVKSHGREVQRFIKKQGLDKTFIECETHMWRVGE
RKLPKGIVIDGGSDWIALSPELVSYVVGERDELLSGLDVIFEHTLLPAESYFHTVLRNSR
FCNTYVDNNLHVTNWKRKLGCKCQYKHVVDWCGCSPNDFKTEDWPRIQNTQSRQLFFARK
FEPIINQEIITRVEQYIGFKDHYLIPNLEAYWQNIYDIEDLTANTDDTLLSHGGSIIRHN
SKILAQENCNIEIKEIIEINLYKYADVYKGNLILHKATINNNMEVFLETWYKPKKFLDLG
IENLDMEYIKVFKVSSDYDQKEMLFRNLANILGPWSEPVLLYQFSAYVDKNMGNLTLVWL
DPAGVIADINIISRDENNLTSFIKPHIKAPLLPGVWKVGLFDNTSTIAVTKFLITPLEYF
SGKEITQQEVGLIHGGSQNSYKYFDNFMLKSNSFIPSRENRLLLMEISKLNTRRINKDLI
EWIDSLHTEFYNILGSCVFLAENKSFEKISCGNYKFQQCILSDWSSKSPDPKGVVGKLDK
NTGRLKRM