DPGLEAN14317 in OGS1.0

New model in OGS2.0DPOGS213821 
Genomic Positionscaffold515:- 17985-24075
See gene structure
CDS Length1212
Paired RNAseq reads  60
Single RNAseq reads  179
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006776 (3e-119)
Best Drosophila hit  heparan sulfate 3-O sulfotransferase-B (2e-60)
Best Human hitheparan sulfate glucosamine 3-O-sulfotransferase 5 (3e-74)
Best NR hit (blastp)  PREDICTED: similar to Heparan sulfate glucosamine 3-O-sulfotransferase 5 (Heparan sulfate D-glucosaminyl 3-O-sulfotransferase 5) (Heparan sulfate 3-O-sulfotransferase 5) [Apis mellifera] (1e-147)
Best NR hit (blastx)  PREDICTED: similar to heparan sulfate sulfotransferase [Tribolium castaneum] (2e-126)
GeneOntology terms






  
GO:0005515 protein binding
GO:0006477 protein amino acid sulfation
GO:0008146 sulfotransferase activity
GO:0008467 [heparan sulfate]-glucosamine 3-sulfotransferase 1 activity
GO:0015015 heparan sulfate proteoglycan biosynthetic process, enzymatic modification
GO:0016740 transferase activity
GO:0046596 regulation of virion penetration into host cell
GO:0050819 negative regulation of coagulation
InterPro families  IPR000863 Sulfotransferase domain
Orthology groupMCL14226

Nucleotide sequence:

ATGGAGTCGCGGGCGCAGATGAATCAGTCGCTGCTGAACAAAAATCGTCTTGAAGCCAAC
TACAGGTCAAGCTGTTCGCAATCATTGCCGCTTGATATTCTATGGTTACCGCGGTCATCG
TTCCGGCCGGCGGAGGGCGAGCTGGCAGACTGCGTGCTGGTGGTGGGAGTGTCGCGGCCA
AAGTTAGCCGCGGCCCTCCTGTCCGTAACTCTTTTATCTCTGTTCCTCACCTTCCACGTG
CTCTATGACAGTGCCCTTTATAGCATACAAGCGGCTAGTATGGCGTCAGCTGCCAGCGAG
GGCCGCAAGATATTACAAAATCAAGAGAGCAGTCGCACAATAAGCCACCCTATGGCTCTA
AGAAAAAGGTTAAAAACACCTCGTGCCCCACGGCCAGCCCGTCGTCTCCCCCAGGCTCTT
ATTATAGGAGTTAGGAAATGCGGAACGCGAGCTCTTCTTGAGATGCTCTATTTACATCCT
ATGGTACAAAAAGCCTCCGGCGAAGTTCATTTTTTTGATCGCGATGAAAATTATGCTCTC
GGTCTAGAATGGTACAAGAGTAAAATGCCTCTTTCATTTAAGGGACAAATAACTATAGAA
AAAAGTCCCAGCTACTTCGTTACTCCCGAGGTACCAGAGCGAGTTCGTGCTATGAATTCG
TCAGTGAGGCTACTTCTTATTGTGCGAGAACCAGTAACTCGTGCAATATCGGACTATACC
CAGCTTCGTAGTCGAGCCACCCCTTCAGCTCCTACTGTTTCGTTGGTTGGACACCCCTTG
CCTGATACTGTTAAGCCTTTTGAACACCTTGCTTTGGCACCAGATGGTTCAATCAATGTT
GCGTATAGGCCAATAGCTATATCACTGTATCATGCATACTTTCATCGCTGGTTGGAAGTG
TTCCCCAGAGAACAGATTCTTGTTGTAAACGGAGATCAGCTGATTGAAGATCCAGTACCA
CAATTACGACGCATTGAGAAATTTCTTGGCCTTGAACATAAAATAGGAAGAAGAAACTTC
TACTTCAACGAAACTAAAGGATTCTACTGTTTGCGTAACGATACCACGGATAAGTGTTTG
CGAGAGACAAAAGGTCGCAAGCATCCCCGCGTAGACCCAGCGGTTGTCACAAAGTTACGT
AAGTTTTTTGTCCAACATAATCAACGTTTCTACGACCTGATCGGCGAAGATCTCGGCTGG
CCCGAGGATTGA

Protein sequence:

MESRAQMNQSLLNKNRLEANYRSSCSQSLPLDILWLPRSSFRPAEGELADCVLVVGVSRP
KLAAALLSVTLLSLFLTFHVLYDSALYSIQAASMASAASEGRKILQNQESSRTISHPMAL
RKRLKTPRAPRPARRLPQALIIGVRKCGTRALLEMLYLHPMVQKASGEVHFFDRDENYAL
GLEWYKSKMPLSFKGQITIEKSPSYFVTPEVPERVRAMNSSVRLLLIVREPVTRAISDYT
QLRSRATPSAPTVSLVGHPLPDTVKPFEHLALAPDGSINVAYRPIAISLYHAYFHRWLEV
FPREQILVVNGDQLIEDPVPQLRRIEKFLGLEHKIGRRNFYFNETKGFYCLRNDTTDKCL
RETKGRKHPRVDPAVVTKLRKFFVQHNQRFYDLIGEDLGWPED