New model in OGS2.0 | DPOGS213821  |
---|---|
Genomic Position | scaffold515:17985-24075 (- strand) |
CDS Length | 1212 |
Paired RNAseq reads   | 60 |
Single RNAseq reads   | 179 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006776 (3e-119) |
Best Drosophila hit   | heparan sulfate 3-O sulfotransferase-B (2e-60) |
Best Human hit | heparan sulfate glucosamine 3-O-sulfotransferase 5 (3e-74) |
Best NR hit (blastp)   | PREDICTED: similar to Heparan sulfate glucosamine 3-O-sulfotransferase 5 (Heparan sulfate D-glucosaminyl 3-O-sulfotransferase 5) (Heparan sulfate 3-O-sulfotransferase 5) [Apis mellifera] (1e-147) |
Best NR hit (blastx)   | PREDICTED: similar to heparan sulfate sulfotransferase [Tribolium castaneum] (2e-126) |
GeneOntology terms    | GO:0005515 protein binding; GO:0006477 protein amino acid sulfation; GO:0008146 sulfotransferase activity; GO:0008467 [heparan sulfate]-glucosamine 3-sulfotransferase 1 activity; GO:0015015 heparan sulfate proteoglycan biosynthetic process, enzymatic modification; GO:0016740 transferase activity; GO:0046596 regulation of virion penetration into host cell; GO:0050819 negative regulation of coagulation |
InterPro families   | IPR000863 Sulfotransferase domain |
Orthology group | MCL14226 |

Nucleotide sequence:
ATGGAGTCGCGGGCGCAGATGAATCAGTCGCTGCTGAACAAAAATCGTCTTGAAGCCAAC
TACAGGTCAAGCTGTTCGCAATCATTGCCGCTTGATATTCTATGGTTACCGCGGTCATCG
TTCCGGCCGGCGGAGGGCGAGCTGGCAGACTGCGTGCTGGTGGTGGGAGTGTCGCGGCCA
AAGTTAGCCGCGGCCCTCCTGTCCGTAACTCTTTTATCTCTGTTCCTCACCTTCCACGTG
CTCTATGACAGTGCCCTTTATAGCATACAAGCGGCTAGTATGGCGTCAGCTGCCAGCGAG
GGCCGCAAGATATTACAAAATCAAGAGAGCAGTCGCACAATAAGCCACCCTATGGCTCTA
AGAAAAAGGTTAAAAACACCTCGTGCCCCACGGCCAGCCCGTCGTCTCCCCCAGGCTCTT
ATTATAGGAGTTAGGAAATGCGGAACGCGAGCTCTTCTTGAGATGCTCTATTTACATCCT
ATGGTACAAAAAGCCTCCGGCGAAGTTCATTTTTTTGATCGCGATGAAAATTATGCTCTC
GGTCTAGAATGGTACAAGAGTAAAATGCCTCTTTCATTTAAGGGACAAATAACTATAGAA
AAAAGTCCCAGCTACTTCGTTACTCCCGAGGTACCAGAGCGAGTTCGTGCTATGAATTCG
TCAGTGAGGCTACTTCTTATTGTGCGAGAACCAGTAACTCGTGCAATATCGGACTATACC
CAGCTTCGTAGTCGAGCCACCCCTTCAGCTCCTACTGTTTCGTTGGTTGGACACCCCTTG
CCTGATACTGTTAAGCCTTTTGAACACCTTGCTTTGGCACCAGATGGTTCAATCAATGTT
GCGTATAGGCCAATAGCTATATCACTGTATCATGCATACTTTCATCGCTGGTTGGAAGTG
TTCCCCAGAGAACAGATTCTTGTTGTAAACGGAGATCAGCTGATTGAAGATCCAGTACCA
CAATTACGACGCATTGAGAAATTTCTTGGCCTTGAACATAAAATAGGAAGAAGAAACTTC
TACTTCAACGAAACTAAAGGATTCTACTGTTTGCGTAACGATACCACGGATAAGTGTTTG
CGAGAGACAAAAGGTCGCAAGCATCCCCGCGTAGACCCAGCGGTTGTCACAAAGTTACGT
AAGTTTTTTGTCCAACATAATCAACGTTTCTACGACCTGATCGGCGAAGATCTCGGCTGG
CCCGAGGATTGA

Protein sequence:
MESRAQMNQSLLNKNRLEANYRSSCSQSLPLDILWLPRSSFRPAEGELADCVLVVGVSRP
KLAAALLSVTLLSLFLTFHVLYDSALYSIQAASMASAASEGRKILQNQESSRTISHPMAL
RKRLKTPRAPRPARRLPQALIIGVRKCGTRALLEMLYLHPMVQKASGEVHFFDRDENYAL
GLEWYKSKMPLSFKGQITIEKSPSYFVTPEVPERVRAMNSSVRLLLIVREPVTRAISDYT
QLRSRATPSAPTVSLVGHPLPDTVKPFEHLALAPDGSINVAYRPIAISLYHAYFHRWLEV
FPREQILVVNGDQLIEDPVPQLRRIEKFLGLEHKIGRRNFYFNETKGFYCLRNDTTDKCL
RETKGRKHPRVDPAVVTKLRKFFVQHNQRFYDLIGEDLGWPED
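
The record lists a CDS length of 1212, and the nucleotide sequence begins with ATG and ends with TGA. A minimal sketch of how such a record could be sanity-checked is shown here, assuming the standard genetic code stop codons; the function name `check_cds` and its returned keys are hypothetical and are not part of the database.

```python
# Hypothetical helper: basic consistency checks between a CDS and its protein.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def check_cds(nucleotides: str, protein: str) -> dict:
    """Return simple consistency checks for a CDS/protein pair.

    Whitespace and line breaks are removed first, so the sequences can be
    pasted exactly as they are wrapped in the record.
    """
    seq = "".join(nucleotides.split()).upper()
    prot = "".join(protein.split()).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        # One codon per residue, plus one terminal stop codon.
        "protein_length_matches": len(seq) // 3 - 1 == len(prot),
    }


if __name__ == "__main__":
    # Toy input only; paste the full sequences from the record to check them.
    print(check_cds("ATGGAGTGA", "ME"))
```

For this record, a 1212-nucleotide CDS implies 404 codons, that is, a protein of 403 residues followed by one stop codon.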