New model in OGS2.0 | DPOGS210791  |
---|---|
Genomic Position | scaffold488:- 154317-156445 |
See gene structure | |
CDS Length | 804 |
Paired RNAseq reads   | 18 |
Single RNAseq reads   | 42 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007104 (4e-99) |
Best Drosophila hit   | heparan sulfate 3-O sulfotransferase-B (1e-98) |
Best Human hit | heparan sulfate glucosamine 3-O-sulfotransferase 3A1 (2e-83) |
Best NR hit (blastp)   | GL13007 [Drosophila persimilis] (1e-110) |
Best NR hit (blastx)   | AGAP000422-PA [Anopheles gambiae str. PEST] (5e-103) |
GeneOntology terms    | GO:0008467 [heparan sulfate]-glucosamine 3-sulfotransferase 1 activity GO:0001745 compound eye morphogenesis GO:0007476 imaginal disc-derived wing morphogenesis GO:0008587 imaginal disc-derived wing margin morphogenesis GO:0016348 imaginal disc-derived leg joint morphogenesis GO:0008407 bristle morphogenesis GO:0050768 negative regulation of neurogenesis GO:0007032 endosome organization GO:0048190 wing disc dorsal/ventral pattern formation GO:0007219 Notch signaling pathway GO:0007474 imaginal disc-derived wing vein specification GO:0051036 regulation of endosome size GO:0007040 lysosome organization GO:0008586 imaginal disc-derived wing vein morphogenesis |
InterPro families   | IPR000863 Sulfotransferase domain |
Orthology group | MCL10681 |
Nucleotide sequence:
ATGACATTAAGGAAACCGAACCTGGTTCCAACGAAGAGACTGCCGGATGCCTTGATTATA
GGTGTTAAGAAATGTGGAACGAGAGCGCTTTTGGAATTTTTAAGATTGCATCCAGACGTA
AGGGCTGCCGGATCCGAGGTTCATTTTTTCGACAAATTTTATCATAAGGGATTCGAATGG
TACAGGAACAGAATGCCACCAACTCTCGAGGGTCAGATTACTATGGAGAAGACACCTTCT
TATTGGGTAACACGATCGGCCCCGAAGCGTGTCTTCGCTATGAATCCAGCCGTCAAACTA
TTGGCCGTAGTCAGAGATCCTGTCACTAGAGCTATTAGTGACTACACCCAGTCCGCAAGT
AAGAGACCGAGCCTGCCTCGTTTTGAGGAGTTGGCGCTGATGGATAGCCCGTGGGGTTCC
GTAGTGGACACCTGGCCGCCCGTCCGACTGGGAATATACGCCAGACCTCTGAGACGTTGG
CTGAGAAGGTTCCCAAGGTCCAGGATACTTATCATCAGCGGAGAGAGACTCGTTGTGGAC
CCCGCCGCTGAAATGACTAGGGTTCAGGAATTCCTGAACCTCAAGCCAGTGATAACGGAA
AAACATTTCTACTTTAATTCCACCAAAGGGTTCCCATGCCTGCTCAAATCTGAGAGCCGG
TCAACTCCGCACTGCCTTGGCAAAACCAAAGGTAGAAATCATCCCTACATAGATCCAGTA
GCCTTAGAGAGACTGAGAGAATTCTATAGACCTCATAATGAGAGATTTTATGAACTATCT
GGTATAAATTTTGGTTGGCAGTAA
Protein sequence:
MTLRKPNLVPTKRLPDALIIGVKKCGTRALLEFLRLHPDVRAAGSEVHFFDKFYHKGFEW
YRNRMPPTLEGQITMEKTPSYWVTRSAPKRVFAMNPAVKLLAVVRDPVTRAISDYTQSAS
KRPSLPRFEELALMDSPWGSVVDTWPPVRLGIYARPLRRWLRRFPRSRILIISGERLVVD
PAAEMTRVQEFLNLKPVITEKHFYFNSTKGFPCLLKSESRSTPHCLGKTKGRNHPYIDPV
ALERLREFYRPHNERFYELSGINFGWQ