New model in OGS2.0 | DPOGS202663  |
---|---|
Genomic Position | scaffold539:+ 39027-40766 |
See gene structure | |
CDS Length | 1506 |
Paired RNAseq reads   | 59 |
Single RNAseq reads   | 216 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001291 (5e-177) |
Best Drosophila hit   | CG12014 (7e-115) |
Best Human hit | iduronate 2-sulfatase isoform a precursor (5e-108) |
Best NR hit (blastp)   | PREDICTED: similar to iduronate 2-sulfatase [Tribolium castaneum] (2e-137) |
Best NR hit (blastx)   | PREDICTED: similar to iduronate 2-sulfatase [Tribolium castaneum] (3e-134) |
GeneOntology terms    | GO:0004423 iduronate-2-sulfatase activity GO:0008152 metabolic process |
InterPro families    | IPR017850 Alkaline-phosphatase-like, core domain IPR000917 Sulfatase IPR017849 Alkaline phosphatase-like, alpha/beta/alpha |
Orthology group | MCL10794 |
Nucleotide sequence:
ATGTTTGTACTAATAATTTGTTATATTTTAATTTTCTCTAAAATACAATGTTCAGCAAAG
AACATAATAAGAAACAATGTGTTGATTATAATCATTGACGACATGCGATACCTTTCAGAG
GAAGTTTACTTGCCAAATTTTCAAAAATTGGCGGCAAAAGGAATCACATTTCAAAAGGCT
TTTGCACAACAAGCATTGTGTGCACCAAGTAGAAATTCTATTTTAACCGGTCGTCGACCA
GATGAGTTACGCTTGTATGACTTTTATAATTATTGGCGCGACACTGTTGGAAATTTTTCT
ACATTTCCTCAAATATTCAAGGAACACGGATACGATACGTACTCAGCTGGAAAAATATTC
CACCCAGGAAAGAGTTCCAATTTTACGGACGACTATCCTTATAGCTGGACACTAAAACCT
TATCATCCTCCAACCGAAAAATATAAAGACGATGCATTGTGTAAAGATAGACATAGTATA
ACTTTACACAAAAATCTGATTTGTCCAATCAACGTTAAGGAACAGCCCGATAATACATTA
CCTGACCTCGAAACCCTCAAATACTCAATTGATATTATTAAAAATAGAAACCAAACTAAA
CCCTTCCTGCTAGCTGTCGGATTTCACAAGCCTCATATTCCTTTAAAATATCCTCATAAA
TACTTGAAAAATGTTCCAATTAGTTCAGTGAATCCGCCACGTGTGTCGTCTATCCCTAAG
GGTCTACCGCTGGTATCTTGGCATCCTTGGACGGATGTCCGGCGAAGAGATGACATTAAG
AAACTAAACCTTACTTTCCCATTTGGTATAATGCCTCCGAAATGGACGTTAAAGATAAGG
CAAAGTTATTATGCTGCGTCACTATACATAGATGATCTTTTGGGAAAACTTATGAGCCAT
GTAAATCAAACCAACACCATAATTGTTGTTACTAGTGATCATGGTTGGTCTTTGGGTGAA
AATGGACTTTGGGCAAAGTATAGCAACTTTGATGTCGCCCTGAGGGTGCCCTTGCTTTTT
AAAATACCCGGATTTCAGCCCAAGGTCATAACTAATCCTGTTGAATTGGTCGACATATAC
CCAACTTTACTTGAAGTGGGTTTAAATATATTTGTACCAAAATGTAAGAATAATGATGAT
AAATCCACTTTATGTTCGAGTGGAAAAAGTTTAGTACAATTAATGTCAAACAAACATAAT
ACTGGTAGATCATTTGCCATATCCCAGTATCCACGGCCACAGGTACAACCTACAAAAAGT
TCTGATAAACCAAAACTGAAAGATATAAAAATAATGGGTTATAGCATCCGAACGGAAAAA
TATAGATACACTGAATGGATATCATTTAATAATACACATTTCACTAGGAACTGGAATAAA
ATACACGGGATCGAACTATACAACCATGTTTATGATGACGAAGAATCAAATAATCTGTAC
CTAGTACCATATTATCAGGATATAAAAAAACAATTATCAGCATTACTGAGGTCAACAATA
AATTAG
Protein sequence:
MFVLIICYILIFSKIQCSAKNIIRNNVLIIIIDDMRYLSEEVYLPNFQKLAAKGITFQKA
FAQQALCAPSRNSILTGRRPDELRLYDFYNYWRDTVGNFSTFPQIFKEHGYDTYSAGKIF
HPGKSSNFTDDYPYSWTLKPYHPPTEKYKDDALCKDRHSITLHKNLICPINVKEQPDNTL
PDLETLKYSIDIIKNRNQTKPFLLAVGFHKPHIPLKYPHKYLKNVPISSVNPPRVSSIPK
GLPLVSWHPWTDVRRRDDIKKLNLTFPFGIMPPKWTLKIRQSYYAASLYIDDLLGKLMSH
VNQTNTIIVVTSDHGWSLGENGLWAKYSNFDVALRVPLLFKIPGFQPKVITNPVELVDIY
PTLLEVGLNIFVPKCKNNDDKSTLCSSGKSLVQLMSNKHNTGRSFAISQYPRPQVQPTKS
SDKPKLKDIKIMGYSIRTEKYRYTEWISFNNTHFTRNWNKIHGIELYNHVYDDEESNNLY
LVPYYQDIKKQLSALLRSTIN