DPGLEAN19214 in OGS1.0

New model in OGS2.0DPOGS202663 
Genomic Positionscaffold539:+ 39027-40766
See gene structure
CDS Length1506
Paired RNAseq reads  59
Single RNAseq reads  216
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001291 (5e-177)
Best Drosophila hit  CG12014 (7e-115)
Best Human hitiduronate 2-sulfatase isoform a precursor (5e-108)
Best NR hit (blastp)  PREDICTED: similar to iduronate 2-sulfatase [Tribolium castaneum] (2e-137)
Best NR hit (blastx)  PREDICTED: similar to iduronate 2-sulfatase [Tribolium castaneum] (3e-134)
GeneOntology terms
  
GO:0004423 iduronate-2-sulfatase activity
GO:0008152 metabolic process
InterPro families

  
IPR017850 Alkaline-phosphatase-like, core domain
IPR000917 Sulfatase
IPR017849 Alkaline phosphatase-like, alpha/beta/alpha
Orthology groupMCL10794

Nucleotide sequence:

ATGTTTGTACTAATAATTTGTTATATTTTAATTTTCTCTAAAATACAATGTTCAGCAAAG
AACATAATAAGAAACAATGTGTTGATTATAATCATTGACGACATGCGATACCTTTCAGAG
GAAGTTTACTTGCCAAATTTTCAAAAATTGGCGGCAAAAGGAATCACATTTCAAAAGGCT
TTTGCACAACAAGCATTGTGTGCACCAAGTAGAAATTCTATTTTAACCGGTCGTCGACCA
GATGAGTTACGCTTGTATGACTTTTATAATTATTGGCGCGACACTGTTGGAAATTTTTCT
ACATTTCCTCAAATATTCAAGGAACACGGATACGATACGTACTCAGCTGGAAAAATATTC
CACCCAGGAAAGAGTTCCAATTTTACGGACGACTATCCTTATAGCTGGACACTAAAACCT
TATCATCCTCCAACCGAAAAATATAAAGACGATGCATTGTGTAAAGATAGACATAGTATA
ACTTTACACAAAAATCTGATTTGTCCAATCAACGTTAAGGAACAGCCCGATAATACATTA
CCTGACCTCGAAACCCTCAAATACTCAATTGATATTATTAAAAATAGAAACCAAACTAAA
CCCTTCCTGCTAGCTGTCGGATTTCACAAGCCTCATATTCCTTTAAAATATCCTCATAAA
TACTTGAAAAATGTTCCAATTAGTTCAGTGAATCCGCCACGTGTGTCGTCTATCCCTAAG
GGTCTACCGCTGGTATCTTGGCATCCTTGGACGGATGTCCGGCGAAGAGATGACATTAAG
AAACTAAACCTTACTTTCCCATTTGGTATAATGCCTCCGAAATGGACGTTAAAGATAAGG
CAAAGTTATTATGCTGCGTCACTATACATAGATGATCTTTTGGGAAAACTTATGAGCCAT
GTAAATCAAACCAACACCATAATTGTTGTTACTAGTGATCATGGTTGGTCTTTGGGTGAA
AATGGACTTTGGGCAAAGTATAGCAACTTTGATGTCGCCCTGAGGGTGCCCTTGCTTTTT
AAAATACCCGGATTTCAGCCCAAGGTCATAACTAATCCTGTTGAATTGGTCGACATATAC
CCAACTTTACTTGAAGTGGGTTTAAATATATTTGTACCAAAATGTAAGAATAATGATGAT
AAATCCACTTTATGTTCGAGTGGAAAAAGTTTAGTACAATTAATGTCAAACAAACATAAT
ACTGGTAGATCATTTGCCATATCCCAGTATCCACGGCCACAGGTACAACCTACAAAAAGT
TCTGATAAACCAAAACTGAAAGATATAAAAATAATGGGTTATAGCATCCGAACGGAAAAA
TATAGATACACTGAATGGATATCATTTAATAATACACATTTCACTAGGAACTGGAATAAA
ATACACGGGATCGAACTATACAACCATGTTTATGATGACGAAGAATCAAATAATCTGTAC
CTAGTACCATATTATCAGGATATAAAAAAACAATTATCAGCATTACTGAGGTCAACAATA
AATTAG

Protein sequence:

MFVLIICYILIFSKIQCSAKNIIRNNVLIIIIDDMRYLSEEVYLPNFQKLAAKGITFQKA
FAQQALCAPSRNSILTGRRPDELRLYDFYNYWRDTVGNFSTFPQIFKEHGYDTYSAGKIF
HPGKSSNFTDDYPYSWTLKPYHPPTEKYKDDALCKDRHSITLHKNLICPINVKEQPDNTL
PDLETLKYSIDIIKNRNQTKPFLLAVGFHKPHIPLKYPHKYLKNVPISSVNPPRVSSIPK
GLPLVSWHPWTDVRRRDDIKKLNLTFPFGIMPPKWTLKIRQSYYAASLYIDDLLGKLMSH
VNQTNTIIVVTSDHGWSLGENGLWAKYSNFDVALRVPLLFKIPGFQPKVITNPVELVDIY
PTLLEVGLNIFVPKCKNNDDKSTLCSSGKSLVQLMSNKHNTGRSFAISQYPRPQVQPTKS
SDKPKLKDIKIMGYSIRTEKYRYTEWISFNNTHFTRNWNKIHGIELYNHVYDDEESNNLY
LVPYYQDIKKQLSALLRSTIN