DPGLEAN12787 in OGS1.0

New model in OGS2.0  DPOGS209387
Genomic Position  scaffold151:- 27753-31253
CDS Length  1554
Paired RNAseq reads  774
Single RNAseq reads  1771
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005529 (6e-178)
Best Drosophila hit  CG14291 (1e-148)
Best Human hit  N-sulphoglucosamine sulphohydrolase precursor (1e-147)
Best NR hit (blastp)  PREDICTED: similar to ENSANGP00000024797 [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to ENSANGP00000024797 [Nasonia vitripennis] (1e-180)
GeneOntology terms
GO:0016250 N-sulfoglucosamine sulfohydrolase activity
GO:0008484 sulfuric ester hydrolase activity
GO:0008152 metabolic process
InterPro families
IPR017850 Alkaline-phosphatase-like, core domain
IPR017849 Alkaline phosphatase-like, alpha/beta/alpha
IPR000917 Sulfatase
Orthology group  MCL14463

Nucleotide sequence:

ATGGCTCGACACCCGGTCGCTGTTATCACACTGTTATTGTGTCTCATTATCACGAACACT
GTGTTATCAAACAAGAATCGCAACGTTCTCATACTCTTAGCTGACGATGGAGGTTTTGAA
ATCGGAGCGTATAGAAACAAAATTTGCCAAACTCCGAACATCGACGAGTTTGCGAAGCGC
AGCGTCATCTTCAACAATGCCTTCAGTTCCGTCAGTAGCTGCTCTCCGAGTCGCGCGGCT
CTGCTGACCGGCACCCCGAGTCATCAGAACGGCATGTACGGGCTCCATCACGGCGTTCAT
CACTTCAACTCCTTCGATAACGTCACCAGCTTACCGAACATACTGCGCGAGCACGGAGTC
TACACTGGTATAATCGGTAAGAAGCACGTGGGTCCGAGCAGCGTGTACAAGTTCGACATG
GAGTGGACGGAGGAGGGACACAGCATCAACCAGGTCGGAAGGAACATCACGCACATGAAG
CTGCTGGCCAGGAAGTTCCTGCGGGAGGCGAACAGACTCGACAAACCGTTCCTGCTGTAC
GTCGGGTTCCACGACCCTCACCGCTGCGGCCACGAGGCTCCTCAGTACGGCCCCTTCTGT
GAGAGGTTCGGTTCCTCGGAGGAGGGGATGGGGGTCATCCCCGACTGGAAGCCCTGGTAC
TATCAGTGGGACGAGGTGCAGCTGCCTTATCATGTCCAGGACACTGAGGCGGCCCGGAGA
GACATCGCGGCGCAGTACACCACTATGTCCAGATTAGATCAAGGTGTGGGGCTCATGCTG
AAGGAGCTGGAGGCGGCGGGCCACGGACATGACACGCTGGTCATATACACCTCCGACAAC
GGGATTCCCTTCCCTTCGGGGAGGACCAACTTCTACGACCCCGGGCTGAGGGAACCTCTG
ATCATGCACTCGCCAGACCCTGGAGCTCGCAGAAACGAAGCCTCCGGCGCACTGGTGAGT
CTGCTGGACCTCACGCCCACTGTCCTCGACTGGTTCGGCGTCCGCACACCGCGACACATC
GACCACGAGTGGCGCGACCGGCCCAGGAGTCTGCTGCCCATATTGAACAAAGAGCCGCCG
CCGAGTGAGCAAGACGCCGTGTTCTCGTCCCAGACGCACCACGAGATCACCATGTACTAC
CCCATGAGGTCGGTCCGCACCCGTCGGTACAAACTCATCCACAACCTCAACTTCGGGATG
CCCTTCCCCATCGACCAGGACCTGTACGTGTCGCCGACCTTCCAGGATATATTGAACCGC
ACTCGAAGCAAGCAGCCTCTGCCGTGGTACAAGACTCTGAAGCAGTACTACTACCGACCG
CAGTGGGAGATGTACGACCTCAAGAACGACCCCTTGGAGACCCACAACCTGCACGGTAAG
CCGTCGCTGTCCGAGGTGGAGGCTTCTCTCAGGGAGCGGCTTCACTCGTGGCAGCTCGCT
ACCGGCGACCCCTGGCTCTGCTCGCCGGCCGCGGTCCGGGACCCGCGACCCGGGGCAAAC
TCCGCCGTCTGCGACGCGCTCGACAACGGCCTCACACACTACATGCACACCTAG

Protein sequence:

MARHPVAVITLLLCLIITNTVLSNKNRNVLILLADDGGFEIGAYRNKICQTPNIDEFAKR
SVIFNNAFSSVSSCSPSRAALLTGTPSHQNGMYGLHHGVHHFNSFDNVTSLPNILREHGV
YTGIIGKKHVGPSSVYKFDMEWTEEGHSINQVGRNITHMKLLARKFLREANRLDKPFLLY
VGFHDPHRCGHEAPQYGPFCERFGSSEEGMGVIPDWKPWYYQWDEVQLPYHVQDTEAARR
DIAAQYTTMSRLDQGVGLMLKELEAAGHGHDTLVIYTSDNGIPFPSGRTNFYDPGLREPL
IMHSPDPGARRNEASGALVSLLDLTPTVLDWFGVRTPRHIDHEWRDRPRSLLPILNKEPP
PSEQDAVFSSQTHHEITMYYPMRSVRTRRYKLIHNLNFGMPFPIDQDLYVSPTFQDILNR
TRSKQPLPWYKTLKQYYYRPQWEMYDLKNDPLETHNLHGKPSLSEVEASLRERLHSWQLA
TGDPWLCSPAAVRDPRPGANSAVCDALDNGLTHYMHT