DPGLEAN04402 in OGS1.0

New model in OGS2.0DPOGS211688 
Genomic Positionscaffold2681:+ 11480-15205
See gene structure
CDS Length1428
Paired RNAseq reads  12
Single RNAseq reads  40
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011204 (9e-11)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  glucose-1-phosphatase/inositol phosphatase [Serratia proteamaculans 568] (3e-81)
Best NR hit (blastx)  glucose-1-phosphatase/inositol phosphatase [Serratia odorifera 4Rx13] (9e-76)
GeneOntology terms


  
GO:0003993 acid phosphatase activity
GO:0008877 glucose-1-phosphatase activity
GO:0016787 hydrolase activity
GO:0042597 periplasmic space
InterPro families  IPR000560 Histidine phosphatase superfamily, clade-2
Orthology groupMCL16733

Nucleotide sequence:

ATGGAGTTTGCAATATATTTTTTAAATCTCGTTTTAATTGTTAGTTGCTATGATTTAAAG
CAAGTGGTTATATTGAGCAGGCACAACATAAGGAGCCCTTTGGCGAGTTTTTTGAAGAAG
TTCTCGCCTCATCCTTGGCCGGAATGGAATATAAGTGTTGGTTATTTGACAGAAAAAGGT
GCTACTATGGAAGAAGACATGGGTGAATATATGTCCACTTGGTTGTGCACTGAGCTCTTC
AAAGACAGCTGTCCCGAGGAGAGCTCCTTGCAAATATTCTCAAATTCTACTCAGAGAACT
TACGAATCATCGAAAGCGTTTATTCGTGGTACTTTCAAAAATTGCAATAAAGTTTTAAGA
GTTGGATCTGAGGAAATGGCGTCGTTGTTTGAAACTGTTGTCCGCAATGATTCAAAAGTG
ATGAAGGACCTTGTTCTTAACGAAATGAATACGAAAATAATGGAATTGGATACAAAAGAA
TCTTATAATTTATTGGAAGACATATTGGATATGAAAAATGCTGAAGTGTGCAAAATCGAG
GGCATATGCAACTTTGATAAAGAAGACAGCGAAATTACATATGAATTCGGTAATTTGCTG
AACGTCGAGGGCTCCTTGCTGTGGGCGAACCTGATAGTCGATTCGTTTCTTATGAGCTAC
TACGACGGATTTCAAATAGAAAACGTAGCTTGGGGAATGATCAAAGATTCTGGACAGTGG
CGGACGCTCACAAGACTGATGATACAGTATCAGCACGTTGTTTTTAACAGTAAGTTAGTA
GGGAGACAAGTGTCAAAACCTCTCCTTAGCTATATATCGTCTAAGTTTACGGCGGAAACA
GAAAAAAAATTCATTTCGCTTCATGCCCATGACGCAAATTTATATTTTGTTCTGGCGGCA
CTGGAAGTTGAGGAGTTTGTGTTGCCAGAGCAATATGAAAGGACACCGATAGGCGGGAAG
TTGGTGTTCCAGAGATGGCACGACGCTACACAGGGTAGAGATCTGTTTAAATTGAATTTT
GTGTATTTAACCGTAGATCAGATAAGAGATGGGTCCAAACTATCAGCTAGTAATCCCCCA
CGATGGGTGCAGCTGTTTTTCAAGGATTGTCCCGTAGACTCAGACGGGTTCTGTTCTTGG
GAAGATTTTGTTAATGTTCTAAATGATGCAGCCACAGCACTGGACGTTGAGGATTTTCTG
TTGCCAGAGCAATATGAAATCATACCAATAGGTGGGAAGTTGGTGTTCCAGAGATGGCAC
GACGCTACACAGGATAGAGATCTGTTTAAATTGGATTTTGTATATTTAACCGTAGATCAG
ATAAGAGATGGATCCAAACTATCAGCTAGTAATCCGCCTCGACGGGTGCAGCTTGTCATA
AAAGATTGTAGACTGAGATGGGTTCTGTTCTTGGGCAGAGTTTGTTAA

Protein sequence:

MEFAIYFLNLVLIVSCYDLKQVVILSRHNIRSPLASFLKKFSPHPWPEWNISVGYLTEKG
ATMEEDMGEYMSTWLCTELFKDSCPEESSLQIFSNSTQRTYESSKAFIRGTFKNCNKVLR
VGSEEMASLFETVVRNDSKVMKDLVLNEMNTKIMELDTKESYNLLEDILDMKNAEVCKIE
GICNFDKEDSEITYEFGNLLNVEGSLLWANLIVDSFLMSYYDGFQIENVAWGMIKDSGQW
RTLTRLMIQYQHVVFNSKLVGRQVSKPLLSYISSKFTAETEKKFISLHAHDANLYFVLAA
LEVEEFVLPEQYERTPIGGKLVFQRWHDATQGRDLFKLNFVYLTVDQIRDGSKLSASNPP
RWVQLFFKDCPVDSDGFCSWEDFVNVLNDAATALDVEDFLLPEQYEIIPIGGKLVFQRWH
DATQDRDLFKLDFVYLTVDQIRDGSKLSASNPPRRVQLVIKDCRLRWVLFLGRVC