DPGLEAN15073 in OGS1.0

New model in OGS2.0: DPOGS211074
Genomic Position: scaffold525:- 50137-60950
CDS Length: 3417
Paired RNAseq reads: 258
Single RNAseq reads: 706
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA002956 (0.0)
Best Drosophila hit: PH domain leucine-rich repeat protein phosphatase (4e-82)
Best Human hit: PH domain leucine-rich repeat-containing protein phosphatase 1 (2e-34)
Best NR hit (blastp): PREDICTED: similar to adenylate cyclase [Tribolium castaneum] (1e-151)
Best NR hit (blastx): PREDICTED: similar to adenylate cyclase [Tribolium castaneum] (2e-139)
GeneOntology terms:
GO:0006470 protein amino acid dephosphorylation
GO:0004722 protein serine/threonine phosphatase activity
GO:0005575 cellular_component
GO:0005515 protein binding
InterPro families:
IPR001932 Protein phosphatase 2C-like
IPR003591 Leucine-rich repeat, typical subtype
IPR001611 Leucine-rich repeat
IPR014045 Protein phosphatase 2C, N-terminal
Orthology group: MCL10707

Nucleotide sequence:

ATGGTCTGTGACGACGGCCTCACTCCATCGCCTCTTATCTCACGCAAATCGCTTCGAAGA
CTAGCATCAACAAGAGGATTCAAACCGCGCTCTGAATCAACCTGGATACGCGTATTTGAT
GGTCTTGAACCTTATGCTGTGGATGCGCCGAGCAAGCTCGTAAAAGTGTCCCCCTATACT
ACCGTCGAAGACATAAACAAGAAACTTGGCTTCAATGAAGAATTGACGCTATGGGTGCAG
ATAGGAGGAGAAAATTCTCGACGGCTGGAATTGAACGAGTTCCCATTCCAAATACAAGAG
AAGTTTTTAATTAACAATGGTTGGAAGTCAGAGGCTAGGCGACAGCGGCTTGCGGTAGAT
CCGGAGTTACGTCACAGTCTGCGCTGGTGTGCAGGACCTTCCAGTCGGTCTGGTGGTGTC
CTGCGGTCAGGCACTGTTTATGTTTTAAAAGGGCACGTGTTCCCACAATGGAAGCCCCGA
CAGGCCCACATTATTGGATCGCAATTACATACACACGGTGTGTCCTGGGATATGTTGGAG
CTCAGTGGAGGTAGTATTGAAATGTGTCAACCGAAAGCTCAGAAACTAGTCCTCTGCGTA
AAGCTTCTTTGTCAAGGCAATGGTGTACTTGACACGGGAGTCAATCATTTATTTCTGGGG
TTCAATACAATTTGGGAGCGTAATATGTGGTGCCGTTGGTTAAAAGAGTGTATTGTAATG
GTTGAGAGTAATAAAATGAAATGTGAAGACGAGGAGATAGACAGCTTGGATGTTTTCTCT
CAAGAAAATGAAGATGTGTTCTTGGATAGCTTGGAACCAGCTACGTATAGAGAGAAATAT
GAAAACTCGGGCACTCAAAGTACACAGCCACCAAATGTCTTAGACTTGAGCGGTGGCGGA
CGGTCCTGTCTGCCCGTAGCTCTAAACCAGCACGCCTCAGATGGGTTCGCGGTGAAAGTT
TTAAGGATGCGGAGCAACACTCTACCAGCCTTGCCTCCTCAAACATGCAATCTGATAGCC
CTAACTCATCTCGATGTGAGCGACAATAAAATCATTGAGCTACCAAAGGAGATTTCACAT
CTAACGCAATTAGAAGAAATAAACGTGAGCAACAACGAAATTAAGTCGCTGGACTGTCTC
TTACGACTTCCTCGTCTACGAACTGTCGTAGCTGCTAGAAACCTGATCACGCAATTCGGA
GTCAATGACACTAGCCAAATGGGTTTTCTAGAGGAAAATAAATCAGAATATCGTGCACCG
CTAACGAATGTCGACCTCCGATACAATAAACTAAAAGGAAGCATAATTCTTGGTAATTAT
GAGCATCTGGTGACTCTTGATGTCTCTCAAAACTCCATTGAGGTTTTGGTGCTTTCATCG
CTGCGTGGACTACGGGAGCTGTATGCTGCTCATAATTCTATCCAGCACTTGGCCTTGCAT
GGTGCTTCGTTACGAGTCCTACATGCTCCGTACAATAATATGGAGAATTTGACAACAATG
GTGCCACCAATAAATTTAGTGGAGATGAACCTGACATACAATAAATTATCATCTTTACCA
CAGTGGATCAGTGGTTGTTCAGATCTGACCAAACTCTTTGCAAAAATTGAAGAACTGGTT
CTTTCTGGTAATTCACTCTCGAAATTGCCAGACAATTTGCCACAGATGAATAACATAAAA
ATTGTGAGGGCACATTCAAATCGTCTTCGCTCAGTTCCAATGTTTGCTTGCAGTGCTAGC
GTTAAAATTCTAGACTTTGCTCATAACGAACTGGACAGCATTGATCTGCGTCTTTTAGCA
CCGAAGCAATTAAAATTTTTAGACATATCATGTAATAAGAAGTTACAAATGAATCCCTCG
CAGTTTAACGCTTATAAATGTCAACGACCTTTAAGCCTAGTTGATGTCACTGGACAACAT
GGAAATTCTTTATCGCAAAAAAATAATTTTCATGAAGAATTAAGTGGTGGGACCCCGTGG
GTAACTGGTTTTTCGGAATGTCCAAATAAAAAACTTCTTCTATCTTGTGCACAAATACGA
CTTCCATCGTTTTGTAACAAGGAAGGCTTATTTGCTATAATTGACGGGGAAACAGATATC
GAAGTCCCAAGAATACTACAGTCATGTCTTCCAGGACTACTACTTGAAGAAAAATCTATT
AAGGAAACAGTCAATGAATATATGAAATATGTTATACTAGCTGCACATAGAGAATTGAAA
CAAAAAGGACAGACAAAAGGTGCATGTCTTGTTATGTGTCACTTGTCTCCTATTAGTACC
CCCGATAACAGTTTTGGACAATCTATAAGACGATATAATATAAGATTAGCGAATGTCGGC
AATACAAAAGCAGTGTTAAGTCGTCGTAATGGCCCTTTATGTTTAGGTATAGATGATAAT
AAGCGATTAGGTTATTCTTCAAGATACCCAGTTAATGTACCTGATCCCGATATTATACAA
ACTGTAATTAAAGAAGACGATGAGTTTTTAATATTAGGAAACGCTAAATTTTGGGAATCC
GTTACAGTCGATACTGCAATATCAGAAGTGAGGGCTGAACGGAATCCAGTATTAGCAGCA
AAGAGATTACAAGATTTGGCTCAAAGTTATGGAATAGAAGATTGTATATCGGTGGTAATC
GTAAGATTTGATACAGTTCGTTCTGATGTAGATTTATTAATGCGAGAATTACGACATACG
ATCAACACAAACAAACCTGTATGTAATCCTGACTGCTGTTGCTCTCGTTTAGAACCATGT
TGCCATTCTATCTCACCACCAAAATCAAATAGCGATAGATCTTCTCCAAGTGGACAAAGC
GATCGACCTTCTAGTGAAACAGTTAGTCATCAACACTATGCCAGTGTACGTTCTCATAAT
AGGGCCTCAGAAAGAAAACCAAGAGGCGGAGTTGCACGAGCAATTCGAGTACGAGTTGAA
GAAGATAAAGAGACTGAAAAAATTATTGACGATGTTCCCTCTTCAGATGAACAATTCAAA
TGTTGGGAATATATGCTGGAGCAAAATACACAAATGATATTTGATAAAGAGCTAGATAAT
CTTTCAAAAGGTATCAAATCAAATTCAAGTAGTTTAAGAAATTTAAAGGGACTCTCAGGA
AGTAGTCCCCAACTACATCTAAATACGAAACAAACAAAACTACCGTTTCTCTCAAAACAT
TTCGGGAGTGCTAGATCTTTCGGTAGTAATATAAAGCCTGAGTTTCGTTTCGGTTCAGGA
AGAATGCCTAATGGTGGTCCAAATGCTGCTTACTTTGGTTCACTTCAAAGGTTAATGCCT
TATCATTTAGAATACGATTTCGCGGTTATTCAAGAAAAACAAACACAATCACAGGACTCT
CTTGATCTCGAGGGCCGGATGCAACAATATTGGGGAGTTGCAACAACTGAACTTTAA

Protein sequence:

MVCDDGLTPSPLISRKSLRRLASTRGFKPRSESTWIRVFDGLEPYAVDAPSKLVKVSPYT
TVEDINKKLGFNEELTLWVQIGGENSRRLELNEFPFQIQEKFLINNGWKSEARRQRLAVD
PELRHSLRWCAGPSSRSGGVLRSGTVYVLKGHVFPQWKPRQAHIIGSQLHTHGVSWDMLE
LSGGSIEMCQPKAQKLVLCVKLLCQGNGVLDTGVNHLFLGFNTIWERNMWCRWLKECIVM
VESNKMKCEDEEIDSLDVFSQENEDVFLDSLEPATYREKYENSGTQSTQPPNVLDLSGGG
RSCLPVALNQHASDGFAVKVLRMRSNTLPALPPQTCNLIALTHLDVSDNKIIELPKEISH
LTQLEEINVSNNEIKSLDCLLRLPRLRTVVAARNLITQFGVNDTSQMGFLEENKSEYRAP
LTNVDLRYNKLKGSIILGNYEHLVTLDVSQNSIEVLVLSSLRGLRELYAAHNSIQHLALH
GASLRVLHAPYNNMENLTTMVPPINLVEMNLTYNKLSSLPQWISGCSDLTKLFAKIEELV
LSGNSLSKLPDNLPQMNNIKIVRAHSNRLRSVPMFACSASVKILDFAHNELDSIDLRLLA
PKQLKFLDISCNKKLQMNPSQFNAYKCQRPLSLVDVTGQHGNSLSQKNNFHEELSGGTPW
VTGFSECPNKKLLLSCAQIRLPSFCNKEGLFAIIDGETDIEVPRILQSCLPGLLLEEKSI
KETVNEYMKYVILAAHRELKQKGQTKGACLVMCHLSPISTPDNSFGQSIRRYNIRLANVG
NTKAVLSRRNGPLCLGIDDNKRLGYSSRYPVNVPDPDIIQTVIKEDDEFLILGNAKFWES
VTVDTAISEVRAERNPVLAAKRLQDLAQSYGIEDCISVVIVRFDTVRSDVDLLMRELRHT
INTNKPVCNPDCCCSRLEPCCHSISPPKSNSDRSSPSGQSDRPSSETVSHQHYASVRSHN
RASERKPRGGVARAIRVRVEEDKETEKIIDDVPSSDEQFKCWEYMLEQNTQMIFDKELDN
LSKGIKSNSSSLRNLKGLSGSSPQLHLNTKQTKLPFLSKHFGSARSFGSNIKPEFRFGSG
RMPNGGPNAAYFGSLQRLMPYHLEYDFAVIQEKQTQSQDSLDLEGRMQQYWGVATTEL
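
The nucleotide and protein listings above can be cross-checked with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `translate` and `expected_protein_length` are illustrative and not part of the database. It translates the first and last lines of the nucleotide listing and derives the protein length implied by a CDS of 3417 nt that ends in a stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide listing in this record.
first_line = "ATGGTCTGTGACGACGGCCTCACTCCATCGCCTCTTATCTCACGCAAATCGCTTCGAAGA"
last_line = "CTTGATCTCGAGGGCCGGATGCAACAATATTGGGGAGTTGCAACAACTGAACTTTAA"

print(translate(first_line))          # MVCDDGLTPSPLISRKSLRR
print(translate(last_line))           # LDLEGRMQQYWGVATTEL*
print(expected_protein_length(3417))  # 1138
```

Both translated fragments agree with the start and end of the protein listing. Passing the full nucleotide listing to `translate` and comparing the result (minus the trailing `*`) with the protein listing would extend the check to the whole record.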