New model in OGS2.0 | DPOGS211074  |
---|---|
Genomic Position | scaffold525:- 50137-60950 |
See gene structure | |
CDS Length | 3417 |
Paired RNAseq reads   | 258 |
Single RNAseq reads   | 706 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002956 (0.0) |
Best Drosophila hit   | PH domain leucine-rich repeat protein phosphatase (4e-82) |
Best Human hit | PH domain leucine-rich repeat-containing protein phosphatase 1 (2e-34) |
Best NR hit (blastp)   | PREDICTED: similar to adenylate cyclase [Tribolium castaneum] (1e-151) |
Best NR hit (blastx)   | PREDICTED: similar to adenylate cyclase [Tribolium castaneum] (2e-139) |
GeneOntology terms    | GO:0006470 protein amino acid dephosphorylation GO:0004722 protein serine/threonine phosphatase activity GO:0005575 cellular_component GO:0005515 protein binding |
InterPro families    | IPR001932 Protein phosphatase 2C-like IPR003591 Leucine-rich repeat, typical subtype IPR001611 Leucine-rich repeat IPR014045 Protein phosphatase 2C, N-terminal |
Orthology group | MCL10707 |
Nucleotide sequence:
ATGGTCTGTGACGACGGCCTCACTCCATCGCCTCTTATCTCACGCAAATCGCTTCGAAGA
CTAGCATCAACAAGAGGATTCAAACCGCGCTCTGAATCAACCTGGATACGCGTATTTGAT
GGTCTTGAACCTTATGCTGTGGATGCGCCGAGCAAGCTCGTAAAAGTGTCCCCCTATACT
ACCGTCGAAGACATAAACAAGAAACTTGGCTTCAATGAAGAATTGACGCTATGGGTGCAG
ATAGGAGGAGAAAATTCTCGACGGCTGGAATTGAACGAGTTCCCATTCCAAATACAAGAG
AAGTTTTTAATTAACAATGGTTGGAAGTCAGAGGCTAGGCGACAGCGGCTTGCGGTAGAT
CCGGAGTTACGTCACAGTCTGCGCTGGTGTGCAGGACCTTCCAGTCGGTCTGGTGGTGTC
CTGCGGTCAGGCACTGTTTATGTTTTAAAAGGGCACGTGTTCCCACAATGGAAGCCCCGA
CAGGCCCACATTATTGGATCGCAATTACATACACACGGTGTGTCCTGGGATATGTTGGAG
CTCAGTGGAGGTAGTATTGAAATGTGTCAACCGAAAGCTCAGAAACTAGTCCTCTGCGTA
AAGCTTCTTTGTCAAGGCAATGGTGTACTTGACACGGGAGTCAATCATTTATTTCTGGGG
TTCAATACAATTTGGGAGCGTAATATGTGGTGCCGTTGGTTAAAAGAGTGTATTGTAATG
GTTGAGAGTAATAAAATGAAATGTGAAGACGAGGAGATAGACAGCTTGGATGTTTTCTCT
CAAGAAAATGAAGATGTGTTCTTGGATAGCTTGGAACCAGCTACGTATAGAGAGAAATAT
GAAAACTCGGGCACTCAAAGTACACAGCCACCAAATGTCTTAGACTTGAGCGGTGGCGGA
CGGTCCTGTCTGCCCGTAGCTCTAAACCAGCACGCCTCAGATGGGTTCGCGGTGAAAGTT
TTAAGGATGCGGAGCAACACTCTACCAGCCTTGCCTCCTCAAACATGCAATCTGATAGCC
CTAACTCATCTCGATGTGAGCGACAATAAAATCATTGAGCTACCAAAGGAGATTTCACAT
CTAACGCAATTAGAAGAAATAAACGTGAGCAACAACGAAATTAAGTCGCTGGACTGTCTC
TTACGACTTCCTCGTCTACGAACTGTCGTAGCTGCTAGAAACCTGATCACGCAATTCGGA
GTCAATGACACTAGCCAAATGGGTTTTCTAGAGGAAAATAAATCAGAATATCGTGCACCG
CTAACGAATGTCGACCTCCGATACAATAAACTAAAAGGAAGCATAATTCTTGGTAATTAT
GAGCATCTGGTGACTCTTGATGTCTCTCAAAACTCCATTGAGGTTTTGGTGCTTTCATCG
CTGCGTGGACTACGGGAGCTGTATGCTGCTCATAATTCTATCCAGCACTTGGCCTTGCAT
GGTGCTTCGTTACGAGTCCTACATGCTCCGTACAATAATATGGAGAATTTGACAACAATG
GTGCCACCAATAAATTTAGTGGAGATGAACCTGACATACAATAAATTATCATCTTTACCA
CAGTGGATCAGTGGTTGTTCAGATCTGACCAAACTCTTTGCAAAAATTGAAGAACTGGTT
CTTTCTGGTAATTCACTCTCGAAATTGCCAGACAATTTGCCACAGATGAATAACATAAAA
ATTGTGAGGGCACATTCAAATCGTCTTCGCTCAGTTCCAATGTTTGCTTGCAGTGCTAGC
GTTAAAATTCTAGACTTTGCTCATAACGAACTGGACAGCATTGATCTGCGTCTTTTAGCA
CCGAAGCAATTAAAATTTTTAGACATATCATGTAATAAGAAGTTACAAATGAATCCCTCG
CAGTTTAACGCTTATAAATGTCAACGACCTTTAAGCCTAGTTGATGTCACTGGACAACAT
GGAAATTCTTTATCGCAAAAAAATAATTTTCATGAAGAATTAAGTGGTGGGACCCCGTGG
GTAACTGGTTTTTCGGAATGTCCAAATAAAAAACTTCTTCTATCTTGTGCACAAATACGA
CTTCCATCGTTTTGTAACAAGGAAGGCTTATTTGCTATAATTGACGGGGAAACAGATATC
GAAGTCCCAAGAATACTACAGTCATGTCTTCCAGGACTACTACTTGAAGAAAAATCTATT
AAGGAAACAGTCAATGAATATATGAAATATGTTATACTAGCTGCACATAGAGAATTGAAA
CAAAAAGGACAGACAAAAGGTGCATGTCTTGTTATGTGTCACTTGTCTCCTATTAGTACC
CCCGATAACAGTTTTGGACAATCTATAAGACGATATAATATAAGATTAGCGAATGTCGGC
AATACAAAAGCAGTGTTAAGTCGTCGTAATGGCCCTTTATGTTTAGGTATAGATGATAAT
AAGCGATTAGGTTATTCTTCAAGATACCCAGTTAATGTACCTGATCCCGATATTATACAA
ACTGTAATTAAAGAAGACGATGAGTTTTTAATATTAGGAAACGCTAAATTTTGGGAATCC
GTTACAGTCGATACTGCAATATCAGAAGTGAGGGCTGAACGGAATCCAGTATTAGCAGCA
AAGAGATTACAAGATTTGGCTCAAAGTTATGGAATAGAAGATTGTATATCGGTGGTAATC
GTAAGATTTGATACAGTTCGTTCTGATGTAGATTTATTAATGCGAGAATTACGACATACG
ATCAACACAAACAAACCTGTATGTAATCCTGACTGCTGTTGCTCTCGTTTAGAACCATGT
TGCCATTCTATCTCACCACCAAAATCAAATAGCGATAGATCTTCTCCAAGTGGACAAAGC
GATCGACCTTCTAGTGAAACAGTTAGTCATCAACACTATGCCAGTGTACGTTCTCATAAT
AGGGCCTCAGAAAGAAAACCAAGAGGCGGAGTTGCACGAGCAATTCGAGTACGAGTTGAA
GAAGATAAAGAGACTGAAAAAATTATTGACGATGTTCCCTCTTCAGATGAACAATTCAAA
TGTTGGGAATATATGCTGGAGCAAAATACACAAATGATATTTGATAAAGAGCTAGATAAT
CTTTCAAAAGGTATCAAATCAAATTCAAGTAGTTTAAGAAATTTAAAGGGACTCTCAGGA
AGTAGTCCCCAACTACATCTAAATACGAAACAAACAAAACTACCGTTTCTCTCAAAACAT
TTCGGGAGTGCTAGATCTTTCGGTAGTAATATAAAGCCTGAGTTTCGTTTCGGTTCAGGA
AGAATGCCTAATGGTGGTCCAAATGCTGCTTACTTTGGTTCACTTCAAAGGTTAATGCCT
TATCATTTAGAATACGATTTCGCGGTTATTCAAGAAAAACAAACACAATCACAGGACTCT
CTTGATCTCGAGGGCCGGATGCAACAATATTGGGGAGTTGCAACAACTGAACTTTAA
Protein sequence:
MVCDDGLTPSPLISRKSLRRLASTRGFKPRSESTWIRVFDGLEPYAVDAPSKLVKVSPYT
TVEDINKKLGFNEELTLWVQIGGENSRRLELNEFPFQIQEKFLINNGWKSEARRQRLAVD
PELRHSLRWCAGPSSRSGGVLRSGTVYVLKGHVFPQWKPRQAHIIGSQLHTHGVSWDMLE
LSGGSIEMCQPKAQKLVLCVKLLCQGNGVLDTGVNHLFLGFNTIWERNMWCRWLKECIVM
VESNKMKCEDEEIDSLDVFSQENEDVFLDSLEPATYREKYENSGTQSTQPPNVLDLSGGG
RSCLPVALNQHASDGFAVKVLRMRSNTLPALPPQTCNLIALTHLDVSDNKIIELPKEISH
LTQLEEINVSNNEIKSLDCLLRLPRLRTVVAARNLITQFGVNDTSQMGFLEENKSEYRAP
LTNVDLRYNKLKGSIILGNYEHLVTLDVSQNSIEVLVLSSLRGLRELYAAHNSIQHLALH
GASLRVLHAPYNNMENLTTMVPPINLVEMNLTYNKLSSLPQWISGCSDLTKLFAKIEELV
LSGNSLSKLPDNLPQMNNIKIVRAHSNRLRSVPMFACSASVKILDFAHNELDSIDLRLLA
PKQLKFLDISCNKKLQMNPSQFNAYKCQRPLSLVDVTGQHGNSLSQKNNFHEELSGGTPW
VTGFSECPNKKLLLSCAQIRLPSFCNKEGLFAIIDGETDIEVPRILQSCLPGLLLEEKSI
KETVNEYMKYVILAAHRELKQKGQTKGACLVMCHLSPISTPDNSFGQSIRRYNIRLANVG
NTKAVLSRRNGPLCLGIDDNKRLGYSSRYPVNVPDPDIIQTVIKEDDEFLILGNAKFWES
VTVDTAISEVRAERNPVLAAKRLQDLAQSYGIEDCISVVIVRFDTVRSDVDLLMRELRHT
INTNKPVCNPDCCCSRLEPCCHSISPPKSNSDRSSPSGQSDRPSSETVSHQHYASVRSHN
RASERKPRGGVARAIRVRVEEDKETEKIIDDVPSSDEQFKCWEYMLEQNTQMIFDKELDN
LSKGIKSNSSSLRNLKGLSGSSPQLHLNTKQTKLPFLSKHFGSARSFGSNIKPEFRFGSG
RMPNGGPNAAYFGSLQRLMPYHLEYDFAVIQEKQTQSQDSLDLEGRMQQYWGVATTEL