DPGLEAN21696 in OGS1.0

New model in OGS2.0  DPOGS215438
Genomic Position  scaffold939:- 21013-33789
CDS Length  2106
Paired RNAseq reads  1008
Single RNAseq reads  2735
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005728 (7e-122)
Best Drosophila hit  cdc14, isoform A (3e-93)
Best Human hit  dual specificity protein phosphatase CDC14A isoform 2 (2e-90)
Best NR hit (blastp)  GA20127 [Drosophila pseudoobscura pseudoobscura] (1e-114)
Best NR hit (blastx)  PREDICTED: CDC14 homolog A-like, partial [Saccoglossus kowalevskii] (1e-99)
GeneOntology terms
GO:0008138 protein tyrosine/serine/threonine phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0004725 protein tyrosine phosphatase activity
InterPro families
IPR016130 Protein-tyrosine phosphatase, active site
IPR000387 Protein-tyrosine/Dual-specificity phosphatase
IPR020422 Dual specificity phosphatase, subgroup, catalytic domain
IPR003595 Protein-tyrosine phosphatase, catalytic
IPR000340 Dual specificity phosphatase, catalytic domain
Orthology group  MCL14559

Nucleotide sequence:

ATGGATGAAAATGATGTTATCATATCATCTACAGAGATTATTAAAGACAGATTATACTTC
GCTACCCTCGAAACTGGATATAAACCAAAACCGACTCGCAACAGCAAATACTTCCACATA
GAAGATGATATTGTATATGAGAATTTTTATTTCGATTTTGGACCCTATCACCTGTGCCAT
TTGTATGAGTTTTGCAAGAGACTCAACGAGGAACTGGAGAAAAATCCGAAAAAAAAGATA
GTATTCTACACAAGCAACAACGATACTCTCAGACTGAATGCGGCATATCTCATTGGGAGT
TATCAGATAATATATTTGGGCGGTTCACCGGCCGCGGTGTACAAACAGCTGACGGATAAC
ACGTGGCAATTGTTGAATTTCCGCGATGCCTCCGGAGGTCCACCGTTATTTGACATATCG
CTCTTGGATGTGCTGAACGGCATTAAAGTCGCTCATGACGCTAAGTTCTTCGACTTCGAC
GACTTTGACGCTGATCAGTATTTGTTTTATGAGAAAGTGGAAAACGGCGATCTCAATTGG
ATAGTCCCCGATAAGATGCTGGCTTTCTCTGGACCACATCACCGCTCCCGCTTGGACCGC
GGCTACCCGTTACACAGTCCCGAACATTACCACGACTATTTCAAGAAGAACCACGTCACG
ACTGTAGTTAGACTGAACAAGAAGTCATACGACGCTAGACAGTTCACAGCTCACGGGTTC
GAGCATAGAGAACTATTCTTCGTCGATGGATCTGTACCATCAGAGTCGATTGTCAATAGA
TTCATTCGCATCGCTGAAGCCGCTAAGGGAGCTGTTGCTGTTCATTGTAAAGCTGGTCTC
GGTCGCACCGGCACGTTGATAGCATGCTATATGATGAAGCATCACGCGTTCACCGCTCGG
GAAGCCATCGCCTGGCTCAGAGTCTGTAGACCTGGCTCTGTTATAGGACACCAACAATGG
TTCCTTGAAAACATACAGCCTAGAATGCACGCTCTAGGAGAGGCTTATCGACGGCGTAAC
AACGTAACATCTCTACCAGTATTCACGAGAGCTATATACAGCCTTCGACCAGCGCAGACT
GAAGACAAACCCGTGTCACTGCAGACTATACTCAATAGCAGTAAAAATACGAATAAGTTG
GACAACGCTAACGTATTAAACGCGAATGAGACGGACTCGGAAAACAACGTAACTCAAGTC
ATCGGCAAATATAAGACCACAGCGACGCCAATGCTGCCAACAAAAACTCTGTTCTCACCA
AAAATGAACTACATGATACCCACCAACAACATCAGGAATGCCAACAATACAAACCTGAAG
CCCCAACCTAGGTTAATGAACGCCACGCTAAAGTCCCATTATGCAGCCAAGACTTCCACA
TTCCCGCCAACAAGAAGTGCCAATTCCCAAGTTAAATTAACTGCTGGGGTTAAACCACTG
ACTGGTAAGAATTCCTTCCACGGCCAACGGGCTAACCTCGCGAGGCCAAATCTTGCATAT
TCGCACGGCAGCAGCCCGATCAAAACATTCACCAGAAAGGGAATAGTATCAAAGCTGTCA
TCAAACGACACCACAGTAGTTACAACACTAACTAACGTAACTTCAACCCTATCAGCATTG
ACGGAGAATTGCCATTTGAATGGCAAACGGCTGCCGTCAGAACCTAATCTCAAAGCCGTA
CAGTCGAAAACACCCACGAATCTAAACGGCAGAAAGAAATTAATCGTTCGATCCAATTCC
ATAAATCGCAAGAAGCTTCCAAAAACACTTCCAAAAAGTGGTCTCGAAGGTACACAGGAT
CTATCATCCAGTGATACCAACATCACCAACATTTCCGCCGACTCCCTAGACACTCCTTTC
CGTTTCCGGCGGAAACGGGATAAATCGAAAACCCCAGAACCAATGGACTGCATATCGAAT
TCGGTGATAACATCCGACGTGACGCCAGATAATTACGAGGAGACGGGCAATTCGCAGGGG
AACAAGCTATATAAGATCAAAGCGTTACGGAAGAAATGTCCGGCTTTCGGCATGAGCCTA
TTGAAAGAGGGCATTCAAACACGGTCGAGTACTTGCGCGAGTAGTTCGACCGTCAAGAAG
AAATGA

Protein sequence:

MDENDVIISSTEIIKDRLYFATLETGYKPKPTRNSKYFHIEDDIVYENFYFDFGPYHLCH
LYEFCKRLNEELEKNPKKKIVFYTSNNDTLRLNAAYLIGSYQIIYLGGSPAAVYKQLTDN
TWQLLNFRDASGGPPLFDISLLDVLNGIKVAHDAKFFDFDDFDADQYLFYEKVENGDLNW
IVPDKMLAFSGPHHRSRLDRGYPLHSPEHYHDYFKKNHVTTVVRLNKKSYDARQFTAHGF
EHRELFFVDGSVPSESIVNRFIRIAEAAKGAVAVHCKAGLGRTGTLIACYMMKHHAFTAR
EAIAWLRVCRPGSVIGHQQWFLENIQPRMHALGEAYRRRNNVTSLPVFTRAIYSLRPAQT
EDKPVSLQTILNSSKNTNKLDNANVLNANETDSENNVTQVIGKYKTTATPMLPTKTLFSP
KMNYMIPTNNIRNANNTNLKPQPRLMNATLKSHYAAKTSTFPPTRSANSQVKLTAGVKPL
TGKNSFHGQRANLARPNLAYSHGSSPIKTFTRKGIVSKLSSNDTTVVTTLTNVTSTLSAL
TENCHLNGKRLPSEPNLKAVQSKTPTNLNGRKKLIVRSNSINRKKLPKTLPKSGLEGTQD
LSSSDTNITNISADSLDTPFRFRRKRDKSKTPEPMDCISNSVITSDVTPDNYEETGNSQG
NKLYKIKALRKKCPAFGMSLLKEGIQTRSSTCASSSTVKKK
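
The record gives a CDS length alongside the nucleotide and protein sequences, so the three can be checked against one another. The sketch below is a hypothetical helper and not part of the database. It assumes a complete CDS that starts with ATG, ends with one of the standard stop codons, and has a protein translation that omits the stop. The name `check_cds` and its return keys are illustrative only.

```python
# Minimal consistency check for a CDS record (hypothetical helper).
# Assumption: the CDS is complete, so nucleotides = 3 * (residues + 1 stop codon).

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def check_cds(nucleotide: str, protein: str, stated_length: int) -> dict:
    """Compare a CDS, its translation, and the stated CDS length."""
    # Remove line breaks and spaces left over from the wrapped sequence blocks.
    nt = "".join(nucleotide.split()).upper()
    aa = "".join(protein.split()).upper()
    return {
        "length_matches_stated": len(nt) == stated_length,
        "multiple_of_three": len(nt) % 3 == 0,
        "starts_with_atg": nt.startswith("ATG"),
        "ends_with_stop": nt[-3:] in STOP_CODONS,
        "protein_length_consistent": len(nt) == 3 * (len(aa) + 1),
    }


if __name__ == "__main__":
    # Toy input: ATG GAT TGA encodes M, D, then a stop codon.
    for name, ok in check_cds("ATGGAT\nTGA", "MD", 9).items():
        print(f"{name}: {ok}")
```

Under these assumptions, the stated CDS length of 2106 corresponds to 701 residues plus one stop codon, since 3 × 702 = 2106.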