DPGLEAN08833 in OGS1.0

New model in OGS2.0DPOGS203407 
Genomic Positionscaffold6:+ 1135652-1151413
See gene structure
CDS Length1599
Paired RNAseq reads  619
Single RNAseq reads  1608
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012343 (3e-136)
Best Drosophila hit  corkscrew, isoform B (7e-59)
Best Human hittyrosine-protein phosphatase non-receptor type 6 isoform 3 (6e-87)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC012910 [Tribolium castaneum] (6e-113)
Best NR hit (blastx)  AGAP002438-PA [Anopheles gambiae str. PEST] (8e-60)
GeneOntology terms






  
GO:0016311 dephosphorylation
GO:0004725 protein tyrosine phosphatase activity
GO:0005515 protein binding
GO:0016791 phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0005575 cellular_component
GO:0016787 hydrolase activity
GO:0004721 phosphoprotein phosphatase activity
InterPro families



  
IPR016130 Protein-tyrosine phosphatase, active site
IPR000242 Protein-tyrosine phosphatase, receptor/non-receptor type
IPR000980 SH2 motif
IPR000387 Protein-tyrosine/Dual-specificity phosphatase
IPR003595 Protein-tyrosine phosphatase, catalytic
Orthology groupMCL10985

Nucleotide sequence:

ATGTCCAAGGATAGCCCCAGGCTGGGCAGGCTGGGTGTGTTAGCGCCCGCCGCGCTATCA
GCTTGCATGGATGCTTCTGTGTTGGCTGTGCCTGCAGCTGGTGCTATAGTATCGACTGTC
CTCAAGGAACTCAGCAAGGCTTTCATGTTAAAAAAATGGTTCCACGGCGTGATGTCAGCT
AAGGAAGCCGAGCATCTGATGATGGAGAAGGGTCGGAACGGGTCGTTTCTGGTGCGAGAG
TCCCAGGCTCACCCTGGCGAGTACGTGCTGTCTGTTCGCGTGCGAGGTCGGGTCAGTCAT
GTTATGATCAGGAAACAGCAAAACAAATACGACGTAGGCAGCGGTGAGCAGTTCGATGAT
CTGGTGGGTTTGATAGAACATTTCCGATCCTATCCCATGATAGAGACCTCTGGTGACGTT
CTACGTCTTCTGCAGCCTGTCAGCGGAACCTGTCTCCGCGTGCATGATATCGATCAAAAA
GTACAGCAAATGGATGATTTCCAAAAACCCGATCAGAGAAACGGTTTCGACGGTGAATTT
CAATCGTTGAAGATGGTTGAGGACATGCACGTTTTTACAACGACCGAAGGCATGAAGGCG
GAAAATTTTAATAAAAACAGATATCGAAACATTTTACCTTATGACCAGACGCGTGTGTTA
CTGCGTGGACGCGACGGTCGCACAGAGTCAGATTACATTAATGCTAACTTCATTCGCTCG
TCCAGGCTGAGCGATTCCTCCAGCTCAGTACAATCCTCCAACGAGAGTCTGAACAGTGTC
AATTCTTTAATCCTTGGTATTGACCCAAAGAGAACAGTTCCGCTAGTCACCAAATCCCTA
TCGGAAGAGGCGCTGAGGGATGTTAAAAAGAGTATAAAATTGGACAGAATTAACAGAAAT
ATTTACAGAAATATAGTGAAAGAGAAAATATACATCGCAAGTCAGGGTTGTCTCTCAAAT
ACTGTAGACGATTTCTGGAGAATGCTATGGCAAGAGGACGTCAGGGTTATAGCAATGATC
ACAAACGAATTTGAAAAGGGAAAGAAAAAATGCGAGCGTTACTGGCCAGCATCAGGCCAA
GAGGAGCGTTACGATGAGCTGATTGTAAAATCAATTTCTGAGACCTGCTACGAAGACTAT
ACTTTGAGAGAATTTGATGTAAGCGATAAAAACATCTGCAGGACCATCTATCAGTACCAA
TATACGTGTTGGCCTGATCACGGCACACCAGCTGAACCTGAGGGGGTGCTTTCGTTCATG
GAAGATATTAATAGGAAGATGTATCAAATATCCCAACAAAAGGATGCGCCGGAACAGAAT
GTGTTGTGTGTGCACTGCTCAGCTGGAGTTGGAAGAACTGGAACGTTCATAGTGTTAGAT
ATGCTTATTGATAAAATAAAGCTAACTGGTTTTAACTGCGAAATAGACGTCCATAGCACG
GTGAAGTTGGTGCGAGCTCAACGCAGTGGCATGGTTCAGAATAAGGCGCAGTACAGATTC
ATTTATCTCGCATTGCAAAGTTACATAGATAATAATAATAAAGTTAAATTAAAAAGGAAA
GTAATTCTGTTACCATTCTTACGAAACATGGTTTTTTAA

Protein sequence:

MSKDSPRLGRLGVLAPAALSACMDASVLAVPAAGAIVSTVLKELSKAFMLKKWFHGVMSA
KEAEHLMMEKGRNGSFLVRESQAHPGEYVLSVRVRGRVSHVMIRKQQNKYDVGSGEQFDD
LVGLIEHFRSYPMIETSGDVLRLLQPVSGTCLRVHDIDQKVQQMDDFQKPDQRNGFDGEF
QSLKMVEDMHVFTTTEGMKAENFNKNRYRNILPYDQTRVLLRGRDGRTESDYINANFIRS
SRLSDSSSSVQSSNESLNSVNSLILGIDPKRTVPLVTKSLSEEALRDVKKSIKLDRINRN
IYRNIVKEKIYIASQGCLSNTVDDFWRMLWQEDVRVIAMITNEFEKGKKKCERYWPASGQ
EERYDELIVKSISETCYEDYTLREFDVSDKNICRTIYQYQYTCWPDHGTPAEPEGVLSFM
EDINRKMYQISQQKDAPEQNVLCVHCSAGVGRTGTFIVLDMLIDKIKLTGFNCEIDVHST
VKLVRAQRSGMVQNKAQYRFIYLALQSYIDNNNKVKLKRKVILLPFLRNMVF