DPGLEAN05871 in OGS1.0

New model in OGS2.0DPOGS209983 
Genomic Positionscaffold128:+ 291048-298182
See gene structure
CDS Length3351
Paired RNAseq reads  2919
Single RNAseq reads  6695
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011269 (0.0)
Best Drosophila hit  Pez (1e-65)
Best Human hittyrosine-protein phosphatase non-receptor type 14 (3e-55)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC000994 [Tribolium castaneum] (1e-154)
Best NR hit (blastx)  AGAP009975-PA [Anopheles gambiae str. PEST] (2e-93)
GeneOntology terms



  
GO:0004725 protein tyrosine phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0008092 cytoskeletal protein binding
GO:0005488 binding
GO:0005856 cytoskeleton
InterPro families








  
IPR019749 Band 4.1 domain
IPR000242 Protein-tyrosine phosphatase, receptor/non-receptor type
IPR000299 FERM domain
IPR019748 FERM central domain
IPR018979 FERM, N-terminal
IPR018980 FERM, C-terminal PH-like domain
IPR014352 FERM/acyl-CoA-binding protein, 3-helical bundle
IPR011993 Pleckstrin homology-type
IPR019747 FERM conserved site
IPR019750 Band 4.1 family
Orthology groupMCL11608

Nucleotide sequence:

ATGCCTTTCAAGCTCAAGTTGAAGAAGACACGGCAGTACAATGTAGCATCGAAGAGTCTG
TTTGTCATAAGTGTGGAGTTGTTAGACGGCGGTGTAGCGGACTGCACTCTGTCGGTGGAG
AGCACCGGCCAAGAGTGTCTCGACAACGTGTGCCAGCGCCAGGCGATCAACCAGCCGGAG
TTCTTCGGATTGAGATATATAAACAGGAACCAACAGCCGCGATGGGTTCAACTCGACCGC
CCACTAAAGCTCCAATTAGAAAAATATGCATCTTCACATCAATTGTATTTAAGGGTGATG
TATTACGTGATATCTGGAACTAGTCTAATCACGGATGAAGTGACGCGGTACCATTATTTC
CTGCAACTGAAGAATGATCTTGTGGAAGGTAGAATTATATGTGATTGCCAGCAAGCCGTG
GTGCTGGCATCGTATAGTCGTCAAGCAGAGTACGGTAACCACGATCGAGAGCGACACACC
GTGGAGTATCTAAAGAATTTGCTGACATTCCCCAAACAGATGTTAGACGGAGGTCAAGTA
CTAGACAGCAGCAGTCTAGCAGAGACGGGTCGCCTGGAGTGGCTGACGGCGGCTGTCATA
CAACATCACATGTCCCTGAACAACATGCCACAGGCTCAGGCCGAGGAGGGTTTCATCACG
ACATGCCAGCAGCTCCGCGGCTACGGCCAGGAGATGTTCACCGCCAAGGACAAGAAGGAC
CAGGCGGAGGTGACCCTCGCCGTGTCGCTGACGGGCATCCGGGTACTCACCGACACCAGC
GACACTCCACATTTCTACCGGTGGATGGACATAACGAACGTGATAAACCACAAGAGGACC
TTCAGCGTGGAGTGTCAGCAGCGGGAGTCGGCCTCCTTCGTGCTGCCGTCGCCCGAGGAC
GGCAAGTACGTGTGGCGGATGTGCGTCATGCAGCACACCTTCTACATGCGGCACCAGCAC
TCCCTCTCCGCCGCCGCCGCCACCGCCACCGCCACTGCCGCTGCCGTCGCGGACCAGCAC
AGACCCAAGTTCCAGGAACACGGTCTGTCTGAGAGTCGCGAGGAGCTCGAGTCCCGCGAC
CTGCGCGAGCCTCACCCCGCCGCGCTCAGCACGGCTCGCGCCAGGTCCGCCTCGTGTCTG
GAACTGGCTGAACATCGACCACATCTAGCCACCAGAGCCCTTCTTCCATCATACCGTCCA
GCCCCGGACTACGAGACGGCGATCCAGCAGAAGTACCAGCAGCAGCGCGCTGAAGCCCAA
CTCCGCTATCAGAACCACTCTCATTCCCAACTCCTGGGGGCCAGCACTCAACCCCTCCTG
TACGGCTCCCACCCCGATATACCTCGCGTTCACTACCCGGACGTGACGCGGCACACCGTG
TCCGTGAAGCAGCCCGTCACCGCCGACGATTACTCCTACGGGCTCAAGTTCGTGGGGAAC
TACCCTGTGGCCGTCAACCCCAACGTGCAGACCGAGCACGCCCTCCACTACGTGAACGTG
TACAAGCCGCCGCCCCCCTATCCGTCCAACGGTCTGGCCTCCAACTCGACCCCGGACCTG
GCAGTAGCGAGTCAAGCTCTTAACTACCATCGGAGCTACATCGACGCGCACGTGTCCGGC
TCCAGCCCCGACCTCGTCTCCACTAGGACCGCCCTCAACAGGCAGTACCTCGGTTATGTG
AGTCCGCACAACGTCGTCAACTACGGCCGAGCGAACGTCCTGCCGGCGACTCACGGCACC
TACAACAACCTCACGTCGGTGATGGAACCGAACCGCATCATCGTCGACCCCCACCTCGTA
TCGGACAATATACAGAAAGTCTACGACGACAGAGGGAACGTCATGTACTCCATGCCCACG
AGAAGGATAGTGGTGGTGCCGACCGTGCGGCACGAGGAGCCCCAGGAACCGATATACGAG
AACGTGCCTCTGCCGTGGACCTCGGACGGCGGCGCGAGAGGGAGGGCGCACAGTCTCACC
GCCGCGGGCGAGGTGACCGGCTTCAACGACAGGAACGGGAATTGCGCCCTCGCCACCGCG
CAGAGCCTCGCCTCCAAGATAGACGACTCGCACTATGTCAACGCCCAGGTCATCAAGGGT
GCGAGGGAGACGAGCGACACGTCAGAACAGAACCACGACACGGGTGACGTCACCGCCGAC
CTGGACAGGATGTCGCTGAGGAACGAAGACAAAGACTACGATGAGACACCCTACACCAGT
CTGTCCACGCAGAGAGTGACCAACAACAATACCATCAGCAAGTCGGCCGACAACGTCACG
TCCATAATGGTGACGGGAGAAGACCACGAGCATGACGGGGCGGCCGGGGCTCCGGGAAAG
GACGCGCGGGACTCCTCCTACAGCAGCAGCGTGGAGATGGATTGCAACAACGCCAGCAAG
GAGAAGAAGAGGAGGAGGTGGGGGGTGTTCATGGGCAGGAGTAAGAACGTCGAGGTGAAG
AGCGCCACTCTCGGGAGGGATAGGGCGAAGGCGGGACAGACGACCAACAAGCATCGCTGG
TCCACTGGACTGCCCAAATATCAGCCGCTACCACCCAGCATTACCAAAGAGACTCTGTGT
CAACTGTTGGAGCGCAAGATGTCGGAGGAGCAGCTGTCGTTCTCGTTCGAATCCATCCCC
CGGGGCCGCGGCGGGGACGGGGACTCCGGCCGGCCTTACGACGACAACCGCGTCAGACTG
CACCCCACGGCCGCCAACCCGCAAGGGTACATCGACGCATCGCACATCACAATGACGGTG
GGAGGTTCCCAGCGGTTCTACGTGGTGGTCCAGGCGAACAGCGGCGAGCAGGCGGCCCTG
CTCTGGGAGTGCGTGTGGCAGGTCGGCGCCAGGGTGCTGGCACTCGCGGATGGCCAGCAG
CCGCGCTACCTGCCGCAGGACCAGCGGACCATCACCTACGGACAGTTCGAGGTGTGGTGC
GGCGCGTGGTCGTGCACGTCGTGGGGCGGGTCGCGCCGGGTCACGGTGCGGAGAGGCGCC
GGCCCGCCCAGGACGCTCTGGCACGTCCGCTGTGGCTGGCCCGCAGACACCACCGGCTTC
CTGGACTTCATATCGGAGATCAACAGTCTCCGTGAGACGTGCGAGGCGGAGGCGGCGGCC
GAGCGCTCCCTGACCCTGAACGCGCCGCTCGTGCTGGGAGGGCGCGCCGCCGGCGTCACG
CTGGCCGCGGACCTGCTGCTGCACGTCATCGACACCAACCAGGAGCTGGACATCCCTCGC
ACCGTATCGTTGCTGCACCAGCAGCGCGCCGGCCTGCTGCCGAGCCTGCAGCACTACCGC
TTCCTGCACCTGGTGCTGCTCCACTACCTCAAGCAGTCCCGCCTCATATGA

Protein sequence:

MPFKLKLKKTRQYNVASKSLFVISVELLDGGVADCTLSVESTGQECLDNVCQRQAINQPE
FFGLRYINRNQQPRWVQLDRPLKLQLEKYASSHQLYLRVMYYVISGTSLITDEVTRYHYF
LQLKNDLVEGRIICDCQQAVVLASYSRQAEYGNHDRERHTVEYLKNLLTFPKQMLDGGQV
LDSSSLAETGRLEWLTAAVIQHHMSLNNMPQAQAEEGFITTCQQLRGYGQEMFTAKDKKD
QAEVTLAVSLTGIRVLTDTSDTPHFYRWMDITNVINHKRTFSVECQQRESASFVLPSPED
GKYVWRMCVMQHTFYMRHQHSLSAAAATATATAAAVADQHRPKFQEHGLSESREELESRD
LREPHPAALSTARARSASCLELAEHRPHLATRALLPSYRPAPDYETAIQQKYQQQRAEAQ
LRYQNHSHSQLLGASTQPLLYGSHPDIPRVHYPDVTRHTVSVKQPVTADDYSYGLKFVGN
YPVAVNPNVQTEHALHYVNVYKPPPPYPSNGLASNSTPDLAVASQALNYHRSYIDAHVSG
SSPDLVSTRTALNRQYLGYVSPHNVVNYGRANVLPATHGTYNNLTSVMEPNRIIVDPHLV
SDNIQKVYDDRGNVMYSMPTRRIVVVPTVRHEEPQEPIYENVPLPWTSDGGARGRAHSLT
AAGEVTGFNDRNGNCALATAQSLASKIDDSHYVNAQVIKGARETSDTSEQNHDTGDVTAD
LDRMSLRNEDKDYDETPYTSLSTQRVTNNNTISKSADNVTSIMVTGEDHEHDGAAGAPGK
DARDSSYSSSVEMDCNNASKEKKRRRWGVFMGRSKNVEVKSATLGRDRAKAGQTTNKHRW
STGLPKYQPLPPSITKETLCQLLERKMSEEQLSFSFESIPRGRGGDGDSGRPYDDNRVRL
HPTAANPQGYIDASHITMTVGGSQRFYVVVQANSGEQAALLWECVWQVGARVLALADGQQ
PRYLPQDQRTITYGQFEVWCGAWSCTSWGGSRRVTVRRGAGPPRTLWHVRCGWPADTTGF
LDFISEINSLRETCEAEAAAERSLTLNAPLVLGGRAAGVTLAADLLLHVIDTNQELDIPR
TVSLLHQQRAGLLPSLQHYRFLHLVLLHYLKQSRLI