DPGLEAN20004 in OGS1.0

New model in OGS2.0  DPOGS216205
Genomic Position  scaffold1220:- 7198-24268
CDS Length  4539
Paired RNAseq reads  1457
Single RNAseq reads  3748
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004514 (2e-20)
Best Drosophila hit  protein tyrosine phosphatase 69D, isoform B (6e-177)
Best Human hit  receptor-type tyrosine-protein phosphatase alpha isoform 2 precursor (1e-99)
Best NR hit (blastp)  PREDICTED: similar to Protein tyrosine phosphatase 69D CG10975-PB, isoform B [Apis mellifera] (0.0)
Best NR hit (blastx)  GJ13803 [Drosophila virilis] (0.0)
GeneOntology terms
GO:0004725 protein tyrosine phosphatase activity
GO:0005001 transmembrane receptor protein tyrosine phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0005886 plasma membrane
GO:0008045 motor axon guidance
GO:0007415 defasciculation of motor neuron axon
GO:0007414 axonal defasciculation
GO:0004728 receptor signaling protein tyrosine phosphatase activity
GO:0007411 axon guidance
GO:0048813 dendrite morphogenesis
GO:0007409 axonogenesis
GO:0030424 axon
InterPro families
IPR016130 Protein-tyrosine phosphatase, active site
IPR003599 Immunoglobulin subtype
IPR003961 Fibronectin, type III
IPR000242 Protein-tyrosine phosphatase, receptor/non-receptor type
IPR003595 Protein-tyrosine phosphatase, catalytic
IPR013783 Immunoglobulin-like fold
IPR000387 Protein-tyrosine/Dual-specificity phosphatase
IPR007110 Immunoglobulin-like
IPR008957 Fibronectin type III domain
IPR013151 Immunoglobulin
IPR013106 Immunoglobulin V-set
Orthology group  MCL16042

Nucleotide sequence:

ATGTTGCTCTTGGTCAGGAAGTACCCCGTAACACTTGTGCTTTATTTTGGGTGTATCGCA
TCGTGTTATCTCGTTTTAGGCCAAGATGCATCAGAGATATTGGACTTACAATTGGCTAAA
GGGGACATTGGCGATGAGACGAATATCAGTTGCAGTGTAGCACCATCGGAAGTGAATGTG
GCATGGCTCTACAATAATAAACCGTTTAAGATTGGTGAAAGGATAAAACAGCGAGATGAA
GAAAGACTTCTACAGAAGAAAGATCTTGATGGTAATCCCAGGAAATATAAAATTTACAAT
CTCACGTTGACCAACACAACGGCGAACGACGATGGTAATTATACGTGCGTGGCGGTGCTA
GGGGAACTGAGGGCGGAGAAAACTATCGTCTTAGACCTCAGCTTCCCGGGAAGATTAATA
AATAAGACGACTGGACCAATAAAACAAAATGTTACAGACCAACAGAACGTCACGATGTTC
TGTGCATTCGAAATTTATCGGCCAAATGAAGTGAGGTGGTGGAAAAGGGGCAAAGACGAC
GAAATTATAGAACTCGGAACAAAAGCCGCTAAAGTGATCGATCTCAAACGGATGCAGAGT
CAATATGATCTCCACATACGTAGCCCGGAAGACAATGGGACTTACATCTGCGAGATATGG
GATTCAGTGACGTCATCGAACCTGACAGGGGGTATTGACGTCATAGTCTACGCAGCACCA
CAAGTAGTCATAGACACGGTCATACCGATCAGTGCCTCCCAACTATTCCTTAATTGGACA
ATCCGTTCATACAATTCACCAATCAAAAGCTACAATCTGATGTATCGAAAATTACCCTCA
ACCGACTTCAGTCTGTACACTACTGAGAAGATAAGAGTTAATAATATATCATTTGTCATG
GAGGGTTTGGAGAAATCTACTAAATATCAGCTGAAATTGGAGGTGACAACTACCTATGGA
TCCAGCAAGCCCCATATATATGAGTCTATAGTTCGGACTTTGGATAAAGATCCAATATTT
GTTCCACATATCTCCATCAATGGATTCTCAGCTACTTCAGTGACTATCGGATGGGCGCCA
CCTCCAGAGGATATAGCCGAGTTGATACACTATTATTTATTGGAGGCGAGGAAGATGGAT
GAGGTCGCACCAAGGAGAGCATATCACTCCAGGGACAGCAGAAATTTGCCATACATGTTC
GATAACTTGGAACCTCACAGCACTTACGTATTTCGGGTTTGTGCGTGCTCAGATTTCACA
AAGAAATGCGGTAATTGGTCTCTAGAGATGCAGGCTGCGACGTTGGACGGTATACCGGGA
CGGCCGAGCAATGTCACGGTCACTTGTAGTACTAGCTGGATGAATTTAACCTGGCAGCCC
CCGGTCAAACCTAATGCTGAGATCAAAGGATACACTATGGAACTAACAGGGAACGCGACC
TATAGAGACAGGTACGGCACATACAAAGAGGAGATGTGGGGACCGCTCACCAAGTTTAAA
ACAAACGATTCTAGAAGTGTCAGGTTTGAAGACTTGAAGCCGAATACTAACTACACGGTG
CGGCTGAGTGCGATGACTCGCACCCGGCGGCGCGGGGACGAGGAGGTCCGTCACTGCGCT
ACAGCACCCGCTCCCCCCGACTACCCGCCGCGACTGAGGTTTGAAGACTTGAAGCCGAAT
ACTAACTACACGGTGCGGCTGAGTGCGATGACTCGCACCCGGCGCCGCGGTGACGAGGAG
GTCCGTCACTGCGCTACAGCACCCGCTCCCCCCGACTACCCGCCGCGACTGAGGTGGAGA
AAGGAGCTGGATAATAATAAATACGTATTCACAATGCACTTACAAAGAATAGATGAGCGA
AACGGACCTATATGCTGTTATAGGGTGTACATGGTGAGATTGTTGCCACACTCAGATTGG
AATAATCTTCCCCCTCCGCGTGATATTAGTATAGTTGACTATGAAGAGGCCCACGGGGTA
CAACCTGTACTTGGAGCTTATATCACTGATGTATTCTCTAATGAAAAATTCCCACCCGAA
GCGAATCTTATAATGGGGGATGGTAAATCGTACTTCGATAAAGACGATCCCGGTCTCAAT
AGGGATTCCTGTAAGCGATGTTTAAGGAAACCGAGGCGCGTCTACGACCTGCCTCGACCA
CCGACCACCACACCGACCACAATACCTACCACCACTTACCAACCGCCCACTACCAGGGAC
GATCTATTCGAAGAGGAGATAGATACAACCGCTGAACCTGAAGAGGAGAGGAGGGAGAGA
CGGTCATACTTAGATATTGATAGGGATTATATGAAAAATCCAATGATGATGGACAAACTG
ATCGAGGTCGAAGTGAAAGAAGATCTGAATATTAAAGACGGTCTACTGGACCCGTCGGCG
AACTACACCGTCTTCATAGAATTAATACCCGGTTCACCATCAGACGACCCTCTGTACAGC
GAGTATCTGAACGTGTTAATGGCGGCCGCCACCCCCGTACCAACACAACCACCGTCGGCT
ATGGAATTAGCTATATTGGCGTCGTGTGTGGCGGCGGGGGCGGCGGTGTTGTCTCTAGCC
GCGTGGTGTGTGCTAAGGGCGAGACGATCTCGTAAGCTCCCTCCACACCATCACGTAGAA
ATGAATCCTATACAGGCTGCTCTAAGGTACGTCGTAGGTCACATCGGTGGACGTCAACAG
TTAATAAGTGCCGTGCCCCCGGACATGCCGCCCATAGCCAAGGAAGACCTCGCCGCCGCC
TACCACGAGAGACAAGCTGACTCCGACTACGGCTTCCAGAAGGAGTTCGAGATGTTACCG
GAGTGCTATCCAGACCGCACCACACACGCTTCGGAGGCAAGAGAGAATCAACCCAAGAAT
AGGTACCCGGACATCAAAGCATACGATCAGACGAGGGTCAAATTGACCCAGATAGATGGC
ATCAGTGGCTCTGATTACATAAACGCCAACTATGTCATGGGTTACAAGGAGCGTAAGCAA
TTCATTTGCGCCCAAGGTCCTACTGATACGACTGTGAACGATTTTTGGAGGATGATTTGG
GAGCATGACCTTGAACTGATAGTGATGCTGACCAACCTCGAGGAGTACTCCAAGGTCAAG
TGCAGCAAGTACTGGCCGGACGAGGTGAGAGGCGGCCGGGCCTTTGGCAGCATCAGCGTC
TATCACGTGGCTGAGAAGAGATATTCGGATTACATCGTGAGAGAGTTGAAGATATCGAAA
CAGCCTCTGAACTCGGACGGACAGCCGGTAGTTGAAAACAATGGAGTAGCTAAGAGGAAT
GGAGATTGTGGTATGAGTGACAGCGTGCCGACCTCGCCTCGTGATAACAAGTCTACGGAC
TGTCGCCTCGTCAGACAGTACCACTTCCTCATGTGGAAGGACTTTGCTGCTCCGGAGCAC
CCACACTCTATACTCAAATTTATAAAGAGAGTAAACGAAGCATGGTCAAGTATGGTCGGT
AGGCCGGTGGTAGTTCATTGTTCTGCCGGCGTAGGCCGCACGGGGACGCTCGTAGCACTG
GACTGTCTACTGGAACAGCTACGAGCTACGGGACACGCCTCCGTTTTCAACACCGTAGCC
GAGCTACGACGACAGAGGAACTTCCTTGTTCAATCATTGAAACAATACGTTTTCGTATAT
CGAGCGTTGGTTGAGTACGCGCACTACGGCGACACTGAAATACCGGCGTCAAGACTGAAG
AGTTCCATCGACAGGCTCAGGAACACACCAGAGGGCGCTGACAAGTGTCTCATGGAACAC
GAGTTCGAGAAGATGATGTCACCTCCTATATCCGAGGCTACGAAATCGTGTGCGGCGGGC
GGAGCCGGCGGTTCAGATGAACTGCGAGCCAGGAATCGAAGCCCGGACTGTCTGCCTTAC
GACAGGAACAGAGTCATCCTCCCCCCACTACCAGGACGAGATTACTCCACATACATCAAC
GCTTCATTCATTGAAGCATATGATAATACAGAAGGCTTCATCATCACGCAGGATCCACTT
CCGAACACTATTATGGACTTCTGGAGGATGGTAGCGGAACACAATGTGTCCACTATCGTT
ATGTTAAGTGAGCTTGGTGAAGGTAAATGTCCCCGTTATTGGGACGACGGTACCATACAA
TACGAACACATCTCAGTGCAGTACGAAGAGAGCGAGTCCTGTCCGTATTACACGAGGAGA
CAATTCAGAGTTACCAACAACAAGAGCGGAGAGTGGCGTTGCGTGAGACATCTTCAATAC
CAAGGTTGGCCGACAGCGGCGGGACACGTGCCGGAAGTGACGCGAGGGTTGGCGGAACTG
GCGGAACTAGCGGCGCCCTTAGACTCAGCGCCCGCGCCGCCGCTCGTAGTACACTGCCAA
TTTGGTACAGAGCGTTCACCTTTATTCGTAGCTCTATGTACTCTTATGTGTCAGCTGCGT
GTTGAACGTCGTGTAGATGTCGCCACAGTCGCCAGGAAGGTTCGCTCACAACGGGCGAGA
ACCATTGACACGTTTGTGAGTATTCGACTCTATATATAA

Protein sequence:

MLLLVRKYPVTLVLYFGCIASCYLVLGQDASEILDLQLAKGDIGDETNISCSVAPSEVNV
AWLYNNKPFKIGERIKQRDEERLLQKKDLDGNPRKYKIYNLTLTNTTANDDGNYTCVAVL
GELRAEKTIVLDLSFPGRLINKTTGPIKQNVTDQQNVTMFCAFEIYRPNEVRWWKRGKDD
EIIELGTKAAKVIDLKRMQSQYDLHIRSPEDNGTYICEIWDSVTSSNLTGGIDVIVYAAP
QVVIDTVIPISASQLFLNWTIRSYNSPIKSYNLMYRKLPSTDFSLYTTEKIRVNNISFVM
EGLEKSTKYQLKLEVTTTYGSSKPHIYESIVRTLDKDPIFVPHISINGFSATSVTIGWAP
PPEDIAELIHYYLLEARKMDEVAPRRAYHSRDSRNLPYMFDNLEPHSTYVFRVCACSDFT
KKCGNWSLEMQAATLDGIPGRPSNVTVTCSTSWMNLTWQPPVKPNAEIKGYTMELTGNAT
YRDRYGTYKEEMWGPLTKFKTNDSRSVRFEDLKPNTNYTVRLSAMTRTRRRGDEEVRHCA
TAPAPPDYPPRLRFEDLKPNTNYTVRLSAMTRTRRRGDEEVRHCATAPAPPDYPPRLRWR
KELDNNKYVFTMHLQRIDERNGPICCYRVYMVRLLPHSDWNNLPPPRDISIVDYEEAHGV
QPVLGAYITDVFSNEKFPPEANLIMGDGKSYFDKDDPGLNRDSCKRCLRKPRRVYDLPRP
PTTTPTTIPTTTYQPPTTRDDLFEEEIDTTAEPEEERRERRSYLDIDRDYMKNPMMMDKL
IEVEVKEDLNIKDGLLDPSANYTVFIELIPGSPSDDPLYSEYLNVLMAAATPVPTQPPSA
MELAILASCVAAGAAVLSLAAWCVLRARRSRKLPPHHHVEMNPIQAALRYVVGHIGGRQQ
LISAVPPDMPPIAKEDLAAAYHERQADSDYGFQKEFEMLPECYPDRTTHASEARENQPKN
RYPDIKAYDQTRVKLTQIDGISGSDYINANYVMGYKERKQFICAQGPTDTTVNDFWRMIW
EHDLELIVMLTNLEEYSKVKCSKYWPDEVRGGRAFGSISVYHVAEKRYSDYIVRELKISK
QPLNSDGQPVVENNGVAKRNGDCGMSDSVPTSPRDNKSTDCRLVRQYHFLMWKDFAAPEH
PHSILKFIKRVNEAWSSMVGRPVVVHCSAGVGRTGTLVALDCLLEQLRATGHASVFNTVA
ELRRQRNFLVQSLKQYVFVYRALVEYAHYGDTEIPASRLKSSIDRLRNTPEGADKCLMEH
EFEKMMSPPISEATKSCAAGGAGGSDELRARNRSPDCLPYDRNRVILPPLPGRDYSTYIN
ASFIEAYDNTEGFIITQDPLPNTIMDFWRMVAEHNVSTIVMLSELGEGKCPRYWDDGTIQ
YEHISVQYEESESCPYYTRRQFRVTNNKSGEWRCVRHLQYQGWPTAAGHVPEVTRGLAEL
AELAAPLDSAPAPPLVVHCQFGTERSPLFVALCTLMCQLRVERRVDVATVARKVRSQRAR
TIDTFVSIRLYI
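
Consistency check (illustrative sketch, not part of the database record): the CDS length of 4539 nt corresponds to 1513 codons, that is, 1512 amino acids plus one stop codon. The Python below assumes the standard genetic code and translates the final 39 nt line of the nucleotide sequence, which should reproduce the final line of the protein sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example.

```python
# Standard genetic code, built from the conventional TCAG base ordering.
# Assumption: the record uses the standard code (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the nucleotide sequence in this record (39 nt).
last_nt_line = "ACCATTGACACGTTTGTGAGTATTCGACTCTATATATAA"
print(translate(last_nt_line))  # TIDTFVSIRLYI

# Expected protein length implied by the CDS length field.
cds_length = 4539
print(cds_length // 3 - 1)  # 1512
```

The same `translate` function can be applied to the full nucleotide sequence with line breaks removed; under the stated assumption it should yield the protein sequence listed above.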