DPGLEAN20007 in OGS1.0

New model in OGS2.0DPOGS216187 
Genomic Positionscaffold1220:+ 33380-44739
See gene structure
CDS Length3258
Paired RNAseq reads  732
Single RNAseq reads  1799
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004537 (0.0)
Best Drosophila hit  protein tyrosine phosphatase 99A, isoform A (2e-56)
Best Human hitreceptor-type tyrosine-protein phosphatase zeta precursor (3e-52)
Best NR hit (blastp)  PREDICTED: similar to Protein tyrosine phosphatase 99A CG2005-PB, isoform B [Apis mellifera] (8e-64)
Best NR hit (blastx)  PREDICTED: similar to Protein tyrosine phosphatase 99A CG2005-PB, isoform B [Apis mellifera] (6e-61)
GeneOntology terms






  
GO:0005886 plasma membrane
GO:0005001 transmembrane receptor protein tyrosine phosphatase activity
GO:0004725 protein tyrosine phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0008045 motor axon guidance
GO:0007415 defasciculation of motor neuron axon
GO:0007414 axonal defasciculation
GO:0030424 axon
InterPro families





  
IPR016130 Protein-tyrosine phosphatase, active site
IPR000242 Protein-tyrosine phosphatase, receptor/non-receptor type
IPR003595 Protein-tyrosine phosphatase, catalytic
IPR013783 Immunoglobulin-like fold
IPR000387 Protein-tyrosine/Dual-specificity phosphatase
IPR003961 Fibronectin, type III
IPR008957 Fibronectin type III domain
Orthology groupMCL39866

Nucleotide sequence:

ATGTCGTCCGACCTCCTGGCCCGCCTGGGACCCCAGTCCGAGTCCCTCTACGTGTTGACG
GACGTTAGACCTCCATCAGCACCTGTTATACTCAACCTGACCTGCGACCAACAGAATGGT
ATTCTCTACCTGCAATGGCGTCAACCTCTCGAGTACAACAACTCATTAGACCAGTACGTG
GTGACGCTGAGGAAGATACCGGAACAGCAGCCGAGGACCAGACTCACGCTGTTCACCAAG
AAGGAGGACATCGAGACTATGATCAGTGTTAAAGTGGATTTATCAAACTCTACAATGTAC
GAGGTGAAGATTTACGCCGTGACCTTATCAGTTGCGACGCCGAAAACACTTATCAATGGA
TCGGAATCACCTCCGAAGGATGTGTCGAGTGAGTCGTGCGCGGTGGTAGCGGCACGAGCG
GGGATCGCGGATATGGAGGGCGAAGCGCCGGGTGCTCCGGCAGCACTGTTGGCTGCCGCC
CTACTAGCCGCCCTAGCAGCCGGTGGAGCCGCCCTAGTTTACTGGAGATGTAGATCTCGT
GTGAGCAAATGTATCAGCGCCGCTTACAATTACTTGGAAGAGGGCGGTGAGAGAGCGGCC
AGGGCTCCACTAAATATAAACAAGAAACCTCATGGCGTCCCCCGTCAGATCAAGCTGGAG
TGGGTAGTGTGGCGGCGGAGGTACATCGCCACTCAAGGCCCCACGCCAGCCACACTAGAC
GCCTTCTGGCGGATGATCTGGCAGCACAGGGTCTGCACCTTGGTCATGATCACCAACCTC
GTGGAGCGGGGCAGGCGTAAGTGCGACATGTACTGGCCGGCGGGCGGGCGCGGCAGTTCC
GCGGAGTTCGGCGGGATACACGTGACGCTGCTGTATGAGGACGTGAGGGCCGCTTACACC
GTCAGACATCTCAGGGGTAGCGAATCGTCGAGCGAGTCGAGTACAGCGAGTGGCGAGGGT
CGCCACGTGGTCCAGTACCACTACACCGTGTGGCCCGACCACGGCACGCCACGGCATCCG
TTAGCTGTGTTGCCGTTCGTACGGGCCGCTGCAGATCCGGCAACCGTGCTCGTTCATTGC
AGTGCGGGCGTTGGAAGAACGGGTACATACATAGTGATAGACGCACAACTGAATCAATTA
AAACTCACGGGAACCCTGTCGCCTTTAGGGTTCCTCTGTCGCGCGCGAACGCAAAGGAAC
CATTTAGTGCAGACCGAGGAACAGTATGTATTCGTACATGACGCTCTGTTAGAGTACGTG
CGTTCGGGTAACACAGAAGTGGAGTTCACAAAAGCTAGGGAATATCTGGCGAAGCTTCTA
GAACCGATATCAGAGGAGGAGCTAGCGGTTATGGACCTTAATCCTATAAAGCATAAGAGC
GTTAACGAAATGAACGGCGAGAACGACATGTCGAGTGTCAAATCTATAGAATGTAGCGAT
AATATAGTAGAAAATGGAAGCAGTCAAGTGTCGATAAAAACTGACGAATTGAACAGCGAA
AGTAAATCATCTGTGGACAACCAAGAGAAGGATGGATTGGTCAACGGCGATGACTCGGAG
GGAGTTTATGATCTGGCGCCGAGGTCCACAGATACTTATAATAAGAAAATGGCGGCCTAT
AACAGCATGAATGAACAAGAGAAAGAGGAAATGCGCAGAGTAAACCGAGCCGAAAACTAC
GCGCTGTTGGAACGGATGCGCTCTTTATCGAACAGGCACCAACTGTACCAAGGACCTCCT
CCTGTTAACTTGTTAGAGAAACAGAATCAGTTGATAACACGTTCGTGTGTGGAGGCGAGT
GTGTGTGCTCGAGCTCCTCACAACGCTGATAAGAACAGACCGGGTGGCATCCTACCCTCA
GACTCCGCCAGGGTCATGTTGGTACCGAAACCCGGTGTTGAAGGAAGTGAATATATAAAT
GCTTCGTGGGTTTGTGGCGTGCGTCGTGTGAGAGAGTACGCCGTGTGTCAACACACAGAG
GCACCTGACCCGTGGCGGCTGTTGTGGGATCACACCGCACAACTAGTGTTGTTACTACAC
GATGATGAACATCCGGAGTGCGATGTGTTTTGGCCGACAGAAGATGAGAAGGAACTGTTC
GTGGCTAATTTCCGTGCGAGTTTTGTGTCTAAGGAAGTATATGTGGCGCACAGGAGATCG
GATAGGACGAGTCGAACAGACACACCCAGCGAGCCCGAGACCAACGGGTACAGAAGGCAG
GAGGGCAGCGACTGTGCTGACGACGAACGACTCATACCTGATAATAATTCACCAGTTAAT
AATACGGAACCATCCTACAGGTTCGACAGGACGGAACTCCGATTGGAGCGTCTCAGTAAC
AGAGATCTGTCCGCCCGGAAATCCATAGCGAACGGAGATTTATTCTCGTCATTATCAGAG
AAGAAGAACGGTCCAAAATCACCGAGAAGTCCATCGAAGATGTCGTTGAAGAACTTTAAG
CTGAGCTCTCCCACCAAGTTCAAATTCCCCGAGTGGGGTTCTAGAGCGGCTGGTTCACCG
CCAGATACTGCACCACCACCGCCACCTATCACACCATCGCTCACCGTAGAAGAGGAAGCT
GAACTACGAAGACCCGTGTACACATTCGAAAAAGTAAAACACCTCCCAAACGTACCATCA
GACAGGGTTATAGAAGTGACGAACGTGAGCGTGCATTCATTGCAGGACGATTACCAGTTA
AGTGTTAAGTTCATAAAGTGCAGTGGCTGGTTGAAAGGTGCTACCACCAAATACAGCGCG
GGTCGGCCGGATGATAATGAATACGTTCGTGCTGTGAGGCATTCCAGCGGGAGCGAACGG
GAAGCGGCCATCGATAGGCTCATAGCGCCCTACCAGGATTCGTTCGCGTTGATAGAGTTC
GTCGCTGGATGTCAGATGGAATACAAGAATGGACCGGTTGTTGTTGTTGACAAATACGGC
GGCTGGCGAGCGTTAACTTTTTGTTCGCTAAGCGCTGCGTGTGGTGGAGTAAGGAATCCA
GATATCAAGGAACCTGGCAGTGAGTGGGCTTCACCCTGTGTGGCCGCTGATTTATACTGC
AGTAGTGCTCTGAACGCTCACGCCCGCTGCCAGGCCTCACCAAATTCCCCGGCCTCACAG
ACTTCTCAGACTTCTCAGGCCTCACAGAGCTCTCAGACATCTCAGGAACGTCCCGTGACC
CACTCGCCGGAGGCTCTGCTGGCGGCGTACTGCGCCCTCACCGCGTACGCAACTAAACTA
CCGAGGCCCGATAGCTGA

Protein sequence:

MSSDLLARLGPQSESLYVLTDVRPPSAPVILNLTCDQQNGILYLQWRQPLEYNNSLDQYV
VTLRKIPEQQPRTRLTLFTKKEDIETMISVKVDLSNSTMYEVKIYAVTLSVATPKTLING
SESPPKDVSSESCAVVAARAGIADMEGEAPGAPAALLAAALLAALAAGGAALVYWRCRSR
VSKCISAAYNYLEEGGERAARAPLNINKKPHGVPRQIKLEWVVWRRRYIATQGPTPATLD
AFWRMIWQHRVCTLVMITNLVERGRRKCDMYWPAGGRGSSAEFGGIHVTLLYEDVRAAYT
VRHLRGSESSSESSTASGEGRHVVQYHYTVWPDHGTPRHPLAVLPFVRAAADPATVLVHC
SAGVGRTGTYIVIDAQLNQLKLTGTLSPLGFLCRARTQRNHLVQTEEQYVFVHDALLEYV
RSGNTEVEFTKAREYLAKLLEPISEEELAVMDLNPIKHKSVNEMNGENDMSSVKSIECSD
NIVENGSSQVSIKTDELNSESKSSVDNQEKDGLVNGDDSEGVYDLAPRSTDTYNKKMAAY
NSMNEQEKEEMRRVNRAENYALLERMRSLSNRHQLYQGPPPVNLLEKQNQLITRSCVEAS
VCARAPHNADKNRPGGILPSDSARVMLVPKPGVEGSEYINASWVCGVRRVREYAVCQHTE
APDPWRLLWDHTAQLVLLLHDDEHPECDVFWPTEDEKELFVANFRASFVSKEVYVAHRRS
DRTSRTDTPSEPETNGYRRQEGSDCADDERLIPDNNSPVNNTEPSYRFDRTELRLERLSN
RDLSARKSIANGDLFSSLSEKKNGPKSPRSPSKMSLKNFKLSSPTKFKFPEWGSRAAGSP
PDTAPPPPPITPSLTVEEEAELRRPVYTFEKVKHLPNVPSDRVIEVTNVSVHSLQDDYQL
SVKFIKCSGWLKGATTKYSAGRPDDNEYVRAVRHSSGSEREAAIDRLIAPYQDSFALIEF
VAGCQMEYKNGPVVVVDKYGGWRALTFCSLSAACGGVRNPDIKEPGSEWASPCVAADLYC
SSALNAHARCQASPNSPASQTSQTSQASQSSQTSQERPVTHSPEALLAAYCALTAYATKL
PRPDS