DPGLEAN17645 in OGS1.0

Genomic Position  scaffold1131:+ 41515-48400
CDS Length  3201
Paired RNAseq reads  665
Single RNAseq reads  1995
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007417 (0.0)
Best Drosophila hit  moesin, isoform B (7e-09)
Best Human hit  tyrosine-protein phosphatase non-receptor type 13 isoform 1 (6e-24)
Best NR hit (blastp)  tyrosine-protein phosphatase [Bombyx mori] (0.0)
Best NR hit (blastx)  tyrosine-protein phosphatase [Bombyx mori] (0.0)
GeneOntology terms
GO:0016311 dephosphorylation
GO:0005515 protein binding
GO:0016791 phosphatase activity
GO:0005856 cytoskeleton
GO:0004725 protein tyrosine phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0016787 hydrolase activity
GO:0004721 phosphoprotein phosphatase activity
InterPro families
IPR019749 Band 4.1 domain
IPR001478 PDZ/DHR/GLGF
IPR019748 FERM central domain
IPR000299 FERM domain
IPR014352 FERM/acyl-CoA-binding protein, 3-helical bundle
IPR018979 FERM, N-terminal
IPR018980 FERM, C-terminal PH-like domain
Orthology group  MCL10667

Nucleotide sequence:

ATGAAGCTTGGACAGCAGAATCGGAAACAGTGTGACGGGGGAGTAACGTATATACAGGGT
GTGAAGTCCCGCGACACCGACAATAGATGGCGCTGTCAGTCAGCTATCACATCTATTGGT
TTAACATGCTATATTCATCTCTCTGAGAGCAATTACAAAGCTCACTCAGGCCAAGTCGCC
GAAAATATATTGCTTATTTCATCGAATCATAATAAATCTTTTCATAAAATGCCTCGACAT
TGTGACGAAGATTCTGGACGTAGTTCGTGTTCTGCAACATCTATAACATTCAGTCCTGTC
ATAGACTACCATAAGTTTGATACAATGAAGACAAAAGGAAAAACACAAGATTGTTCCTCA
GAGCTCAAAGTCAGCACACATCCAAGGAATTTTATTAACACCTTTAAATCACCCAATGAA
AAGGAACCCTTTCCAAGCTGCAGGCTGCCAAGGAATCAAAGGAAAGCAGTTTTTAGTTTG
TTTGACACCCCAAGACTAGGAGGTTTAAGTAGTGCTCACTCAACATATAATATCGCCAGC
AGCACACAGCTTGAGGCCATGTCCAGTAATGCGGTTTCCATGACAGGTCTTAATGATGGT
CATAAAATTGAAAAGGAGAGCCATAATAGCAATTTTAGAAGAGGAAAACCTGTTCAGAGA
GCAGCTTCCCGCCTGTATAAAGCAGATACAGTCAAAGGAAAAGAAGGATGTGTCGGACCA
GAATTCATAGTGAGGGCTTCACAACCAGCTAAACATATTGATTTAAGCATATCATCAACA
TCTGAGAAACGAACTATCAATGTTGTTCTATTGAATGGTCAAAAAGTAGATGTGGTATGT
GATTCGAATACTGTGACGGCTGGACAAATTTTTGAAGCTTTAGTGTACAGTGAGAAATAC
GAACATAACTTTATGCTCGGTGTTGCTATATTAATCACTGGAGATTTTATATTTCTGCCG
GACGATTACAAGCTGAAGAAGGTTGCTCCCGATTCATGGCATAAAAACATGAAAAACAAA
GTCAATGTTGAGGATGCTTCATTCTTGCCTAAAAATATTTCAAATAGTATACAGGGTGGT
GACTGGAGGTATAGAGTGTATTTGCAATTACGACGAGCAACCTTAGAAGGACAAATGGTA
TCAACGATACAAAATCTTATACTCCTGGCCGGCTATGCATTACATATAGAATTTGGGGAG
TTTAACTACAGAGAACATGGCACCGCTGATTACTTTCTACTAGAACATTATTTGCCAGAA
TTCATGATAACACATGACATGGCTGACGTTAAGCTCAAAATGAAACGGGCTCACGAATCT
CGACGCGGTTTGGAGAAGAGCAAGGCCATAATAAACTATATAACATTAGCTCAAACATTT
AGGGACTATGGTGCACATTTCTATTCAGCGGTTTGGGCTACGAGAGACGGCTTCTGTAGA
GATGTTTGGCTTTCAATTGGACCGAGAGGTGTCACACTTTATTCCAGAAATAACATCTCC
GAAGAAACATCCAACACTGTCCGGATAGTCTTACAGAGCTTGCCGTGGCATCACATTCAT
ACGTTCTGCTATAACAAAAAAAGCCTTTATATTATGCCAAATGCATATTCGGGGCTCTCC
AAAATAGGCATCAAATATAAATTAAAAATGTCGGATAACAAGAGCTTTTTTGCATTTTGG
CTTGCGTCTTTCCATCACAGACTCTATTTAAAACTATATGCAAAAGAGGATTTCATCAAT
TATTTATCAAGTGAACTCAATTGCCCAATAAAGACGGCGAATAGTCCAAAGAAATCCGTA
GAGTGCAGTAATTATCAGGATAGTTACAATTTGGCTGTCAGAAACCCGTCGAGGGTTCGG
AGGCCGGTTAGAAGGAGATTCAAAGTTGATCTTTTTAGTGACAAAAGAAATATGAATAAA
GAAAACGAGAAGCCAAACACTGAGGAGTTATTACGGCAAATATTGTCACCTGCAGCCAGC
AATGACAATATACTACAGTCTCCAAGTGGAAACGCGCATAGAAGCTCTTCAAACGAGGGT
CTGTCGTCATCTGAAAGTAGTTTACCGAGACGACATCGAGTAAAGATGGGAACTAGAGTG
TTTAACGGTATGAAGTCTAACCTAGATGTTAGATTCTCTCAATCCCCAGCTAAGATGCTG
TCAGTGAGATCGGACGGAGATCTGGCCACCACGCAAAGTGACTCCGACGATTTAAGTTCA
GAGCAAAGACAATGCCATATAAATAGTATGAAGTTATTACCAGAGAGATGTTACAGTCAA
AGCATGCTGCACGCGCAGACCGCTTACGTCCTGGAATCCCCGAAGGTCTACTCCGACGTT
TTCAACTACGACACCGGCAACGAATCCTGTGTAAATACTTCATTATTCGAAAAATTAGAT
AATATGGAATGCGTGCAAGGAGAGAGGGTATTCGTCACTACCGTATTAGAGAGGGACAAA
ATGAATGCCTTAGGTTTGCAAGTTGCCGAAGGCTCTGATGGTAATGTATACATAAAATCT
ATAACCCCCGGAAGTTCGGCGGACAGTTGTGGAAAGCTTTTGGCAGGAGATCAAATCATA
TCAGTGAATGGGCAAACTCTACTTAATCTGAAATACGATAAGGCTTTAAGTATGTTACAG
AGCGCTCCAGATAGGGTAGAACTTATTGTCTTACAAAATACTACAAAGAATAACTTCACA
GACAATATCCATTGCTCCCAACAGTTAAAGCAAAGCTTAAGACATAGCATAGCAAGCGCT
TTGGATGCAGAAGGGACGGACGACGAGCTGATCAACGAAGAAGCGTTGAAGACTATCTAC
GCTCTTATAAAACTGACTAAAGAAAAATTTAATACATGGCGCGGACAGCTGGCGGAAAAA
CCCAAGCGGCGACCTCTTAGCTTAAGCATACCGACAGACGTCAGTCTCCCTGATGACTAC
CTGAACGATGACGAAGTCATGAAAAAGACTTCATTGAAGTCTATCAAGTCTTCCCTCAAC
AGTATTGATAGTAAAAGCGTCTCGAGGAACGTTGCCTTGCCTCGTAACTTTGGTATGAGC
AGACGATGGCTAGGTCCAGTCAAATACCCCGTGACGCCGTGCAAAAACAGCGCCACCGAT
GTTGTAGCCGACGGAAACATAGTTAGGAGACATTTTGTTTACGGCACGGGGGACTCAGAC
GAAGAGCAGATATTTCTGTGA

Protein sequence:

MKLGQQNRKQCDGGVTYIQGVKSRDTDNRWRCQSAITSIGLTCYIHLSESNYKAHSGQVA
ENILLISSNHNKSFHKMPRHCDEDSGRSSCSATSITFSPVIDYHKFDTMKTKGKTQDCSS
ELKVSTHPRNFINTFKSPNEKEPFPSCRLPRNQRKAVFSLFDTPRLGGLSSAHSTYNIAS
STQLEAMSSNAVSMTGLNDGHKIEKESHNSNFRRGKPVQRAASRLYKADTVKGKEGCVGP
EFIVRASQPAKHIDLSISSTSEKRTINVVLLNGQKVDVVCDSNTVTAGQIFEALVYSEKY
EHNFMLGVAILITGDFIFLPDDYKLKKVAPDSWHKNMKNKVNVEDASFLPKNISNSIQGG
DWRYRVYLQLRRATLEGQMVSTIQNLILLAGYALHIEFGEFNYREHGTADYFLLEHYLPE
FMITHDMADVKLKMKRAHESRRGLEKSKAIINYITLAQTFRDYGAHFYSAVWATRDGFCR
DVWLSIGPRGVTLYSRNNISEETSNTVRIVLQSLPWHHIHTFCYNKKSLYIMPNAYSGLS
KIGIKYKLKMSDNKSFFAFWLASFHHRLYLKLYAKEDFINYLSSELNCPIKTANSPKKSV
ECSNYQDSYNLAVRNPSRVRRPVRRRFKVDLFSDKRNMNKENEKPNTEELLRQILSPAAS
NDNILQSPSGNAHRSSSNEGLSSSESSLPRRHRVKMGTRVFNGMKSNLDVRFSQSPAKML
SVRSDGDLATTQSDSDDLSSEQRQCHINSMKLLPERCYSQSMLHAQTAYVLESPKVYSDV
FNYDTGNESCVNTSLFEKLDNMECVQGERVFVTTVLERDKMNALGLQVAEGSDGNVYIKS
ITPGSSADSCGKLLAGDQIISVNGQTLLNLKYDKALSMLQSAPDRVELIVLQNTTKNNFT
DNIHCSQQLKQSLRHSIASALDAEGTDDELINEEALKTIYALIKLTKEKFNTWRGQLAEK
PKRRPLSLSIPTDVSLPDDYLNDDEVMKKTSLKSIKSSLNSIDSKSVSRNVALPRNFGMS
RRWLGPVKYPVTPCKNSATDVVADGNIVRRHFVYGTGDSDEEQIFL
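
Consistency check (illustrative sketch, not part of the original record): the CDS length of 3201 nt and the protein sequence above can be cross-checked with a few lines of Python. The sketch assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that ends in a single stop codon. The helper names `translate` and `expected_protein_length` are hypothetical and chosen only for this example. Only the first and last 21 nucleotides of the CDS are translated here, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this nuclear gene uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# CDS Length field from the record above.
print(expected_protein_length(3201))       # 1066

# First and last 21 nt of the nucleotide sequence above.
print(translate("ATGAAGCTTGGACAGCAGAAT"))  # MKLGQQN
print(translate("GAAGAGCAGATATTTCTGTGA"))  # EEQIFL
```

The translated fragments match the start (MKLGQQN) and end (EEQIFL) of the protein sequence listed above, and the 1066 residues implied by the CDS length agree with the length of that sequence.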