DPGLEAN08644 in OGS1.0

New model in OGS2.0DPOGS212740 
Genomic Positionscaffold3:+ 680-13544
See gene structure
CDS Length3204
Paired RNAseq reads  10173
Single RNAseq reads  25324
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013250 (0.0)
Best Drosophila hit  CG3632, isoform F (3e-137)
Best Human hitmyotubularin-related protein 3 isoform c (2e-139)
Best NR hit (blastp)  myotubularin [Culex quinquefasciatus] (9e-172)
Best NR hit (blastx)  myotubularin [Culex quinquefasciatus] (8e-162)
GeneOntology terms







  
GO:0008270 zinc ion binding
GO:0004722 protein serine/threonine phosphatase activity
GO:0016787 hydrolase activity
GO:0005624 membrane fraction
GO:0006470 protein amino acid dephosphorylation
GO:0004725 protein tyrosine phosphatase activity
GO:0046872 metal ion binding
GO:0005737 cytoplasm
GO:0016020 membrane
InterPro families





  
IPR000387 Protein-tyrosine/Dual-specificity phosphatase
IPR017906 Myotubularin phosphatase domain
IPR011011 Zinc finger, FYVE/PHD-type
IPR010569 Myotubularin-related
IPR016130 Protein-tyrosine phosphatase, active site
IPR003595 Protein-tyrosine phosphatase, catalytic
IPR013083 Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL11704

Nucleotide sequence:

ATGAGTGAGCGTGATCGTTACCGCGCCCCTCCCCAGCCGGAGCCCCCGCCGCACCGCGTC
CGGGCCTCGGACCTCTATCCGAGGACCAGGCCTCACCTGGACGAGCCCGGCCTGGAGGCT
GGCTTCGCTACCATATGTGGCGAGTTCGTACAGTTCATCGGGCGTACGTCTGATGGCGGG
ACTATCGCCATGTCCAACTACCGCCTTCACCTCCAGCCTCGGCGACGCACCGGTAGCCTC
GGGTCCTCCGTGCCGCTTCGCCTGATCGAGGCGCTGGAGATCAGAGACCTGCTGTGTCTC
ATAATACTCTGCAAGCACGGACGACAACTCAAATGTTCGTTCAACACTGGCGATCAGTGT
GTGGAGTGGTGGAGGAGGCTGAACACGGCGCTGGTGCCAGTCAGTACATTACAGGAGACG
TTCGCTGCGGCCTACGCTGCCTGGGCCAAGGAACAGCCTAACAACTCCGTCCACAGGGCG
CTCATGAGGGCCTCCCAGCCGCCTCAGAGACACTGGTTCAAACCTGAGTTGGACAGGCTC
GGATTCACAATGAAGGGTGCGTGGCGCGTGTCGGCCGCCAACGTGGAGTACAAGCTGTGC
GGATCCTACCCCCCGCTGCTGGTGGTACCGGCCTCTGTAGGGGACGATGACCTTGAATCC
GTCGCGCGTTTCCGCGCCATGCGTCGTATCCCGGCTGTGGTGTGGCGTCACCGCGTGTCC
GGGGGTATCATCGCGAGGTCCAGTCAGCCCGAGGTCGGCTGGCTGGGCTGGCGCTCCGCC
GAGGACGAACGACTGCTGGCCGCCTTCGTGCACGCCTGCAACCAGGACAGGCCCATACCC
AATAAGCAACTGAAGCTGCTGATAGTGGACGCACGCTCGTACGCGTCCGCGGTGACGAAC
CGTGCTCGTGGTGGGGGGTGCGAGTGTGCTCAGTACTACCCCGCGGCTGATATACAGTTC
ATGTCGCTGCCCAACATCCATCACGTGAGGAGGAGCTTCCAGCAGCTGAGAGCTCTGGCT
GCTGAACCACCAGATCAACCCAATTGGCACAGTTCCCTAGAGCGTACACTATGGCCTCAG
TACGTGTCGGGGGTGCTGCGAGCGGCGGCGGCCGTGTCCCGGGCGGCCGCGGCCGGTCGA
CCTGTGCTCGTGCACTGCTCCGACGGCTGGGACCGTACACCGCAGCTGGTCGCCGCCGCC
CAGATCATACTCGACCCTCACTACAGGACCATAGAGGGTTTCCGCACGTTGATCGAGCGC
GAGTGGTTGGACTTCGGTCACAAGTTCGGTGATCGCTGCGGCCACTCGTTCGGTGGCGAG
GATCCCAACGAGCGCTCGCCCGTGTTCCTCCAGTGGCTGCATTGCATCTACCAGCTGATG
CTGCAGTATCCCTGCAGCTTCGAGTTCAACGAGGCCTACCTGATAAAGCTGGCTGTTCAC
GTGCATTCGTGTATGTTCGGTACGTTCCTGTGTAACTCGAGCCGTGAACGTGTCGAGTAT
CACACGGCTCACACCGCTCAAGTGTGGCGCCTGTTGTCCTCCCCCGCCTACAGGAACCAT
CTATACACACCGCACGAGGATCAGGTGATATGGCCGGAGTGTAGCGTGCGGTCCATGCAG
GTGTGGTGGGGAGTGCTGCTGGGAGAGAGAGAGCGGGAGCCGCCGCGACACAACACACAC
GTCGACAACAACACAACAAACAACACAAACAACATACACAACGGTCTAATGACGAAGACG
AGATCCTGCGATAATCTCCACGGAGGCGAGAAGAAGACGACCCAACGCCGGTGTAGCGAC
CCCAGTCTGGCGCCCGACATCATGAAATTATCGTTACTGAATGGAAGCGAAATACCAGAC
GCACAAACAGACACCGACACGGATCAGGTGGACGGTCTTCATCCTGATCACTTTGACAAT
CATCTCCGAGACATAACAAGCAACTCCTCGTCGCTGGAGAGGGAGCTGGTGTCGATGCCG
CCCGTCACCCTGGACGCCAAGGAACAGGACAACCTCACCAACAGCACAGACAACGACGAG
CCGAACGAAGCTCTCACCATCACCACAATCACCACCATCGACACCATCAACCATGACGAC
CTAAAAATTAGCTCCTCCAACCACGATATTATGAACCGAGACGCTATGGAGCTCAGCACC
TTGAACAACGACCCTGTCAACCACACCCCGGCGAGCCCCTGCGCCCGACTGGAGGTCGAC
TCGCCCGAAGAGCCCGCTGTATTTGTTTGTGAAACCTACACCGACGTCATCGGCATGGCG
GAGGCCGCTCGCGAGTCCGCGCGCACCCGGAACATAAGTATAACGTGGCGGTCGATATCG
GAGTCGAGCAACCAGTCCTCGACCGGCTTCGACATCGGAGACAACTCGCCGCAGACGCGC
CTCGAGCCCGCGACTCGACTCGACCCGCAGGAGATCAACAACCACAACTCCACTGACGGG
GATGTCGTCAACCATAACATTGTGACCAACATGACGAACCACAACAGGGGAGACGGACTG
GAGGGGGTCAACGGGGTCGCTGACGTAACTAACAATTTCCTGGAGGTAGATCTTAATGCA
TGCGCGTCACCGTGTAGCTCGTCCGACTCGTGCTGCGAGGCGGTGCGCGGCTCCAGCCAG
CTCACTCTGTGCCCGGCCACGCCGCCTCACACACACGGCGCGTGTTGCGCTTGCTCCAGC
GAGGCGGAAGAGGCGGACGAGACGTTGGAGGTGACGACGGCCGCTCGGACGGGCTGGTCG
TGTTCGTGCGGCGGAGCGAGAGCTGTGGAAGGATGCAGGGACTCGCTGGAGGGGGTGGAC
GGCTTGGACGGCCTACCTCTAGCCAGCGACCCCGTCCAGGCCAGGCTACATCAAATCATA
CTACAACATAAGAAAATGGTGGAAGATTTAAACGGACAGTTGCGGGAGGCGCGCGAGGCT
TTGAGACGTGCGTCGGGGGTTCGGACCCCCGCGGCCGTGACACGACCCCCACACGCCCAG
AGCCCTACCGGCGTGTCATCCTCGTGTGTGACGTGTGTGGGTCCCGGAGCTCCGGGCGGG
TCCAGCTCGGGCAGCTCCAGCGCCTCGGAGTTGGAAGTGTGTGAGGAGGCGCGGGTCCGT
TGGTTGCCGGACGCCGCCGCTCCTCGTTGCCAACACTGCCGGAACTCCTTCTGGCTGGCG
AGGCGTCGACACCACTGCCGGTGA

Protein sequence:

MSERDRYRAPPQPEPPPHRVRASDLYPRTRPHLDEPGLEAGFATICGEFVQFIGRTSDGG
TIAMSNYRLHLQPRRRTGSLGSSVPLRLIEALEIRDLLCLIILCKHGRQLKCSFNTGDQC
VEWWRRLNTALVPVSTLQETFAAAYAAWAKEQPNNSVHRALMRASQPPQRHWFKPELDRL
GFTMKGAWRVSAANVEYKLCGSYPPLLVVPASVGDDDLESVARFRAMRRIPAVVWRHRVS
GGIIARSSQPEVGWLGWRSAEDERLLAAFVHACNQDRPIPNKQLKLLIVDARSYASAVTN
RARGGGCECAQYYPAADIQFMSLPNIHHVRRSFQQLRALAAEPPDQPNWHSSLERTLWPQ
YVSGVLRAAAAVSRAAAAGRPVLVHCSDGWDRTPQLVAAAQIILDPHYRTIEGFRTLIER
EWLDFGHKFGDRCGHSFGGEDPNERSPVFLQWLHCIYQLMLQYPCSFEFNEAYLIKLAVH
VHSCMFGTFLCNSSRERVEYHTAHTAQVWRLLSSPAYRNHLYTPHEDQVIWPECSVRSMQ
VWWGVLLGEREREPPRHNTHVDNNTTNNTNNIHNGLMTKTRSCDNLHGGEKKTTQRRCSD
PSLAPDIMKLSLLNGSEIPDAQTDTDTDQVDGLHPDHFDNHLRDITSNSSSLERELVSMP
PVTLDAKEQDNLTNSTDNDEPNEALTITTITTIDTINHDDLKISSSNHDIMNRDAMELST
LNNDPVNHTPASPCARLEVDSPEEPAVFVCETYTDVIGMAEAARESARTRNISITWRSIS
ESSNQSSTGFDIGDNSPQTRLEPATRLDPQEINNHNSTDGDVVNHNIVTNMTNHNRGDGL
EGVNGVADVTNNFLEVDLNACASPCSSSDSCCEAVRGSSQLTLCPATPPHTHGACCACSS
EAEEADETLEVTTAARTGWSCSCGGARAVEGCRDSLEGVDGLDGLPLASDPVQARLHQII
LQHKKMVEDLNGQLREAREALRRASGVRTPAAVTRPPHAQSPTGVSSSCVTCVGPGAPGG
SSSGSSSASELEVCEEARVRWLPDAAAPRCQHCRNSFWLARRRHHCR