DPGLEAN16951 in OGS1.0

New model in OGS2.0  DPOGS212076
Genomic Position  scaffold5799:- 3605-8898
CDS Length  2022
Paired RNAseq reads  780
Single RNAseq reads  1948
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009640 (4e-164)
Best Drosophila hit  CG5830 (2e-31)
Best Human hit  CTD small phosphatase-like protein 2 (1e-72)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL010078 [Aedes aegypti] (2e-102)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL010078 [Aedes aegypti] (3e-90)
GeneOntology terms
GO:0016791 phosphatase activity
GO:0005575 cellular_component
GO:0008150 biological_process
GO:0016787 hydrolase activity
GO:0004721 phosphoprotein phosphatase activity
InterPro families
IPR004274 NLI interacting factor
IPR023214 HAD-like domain
IPR011948 Dullard phosphatase domain, eukaryotic
Orthology group  MCL16644
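
The Genomic Position and CDS Length entries above can be compared with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the character after the colon is the strand, neither of which this record states. The names Locus and parse_locus are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a position string such as 'scaffold5799:- 3605-8898'."""
    match = re.match(r"^(\S+?):([+-])\s*(\d+)-(\d+)$", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return Locus(scaffold, strand, int(start), int(end))


locus = parse_locus("scaffold5799:- 3605-8898")
cds_length = 2022  # value of the CDS Length entry

# Bases inside the genomic span that are not part of the CDS
# (introns and any untranslated regions, under the assumptions above).
non_coding = locus.span - cds_length
print(locus.strand, locus.span, non_coding)
```

Under these assumptions the model spans 5294 bp on the minus strand, of which 2022 bp are coding, which is consistent with a multi-exon gene structure.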

Nucleotide sequence:

ATGCGGTTAAGAAGTAGGAAAAGGGAGCGACCGCTCTCAGCGCGAAATTCACAAAGGCAA
ACGACTTTGAAATCAAAATTAAGAAGACCGCTCATCCAAGCGAAAAATACTTGGAGAAAA
ATATTGAAGGCGGACAATGGTATCTCGGTACCAAGTCCTTGCATCACTAGGTCGCAGAAC
AATCTAAATCTAAAATCCAGAACTCGAGCGAAAATTGCCAAAAGTATGGAAAAACCACTG
GAATTGCCATTAAAGAAAAACAAGCCGAAAGAAGAGGCTAAGAAGCCTATTTTCCAGACT
GTCGTTCGAAACTCCGCTAAAGATATGGTGACATCAACAACCAAGCCCAAAACACAGAAC
GATAAAGTAACAACAACCACAAAGACAAAGTTAAAGGATGTGCGAACAACCAAAAGTGCT
CAGCTCCGACAGAGCTCTTTGAACAGAGAGCTTCGTAAGACAGAGTCTATATCGTCAGCT
CTCAGCTCACCTCGCAAAACACGTAGGATGGATAAACCAGAGAAATCTGCCTCATTAAAC
AACTCACCCATTCGTAAAGTCCGTCAACTAGGACTGCCATCGTTGAAGCTGAACAATATA
GCTGTCAGTTCAACTAACGAGAGTCCCAAGAAGAGAATTAAACCCAAAGCTCCAACTGTT
CCGGAGCAGAATAACGATTTCAACGATGATAACGATGTAACCATGTATGAACCGCACACA
ACTACTTTCGATGACTTCCCGATTCCAAAGACCGTTGAATCGGATGTCGATGGGCAAGTC
GGATCATCAGAGCTCTGTCTTCCCAACGACTGTTTCAACGACTTGATAGCTATTGCTGAA
TGTGCCAGGATAATAAGCAACAATCTATCAACAGACGAAGATATTAACTTACTAGCAGAT
AAAGCGGCCAAAGTGATGACCACCGACAAAGATGATAAAGTCAAGAGATGTACGGACAAG
AAAATTAAACAACGGTCAAGCGGGGATTTAGAAATGTACCAGCCGACATCGACCACGGAT
GATTTGGATGTATGCACCGATTTCCTGAACCACGGAGATAAATATGTTCAGCAGCCGGAC
CTAGTATCTCTGTTGGAGCAGGAGTATGTCCGAGATGAGTGTGTCATATCATCGACCATG
GCTATGGAGAGTCTGGAGGCTTTGTCTGCCCGAGGACCCAGCAGCGGGTTCCTGGCCGAG
ATAACACACAGCCTGTCAGCCGGAGACGATACCTGGAGCTCAACGGATGTTGTTGTAGAA
GAAAACGCTTTGAACTGTCACGACACAACACCTGGGATTGAGGAGTCCACATCAGTTATA
TCATCATGCAACACCAGGGCCAGCGGTGAACAGATGACGTCATGGACGGACGCCTTCGAT
CCTTATCTGTTTATTAAACAGCTGCCACCGCTGGAAACCGTATCAGCTGGAGGACTCAGA
ACCAGGTGTCCAGCGCTACCCCTCAAAACTCGCACCAGTCCAGATTTTAGTCTTGTGTTG
GATCTAGACGAGACATTGGTTCACTGTTCTCTCCAGGAGTTACCGGATGCTAGCTTCCAC
TTCCCCGTACTATTCCAGGATTGCAGATATACGGTGTTTGTCCGTACTCGTCCCCACTTT
GCCGAGTTCCTCTCTAAAGTGTCACGTCTGTATGAAGTGATTCTTTTCACGGCTAGCAAG
AGGGTGTACGCTGATAGACTACTGAACCTCCTGGACCCGGCCAGACGATGGATTAAATAT
AGGTTGTTCCGAGAACACTGTCTACTAGTTAATGGTAACTATGTGAAGGATTTGTCGATA
CTGGGACGGGATCTCAGGAGAACTGTCATCGTGGACAATAGCCCACAGGCGTTCGGCTAC
CAGCTGGAGAATGGTATACCTATAGACAGCTGGTTCGTAGACCGCAGTGACAATGAACTG
CTCAAACTGCTGCCGTTCCTGGAACATCTGGCCACGAAAGACGATGTCCGGCCATACATC
AGGGACAAGTACAAGCTGTTCAGTTACTTGCCACCGGATTAA

Protein sequence:

MRLRSRKRERPLSARNSQRQTTLKSKLRRPLIQAKNTWRKILKADNGISVPSPCITRSQN
NLNLKSRTRAKIAKSMEKPLELPLKKNKPKEEAKKPIFQTVVRNSAKDMVTSTTKPKTQN
DKVTTTTKTKLKDVRTTKSAQLRQSSLNRELRKTESISSALSSPRKTRRMDKPEKSASLN
NSPIRKVRQLGLPSLKLNNIAVSSTNESPKKRIKPKAPTVPEQNNDFNDDNDVTMYEPHT
TTFDDFPIPKTVESDVDGQVGSSELCLPNDCFNDLIAIAECARIISNNLSTDEDINLLAD
KAAKVMTTDKDDKVKRCTDKKIKQRSSGDLEMYQPTSTTDDLDVCTDFLNHGDKYVQQPD
LVSLLEQEYVRDECVISSTMAMESLEALSARGPSSGFLAEITHSLSAGDDTWSSTDVVVE
ENALNCHDTTPGIEESTSVISSCNTRASGEQMTSWTDAFDPYLFIKQLPPLETVSAGGLR
TRCPALPLKTRTSPDFSLVLDLDETLVHCSLQELPDASFHFPVLFQDCRYTVFVRTRPHF
AEFLSKVSRLYEVILFTASKRVYADRLLNLLDPARRWIKYRLFREHCLLVNGNYVKDLSI
LGRDLRRTVIVDNSPQAFGYQLENGIPIDSWFVDRSDNELLKLLPFLEHLATKDDVRPYI
RDKYKLFSYLPPD
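
The nucleotide and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; translate and CODON_TABLE are names introduced here for illustration. Only the first and last stretches of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0 and drop a single terminal stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 24 nt and last 12 nt of the nucleotide sequence in this record.
head = translate("ATGCGGTTAAGAAGTAGGAAAAGG")
tail = translate("CCACCGGATTAA")
print(head, tail)
```

The two fragments translate to MRLRSRKR and PPD, matching the start and end of the protein sequence. The lengths also agree if each full sequence line holds 60 characters: a 673-residue protein plus one stop codon gives 3 × (673 + 1) = 2022 nt, the stated CDS length.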