New model in OGS2.0 | DPOGS212076  |
---|---|
Genomic Position | scaffold5799:3605-8898 (- strand) |
CDS Length (nt) | 2022 |
Paired RNAseq reads   | 780 |
Single RNAseq reads   | 1948 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009640 (4e-164) |
Best Drosophila hit   | CG5830 (2e-31) |
Best Human hit | CTD small phosphatase-like protein 2 (1e-72) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL010078 [Aedes aegypti] (2e-102) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL010078 [Aedes aegypti] (3e-90) |
GeneOntology terms | GO:0016791 phosphatase activity; GO:0005575 cellular_component; GO:0008150 biological_process; GO:0016787 hydrolase activity; GO:0004721 phosphoprotein phosphatase activity |
InterPro families | IPR004274 NLI interacting factor; IPR023214 HAD-like domain; IPR011948 Dullard phosphatase domain, eukaryotic |
Orthology group | MCL16644 |
Nucleotide sequence:
ATGCGGTTAAGAAGTAGGAAAAGGGAGCGACCGCTCTCAGCGCGAAATTCACAAAGGCAA
ACGACTTTGAAATCAAAATTAAGAAGACCGCTCATCCAAGCGAAAAATACTTGGAGAAAA
ATATTGAAGGCGGACAATGGTATCTCGGTACCAAGTCCTTGCATCACTAGGTCGCAGAAC
AATCTAAATCTAAAATCCAGAACTCGAGCGAAAATTGCCAAAAGTATGGAAAAACCACTG
GAATTGCCATTAAAGAAAAACAAGCCGAAAGAAGAGGCTAAGAAGCCTATTTTCCAGACT
GTCGTTCGAAACTCCGCTAAAGATATGGTGACATCAACAACCAAGCCCAAAACACAGAAC
GATAAAGTAACAACAACCACAAAGACAAAGTTAAAGGATGTGCGAACAACCAAAAGTGCT
CAGCTCCGACAGAGCTCTTTGAACAGAGAGCTTCGTAAGACAGAGTCTATATCGTCAGCT
CTCAGCTCACCTCGCAAAACACGTAGGATGGATAAACCAGAGAAATCTGCCTCATTAAAC
AACTCACCCATTCGTAAAGTCCGTCAACTAGGACTGCCATCGTTGAAGCTGAACAATATA
GCTGTCAGTTCAACTAACGAGAGTCCCAAGAAGAGAATTAAACCCAAAGCTCCAACTGTT
CCGGAGCAGAATAACGATTTCAACGATGATAACGATGTAACCATGTATGAACCGCACACA
ACTACTTTCGATGACTTCCCGATTCCAAAGACCGTTGAATCGGATGTCGATGGGCAAGTC
GGATCATCAGAGCTCTGTCTTCCCAACGACTGTTTCAACGACTTGATAGCTATTGCTGAA
TGTGCCAGGATAATAAGCAACAATCTATCAACAGACGAAGATATTAACTTACTAGCAGAT
AAAGCGGCCAAAGTGATGACCACCGACAAAGATGATAAAGTCAAGAGATGTACGGACAAG
AAAATTAAACAACGGTCAAGCGGGGATTTAGAAATGTACCAGCCGACATCGACCACGGAT
GATTTGGATGTATGCACCGATTTCCTGAACCACGGAGATAAATATGTTCAGCAGCCGGAC
CTAGTATCTCTGTTGGAGCAGGAGTATGTCCGAGATGAGTGTGTCATATCATCGACCATG
GCTATGGAGAGTCTGGAGGCTTTGTCTGCCCGAGGACCCAGCAGCGGGTTCCTGGCCGAG
ATAACACACAGCCTGTCAGCCGGAGACGATACCTGGAGCTCAACGGATGTTGTTGTAGAA
GAAAACGCTTTGAACTGTCACGACACAACACCTGGGATTGAGGAGTCCACATCAGTTATA
TCATCATGCAACACCAGGGCCAGCGGTGAACAGATGACGTCATGGACGGACGCCTTCGAT
CCTTATCTGTTTATTAAACAGCTGCCACCGCTGGAAACCGTATCAGCTGGAGGACTCAGA
ACCAGGTGTCCAGCGCTACCCCTCAAAACTCGCACCAGTCCAGATTTTAGTCTTGTGTTG
GATCTAGACGAGACATTGGTTCACTGTTCTCTCCAGGAGTTACCGGATGCTAGCTTCCAC
TTCCCCGTACTATTCCAGGATTGCAGATATACGGTGTTTGTCCGTACTCGTCCCCACTTT
GCCGAGTTCCTCTCTAAAGTGTCACGTCTGTATGAAGTGATTCTTTTCACGGCTAGCAAG
AGGGTGTACGCTGATAGACTACTGAACCTCCTGGACCCGGCCAGACGATGGATTAAATAT
AGGTTGTTCCGAGAACACTGTCTACTAGTTAATGGTAACTATGTGAAGGATTTGTCGATA
CTGGGACGGGATCTCAGGAGAACTGTCATCGTGGACAATAGCCCACAGGCGTTCGGCTAC
CAGCTGGAGAATGGTATACCTATAGACAGCTGGTTCGTAGACCGCAGTGACAATGAACTG
CTCAAACTGCTGCCGTTCCTGGAACATCTGGCCACGAAAGACGATGTCCGGCCATACATC
AGGGACAAGTACAAGCTGTTCAGTTACTTGCCACCGGATTAA
Protein sequence:
MRLRSRKRERPLSARNSQRQTTLKSKLRRPLIQAKNTWRKILKADNGISVPSPCITRSQN
NLNLKSRTRAKIAKSMEKPLELPLKKNKPKEEAKKPIFQTVVRNSAKDMVTSTTKPKTQN
DKVTTTTKTKLKDVRTTKSAQLRQSSLNRELRKTESISSALSSPRKTRRMDKPEKSASLN
NSPIRKVRQLGLPSLKLNNIAVSSTNESPKKRIKPKAPTVPEQNNDFNDDNDVTMYEPHT
TTFDDFPIPKTVESDVDGQVGSSELCLPNDCFNDLIAIAECARIISNNLSTDEDINLLAD
KAAKVMTTDKDDKVKRCTDKKIKQRSSGDLEMYQPTSTTDDLDVCTDFLNHGDKYVQQPD
LVSLLEQEYVRDECVISSTMAMESLEALSARGPSSGFLAEITHSLSAGDDTWSSTDVVVE
ENALNCHDTTPGIEESTSVISSCNTRASGEQMTSWTDAFDPYLFIKQLPPLETVSAGGLR
TRCPALPLKTRTSPDFSLVLDLDETLVHCSLQELPDASFHFPVLFQDCRYTVFVRTRPHF
AEFLSKVSRLYEVILFTASKRVYADRLLNLLDPARRWIKYRLFREHCLLVNGNYVKDLSI
LGRDLRRTVIVDNSPQAFGYQLENGIPIDSWFVDRSDNELLKLLPFLEHLATKDDVRPYI
RDKYKLFSYLPPD
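
The nucleotide and protein sequences above can be cross-checked against each other: a 2022 nt CDS holds 674 codons, which is 673 residues plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this gene model. The names `CODON_TABLE` and `translate` are illustrative and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, dropping one trailing stop codon.

    Hypothetical helper: whitespace and line breaks are ignored, and
    ambiguous bases such as N raise KeyError rather than being guessed.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First four and last four codons of the record above.
print(translate("ATGCGGTTAAGA"))  # MRLR
print(translate("CCACCGGATTAA"))  # PPD
```

Passing the full nucleotide sequence to `translate` and comparing the result with the listed protein sequence would confirm that the two are consistent.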