New model in OGS2.0 | DPOGS201328  |
---|---|
Genomic Position | scaffold1560:+ 32397-35188 |
See gene structure | |
CDS Length | 1380 |
Paired RNAseq reads   | 86 |
Single RNAseq reads   | 244 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010537 (2e-148) |
Best Drosophila hit   | CG9701 (2e-101) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (3e-92) |
Best NR hit (blastp)   | AGAP006424-PA [Anopheles gambiae str. PEST] (5e-126) |
Best NR hit (blastx)   | AGAP006424-PA [Anopheles gambiae str. PEST] (7e-127) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR001360 Glycoside hydrolase, family 1 IPR017853 Glycoside hydrolase, superfamily IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR018120 Glycoside hydrolase, family 1, active site |
Orthology group | ND |
Nucleotide sequence:
ATGAATGAATGCTGTGGAATAATTTTATTTGTCTTATCGTTGGCCATTGGAGTACAGTCT
TCAAATTTACAATGTTACGAAACAGAATTTCCAGAAGGGTTCTTATTCAGCGCATCATCG
TCTGCTTATCAGATAGAAGGTGCTTGGAACAAAGATGGTCGAACTGACAGCATTTGGGAT
GATTTAGTACACCAACGTCCCTACCTCGTCAGAGACAACGCGACCGGAGACATTGCTGAT
AATTCTTATTACATTTATAAAAGGGATATAGAAATTTTGAGAGAGATAGGACTACAAGTA
TATAGGTTCTCTATATCATGGAATAGAATTTTACCTACTGGTTTTCCCAACAAAATTAAT
TATGAAGGTGTTGCATATTACGATAATTTAATTAACGAATTATTGAAATACAACATTATC
CCGGTGGTTACCATTTACCACTTTGATTTGCCTCAAAGACTCCAGGAATTGGGTGGCTGG
GTTAATCCTTATGTCGTTGATTGGTTGGGAGACTACGCAAGGGTTGTTTTCAATTTATTC
GGTGATAGAGTTAAATATTGGATAACAGTGAATGAACCACAGCAGATTTGCTACTACGGC
TATGGTGATGTAATGAATGCACCAGCATTAAATTATAAAGGAATTGCTGAATACTATTGT
GCGAAAAATGTACTATTAGCACATGCAAGGGCATACCACATTTACGACGAAGAGTTTCGA
GACTTTCAGCAAGGCATTATATTTATAGCTATAAGTGCTGAATGGTACGAACCTGCTTCG
TCGGACAAGAATGATATTTTGGCCGCTTACGACTCGAACATGTTTACATATGGACAATAC
GCTCATCCAATTTTCTCTGAGACTGGTGATTTTCCCCAAAAGATGAAGGATCGCATTGCA
GAAAGAAGTGTCATGCAAGGTTTCGTTAGGTCCCGACTACCACAGCTTTCGGAACAGGAA
ATTGATTATATACGTGGCAGTTCTGACGTGTTCGGTTTAAATCACTATTCTACTTTCTAT
GCAAGCAGAAATCAATCTGTTTACACAAATTATGAATCCCCATCATTTTTTGACGATATG
GCAGCATACACGTTTCAGCCGCCTGAATGGAGATTGAGCCCAGATGCTGGTGTTGCGACT
GTTCCTTGGGGTTTCTACAAATTGCTGCAATTCATCAAGAGAGAGTACAATAATCCTCCC
GTTTTCGTAACCGAGAACGGTTTTGGCGATAATGGCGGTTTAAAAGATAACGATCGTGTT
ACACATTTGAAGGGTTACTTATGTGCTCTTCTGAAAGCTATCAATCACGGCTCAGATATT
ATAGGATATTCTGTTTGGAGTCTCCTGGATTCGTTTGAATGGATGTGTGGATACAAGTAA
Protein sequence:
MNECCGIILFVLSLAIGVQSSNLQCYETEFPEGFLFSASSSAYQIEGAWNKDGRTDSIWD
DLVHQRPYLVRDNATGDIADNSYYIYKRDIEILREIGLQVYRFSISWNRILPTGFPNKIN
YEGVAYYDNLINELLKYNIIPVVTIYHFDLPQRLQELGGWVNPYVVDWLGDYARVVFNLF
GDRVKYWITVNEPQQICYYGYGDVMNAPALNYKGIAEYYCAKNVLLAHARAYHIYDEEFR
DFQQGIIFIAISAEWYEPASSDKNDILAAYDSNMFTYGQYAHPIFSETGDFPQKMKDRIA
ERSVMQGFVRSRLPQLSEQEIDYIRGSSDVFGLNHYSTFYASRNQSVYTNYESPSFFDDM
AAYTFQPPEWRLSPDAGVATVPWGFYKLLQFIKREYNNPPVFVTENGFGDNGGLKDNDRV
THLKGYLCALLKAINHGSDIIGYSVWSLLDSFEWMCGYK