New model in OGS2.0 | DPOGS215795  |
---|---|
Genomic Position | scaffold300:+ 65672-70545 |
See gene structure | |
CDS Length | 1506 |
Paired RNAseq reads   | 602 |
Single RNAseq reads   | 1732 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003512 (0.0) |
Best Drosophila hit   | CG9701 (9e-128) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (2e-113) |
Best NR hit (blastp)   | glucosidase [Bombyx mori] (0.0) |
Best NR hit (blastx)   | glucosidase [Bombyx mori] (0.0) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR001360 Glycoside hydrolase, family 1 IPR017853 Glycoside hydrolase, superfamily IPR018120 Glycoside hydrolase, family 1, active site IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | MCL10077 |
Nucleotide sequence:
ATGGCGATCAAAGAGGCAACTTTTATCGCCCTCCTGGCGTTAGTACATTCGGGACATGCG
CAGTACACTAAGTTTCCGGAAGGATTCACCTTTGGAGTAGCGACTGCTGCTCATCAAATA
GAAGGGTCATGGAACGTCAGTGGAAAAACTGAAAACGTATGGGACCATCTGTCTCACAAT
CGGCCATGGATGATAGCTGACGGAACCAATGGCGATGTAGCCTGTGACTCCTACAACCGT
TACCAAGAAGATGTAGATGAGCTGGCATACATGGGTGTAGATTTCTACAGACTGTCTCTG
TCTTGGGCCAGAATTCTGCCAACTGGACGCATGGATGTCATAAATCCTGATGGTATTAGA
TACTACAACGCACTCTTTGATGCTTTAGCCGAAAAAAAAATTGAACCACTGGTTACTCTG
TTCCATTGGGATTTACCACAATCACTCCAAGACCTAGGTGGATGGGCGAATCCGAAAATG
ATTGATTACTTCCGCGATTACGCAGACGTATGCTTCAGAGAGTTCGGTGATAAAGTCAAA
TCCTGGATTACACTTAATGAGCCCTATGAAATTTGTGAAGATGCTTATGGGGATGACAAG
AAAGCTCCTGCCATCGATAGCCACGGTGTAGGAAACTACTTGTGCAGCGACACTCTGTTG
AAAGCTCACGCCGAAGTTTACCATCTCTACAACGACACCTACAGACCTATACAAAACGGA
AGAATAATGATTTCAATAAATTCAATTTGGTACGAACCAAGTGATCCCGAAAACGCGGAA
CAAGTTGCTCTGGCTGAAGTTGCTAACCAATTTAAATTCGGGTGGTTCGCAAATCCTATT
TTCACCGAAGAAGGTGGCTATCCCGTCGTAATGGTAGAAAATATTGCTGAGCAAAGTAAA
GCTGAAGGATTAAATAAACCTAGATTAGAACAATTCGATGAGTACTGGATTGAAAGAATT
AAGGGTACATCAGACTTCCTTGGTATCAATCACTACACCACGCATTTGATAACCGGCCCG
GGAGTGGACTCTCTCGCCAAACACCCGTCTTGGCTAAAAGATATTGGAGCGGTAGTAAGT
TTGGACGTGGGTAGAGATTCAGCCTCAGAGTGGCTAAGAGTAGTGCCAACGGGTTTTGCA
AACTTATTACGCTGGTGCAAGAGTACGTACAATGATGTTCCAATTTACATCACCGAGAAC
GGATTTTCTGATCGTGGCGCCATAGAAGATTACGACCGTATTAGATATTACAACGACTAC
CTCTCCGAAATTTTGAATGTCATTTATGACGATGATGTCAAAGTCCTTGGTTACACTGCA
TGGACCCTAATGGACAACTTCGAATGGCGAGCTGGATTTTCTGAACGCTTCGGTCTTTAC
CACGTGGACATAACGGATCCGAATCTCCCAAGAACACCGAAACTCTCTGCGGAATACTAC
AAGCAATTATGTGAAACGAAGGAAATACCTCAAGATGAACGGTTCAAGGATCCAGCTGTA
AGTTGA
Protein sequence:
MAIKEATFIALLALVHSGHAQYTKFPEGFTFGVATAAHQIEGSWNVSGKTENVWDHLSHN
RPWMIADGTNGDVACDSYNRYQEDVDELAYMGVDFYRLSLSWARILPTGRMDVINPDGIR
YYNALFDALAEKKIEPLVTLFHWDLPQSLQDLGGWANPKMIDYFRDYADVCFREFGDKVK
SWITLNEPYEICEDAYGDDKKAPAIDSHGVGNYLCSDTLLKAHAEVYHLYNDTYRPIQNG
RIMISINSIWYEPSDPENAEQVALAEVANQFKFGWFANPIFTEEGGYPVVMVENIAEQSK
AEGLNKPRLEQFDEYWIERIKGTSDFLGINHYTTHLITGPGVDSLAKHPSWLKDIGAVVS
LDVGRDSASEWLRVVPTGFANLLRWCKSTYNDVPIYITENGFSDRGAIEDYDRIRYYNDY
LSEILNVIYDDDVKVLGYTAWTLMDNFEWRAGFSERFGLYHVDITDPNLPRTPKLSAEYY
KQLCETKEIPQDERFKDPAVS