New model in OGS2.0 | DPOGS210283  |
---|---|
Genomic Position | scaffold2053:- 25938-28750 |
CDS Length | 1449 |
Paired RNAseq reads   | 63 |
Single RNAseq reads   | 145 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010536 (3e-129) |
Best Drosophila hit   | CG9701 (3e-103) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (2e-82) |
Best NR hit (blastp)   | glycoside hydrolase [Culex quinquefasciatus] (5e-127) |
Best NR hit (blastx)   | glycoside hydrolase [Culex quinquefasciatus] (1e-118) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR001360 Glycoside hydrolase, family 1 IPR017853 Glycoside hydrolase, superfamily IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | ND |
Nucleotide sequence:
ATGAGGAAACTACCAAACGGGTTAAAGATAGGGGTTGCTACGGCATCTTATCAAATCGAA
GGGGGTTGGAATGCTGGCGATAAAACACCGAGTATTTGGGACACTCATTGTCACAAAGAA
CCATGCCCCGTAAAAGATAATACCAGTGGAGATGACACTTGCGAATCATTCAAATACTAC
AAACGTGATCTGGAGATGATAAAATTTTTGGGATTCCACTTCTACAGATTTTCCATATCA
TGGCCAAGATTACTTCCTGACGGATTTACAAACAGAATCAGCGAGACCGGCCGCGAATAC
TACAATAATCTGATCAACGGATTACTTGAAAATAATATTGAACCGATAATTACTTTGTAT
CATTGGGATCTGCCGCAGACACTTCAAGAACTCGGAGGTTGGAGCAATCCTCTCATAGTG
GACTGGTTCGGTGACTACGCCGCTGTCGCATACCAACTTTTTGGAGACAGAGTTAAAACC
TGGATAACGATCAATGAACCGAAACAAATCGGTGTTTTCGGTTACGGAATGACCAGAATG
GCTCCAGCCCTAAATATATCCGGGATAGCAGATTATATAGCTGCTAAAAATATGGTGTTA
GCACATGCCCGAGCCTGGCATATATACGATAAACAATTTAGATCTACCCAAAAAGGAACA
TGCGGCATCACCATAGCAACCGATTTTCGTGTCGGACTATCTGACTCTCGTGATGATGTC
GAAGCTGGTCTCGACGCTATGGATTTTGAAGTAGGATTATACAGCCATCCTATATTCACA
TCAAAGGGTGGTTTTCCTGAACGAGTTATCCAAAGAGTAGCAGAAAAAAGTAAAGAACAA
GGTTACACTAGAAGTCGACTGCCAGATTTTAGTGACGAAGAAATTGAGTACGCTAAAGGA
ACCAGTGATTTTTATGGCTTCAATCATTATTCGACGAAATTTTTCACAAGGGACACTTAC
ACGCCTGGAAAACATCCAATACCCTCGTATGATGATGATATTGGTGCAGATTTTACTTAC
TTGGACTATGAAAAAGGTGCAGTGCCTCATGTCACAGTAATTCCACACGGAATCAGAAAA
GCCTTGAAATGGGTGAAAGAAAACTGTAACAATCCACCAATAATGATAACCGAGAATGGT
TTCGCCACTTTTGGCGGTTTGGAAGATATGGATAGAATATTCTATTTTAGGAAATATCTT
TACTCGATTTTGGACGCCATTGAAATTGACGGCTGCAATGTTACGTCATATACAGTGTGG
AGTTTAATGGACAATTTTGAATGGGATAGTGGATTAAGTGTTAAATTTGGACTATTCGAA
GTCGATTTTGAGGATGAAAAGAAGACCAGAACGGCAAGATTGTCGGCTTTGTGGTTTAAA
AGACTCATAAAGACAAAATGTCTAGATCTGGAACACATACCGGAAATGGAAGAGAAAATC
CACTTTTAA
Protein sequence:
MRKLPNGLKIGVATASYQIEGGWNAGDKTPSIWDTHCHKEPCPVKDNTSGDDTCESFKYY
KRDLEMIKFLGFHFYRFSISWPRLLPDGFTNRISETGREYYNNLINGLLENNIEPIITLY
HWDLPQTLQELGGWSNPLIVDWFGDYAAVAYQLFGDRVKTWITINEPKQIGVFGYGMTRM
APALNISGIADYIAAKNMVLAHARAWHIYDKQFRSTQKGTCGITIATDFRVGLSDSRDDV
EAGLDAMDFEVGLYSHPIFTSKGGFPERVIQRVAEKSKEQGYTRSRLPDFSDEEIEYAKG
TSDFYGFNHYSTKFFTRDTYTPGKHPIPSYDDDIGADFTYLDYEKGAVPHVTVIPHGIRK
ALKWVKENCNNPPIMITENGFATFGGLEDMDRIFYFRKYLYSILDAIEIDGCNVTSYTVW
SLMDNFEWDSGLSVKFGLFEVDFEDEKKTRTARLSALWFKRLIKTKCLDLEHIPEMEEKI
HF
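
As a consistency check on the record above, the sketch below translates the first and last lines of the nucleotide sequence and compares them with the ends of the protein sequence. It assumes the standard genetic code (NCBI translation table 1); the helper name `translate` and the variable names are hypothetical and not part of the record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAGGAAACTACCAAACGGGTTAAAGATAGGGGTTGCTACGGCATCTTATCAAATCGAA"
last_line = "CACTTTTAA"

print(translate(first_line))  # MRKLPNGLKIGVATASYQIE
print(translate(last_line))   # HF*

# A CDS of 1449 nt holds 483 codons; dropping the stop codon leaves 482 residues.
cds_length = 1449
print(cds_length // 3 - 1)    # 482
```

The translated ends agree with the start (`MRKLPNGLKIGVATASYQIE`) and end (`HF`) of the protein sequence, and the 482-residue count matches the protein listing of eight 60-residue lines plus two residues.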