New model in OGS2.0 | DPOGS201334  |
---|---|
Genomic Position | scaffold3930:+ 27768-35056 |
See gene structure | |
CDS Length | 1533 |
Paired RNAseq reads   | 12 |
Single RNAseq reads   | 33 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010536 (5e-172) |
Best Drosophila hit   | CG9701 (9e-137) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (2e-104) |
Best NR hit (blastp)   | AGAP006424-PA [Anopheles gambiae str. PEST] (2e-159) |
Best NR hit (blastx)   | AGAP006424-PA [Anopheles gambiae str. PEST] (2e-160) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR018120 Glycoside hydrolase, family 1, active site IPR001360 Glycoside hydrolase, family 1 IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR017853 Glycoside hydrolase, superfamily |
Orthology group | MCL10077 |
Nucleotide sequence:
ATGTTCCGTGCTGTAAGATTAATTTTCCTGGCTTTCGTTTTGACTGTGCTTGTCGGTAGC
AATGAAATCTCTCGACATGAAGCCAGAAAAATACCTGACGACTTACTTTTTGGAGCTGCT
ACGGCATCCTACCAAATAGAAGGAGCTTGGAATGAAGATGGTAAATCTGAAAATATTTGG
GATCGATTGACACACCTAAAACCTTGTTATATACACAACTGTGACACGGGAGATATCGCT
GCTGATTCCTATCACCAATATAAGCGCGATGTGGAAATGATGCGGGAACTAGGTCTCGAC
TTTTATAGGTTCTCTCTCTCCTGGACGAGAATATTACCAACGAGTTTTCCAGATCAAATT
AATGAAAAAGGAGTACAATATTACAATAATTTGATAAATGAGATGCTCAAATACAACATA
CAACCCATGGTGACTCTTTATCATTGGGATTTACCTCAGAAGCTGCAAGATCTGGGAGGA
TGGGCAAATCCCCATATAGTTGATTGGTTTACCGACTATGCCAAAGTAGTTTTCGAGTTA
TTTGGAGACAGGGTTAAGTACTGGATAACTGTCAATGAACCTAAACATGTTTGTCATCAA
ACAACCCCACAACTATCACTAGATCCATCTTATAGTGTTTCTTCACATTTTCATTACATG
TGTGCCAAAAATCTGCTAGTAGCACATGCTAACGTCTACCATTTGTATAATAATAAATTT
CGTGAAGTCCAAGGTGGTCAAGTCGGTATAACAATAAGTTCCGCGTGGGCTGAACCTGAG
TCTGAAAATGACATGAAAGCTGCTGAAGATGCCATGCAATTTGAGATGGGTCTTTTTGCA
AATCCAATATTTTCGGAGTCTGGAGATTATCCATCAGTCATGAAAGAAAGAATAGCAGCA
AAGAGTAAGGAACAAGGATTTCCGAGATCACGATTACCACAATTCACTCCGGAGGAAGTA
GATTTAATAAAAGGAAGCTCAGACTTCATTGGATTAAATCATTATACTACTAACATTGTT
TATAGAAACGAATCTGTCTATGGAAGTTACAGTTCTCCATCACTTGAAGATGATGTGGAA
GTTTTAAGTTATCAAGATAGTTCATGGGACTCAGGTGCTTCATCGTGGTTGAAGCGTGTA
CCCTGGGGATTTTATAAATTATTAACAAAAATACGAGAGGACTACAACAACCCACCAGTT
TTCATCACTGAAAATGGATTCTCATCTCGGGGTGGTCTAATTGACGACGACCGCGTAAAG
TATTACAGAACATACATTGATGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGA
GTTTATACAGCGTGGAGTTTGATGGACAATTTCGAATGGATGGAGGGATACAGCGAACGT
TTTGGCCTGTACGAGGTGGACTACGAGAGTCCTGAACGCACCCGCACTCCTCGCAAGTCT
GCTTACGTGTACAAAGAGATGCTGCGCACACGCACACTGGACTATCATTATGAACCTGAC
ATGAGCTTGGGAATGAATGTCGATGAAAACTAA
Protein sequence:
MFRAVRLIFLAFVLTVLVGSNEISRHEARKIPDDLLFGAATASYQIEGAWNEDGKSENIW
DRLTHLKPCYIHNCDTGDIAADSYHQYKRDVEMMRELGLDFYRFSLSWTRILPTSFPDQI
NEKGVQYYNNLINEMLKYNIQPMVTLYHWDLPQKLQDLGGWANPHIVDWFTDYAKVVFEL
FGDRVKYWITVNEPKHVCHQTTPQLSLDPSYSVSSHFHYMCAKNLLVAHANVYHLYNNKF
REVQGGQVGITISSAWAEPESENDMKAAEDAMQFEMGLFANPIFSESGDYPSVMKERIAA
KSKEQGFPRSRLPQFTPEEVDLIKGSSDFIGLNHYTTNIVYRNESVYGSYSSPSLEDDVE
VLSYQDSSWDSGASSWLKRVPWGFYKLLTKIREDYNNPPVFITENGFSSRGGLIDDDRVK
YYRTYIDAMLDAIEDGSDIRVYTAWSLMDNFEWMEGYSERFGLYEVDYESPERTRTPRKS
AYVYKEMLRTRTLDYHYEPDMSLGMNVDEN