New model in OGS2.0 | DPOGS204062  |
---|---|
Genomic Position | scaffold2819:+ 17330-21883 |
See gene structure | |
CDS Length | 1497 |
Paired RNAseq reads   | 1022 |
Single RNAseq reads   | 2528 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010735 (2e-159) |
Best Drosophila hit   | CG9701 (3e-115) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (2e-101) |
Best NR hit (blastp)   | beta-glucosidase precursor [Spodoptera frugiperda] (2e-142) |
Best NR hit (blastx)   | beta-glucosidase precursor [Spodoptera frugiperda] (5e-142) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR017853 Glycoside hydrolase, superfamily IPR018120 Glycoside hydrolase, family 1, active site IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR001360 Glycoside hydrolase, family 1 |
Orthology group | MCL40561 |
Nucleotide sequence:
ATGTTTCAAGTTGAAGGCTGGTCGGATCTCAAGGTTCGAAGATTCCCCGATGGCTTTTTG
TTTGGCGCGGGGACGTCGGCTTATCAGGTCGAAGGGGCGTGGAATGAAGATGGAAAAGGT
GAAAGCATCTGGGACAAATACCTCCACGATAACCCAGACATTATATCCGATGGCAGAAAT
GGTGATGTAGCATCCAACTCCTACCACCAGTACAAGAGAGATGTGGAAATGTTGAGGGAA
TTGGGTGTGGACTACTACAGGTTCTCAATATCCTGGAGCAGAGTATTGCCTAGAGGATTC
TCGAATGAAATAAATGAAAAAGGTCTCGAATACTACGACAAATTGATAGATGAATTATTG
AAATACAACATAAAGCCAATGATAACTTTATACCACTTTGATTTGCCACAAACTCTCCAA
GACTTTGGAGGTTGGGCCAATCCGCTGTCAACAGAATGGTTTGAAGATTATGCGGCTGTG
ATCTTTAAGGCATTCGCTCACAAGGTTCCTTATTGGATAACCGTCAATCAGCCAAATTCC
ATATGCGTGGAAGGTTATGGTCAAGGTTTGATGGCACCAGCGATCAGCTCGAGTGGAATC
GGTGATTACATGTGTATAAAGAATGTGCTGGTGGCACATGCGAGGGCATACAGGTTATAT
GAGAGGGAATATAAAAAGAAATTTAAGGGATCAGTTGGCATAGCGCTTGCATTAAACTGG
GCAGACCCCGTCAATAACAGCACAAAAAATGTCGAAGCTACGGACGTTTACAGAGAATTT
ATGATCGGTCTCTACATGCATCCCATATGGTCGAAAGATGGTGGGTTCCCTAAAATGGTC
AAAGAAAGAGTCCATCAGAACAGCATAAAGCAAGGATTCAAGAAATCTAGACTGCCTGCC
CTTAGCAAGGAAGAAGTTACTCTTTTGAAAGGGTCCTCAGACTTCGTGGGAGTGAATCAT
TATACAACTGTCCTAGTGAAGAGCACGGACAGGGGGATGTCAGCGCCATCTTTCGATGAC
GACGTTCACGTGGAGCTCACCTACAGGCCGGAGTGGAAGAACGCCACATCTAGCTGGCTG
AAGAGCGTGCCCTACGGTATATACAGGGTGTGCGTATATCTCAATACAAAGTACGACTAC
CCTCAAATGTTTGTGACGGAGCACGGCTGGTCCACGAGGCCAGGGTTGAAGGATGACACG
AGGGTTGAGAACCTGAGGCTGTACCTGAAGGCTATACTGTTTGCTATAGAAGATGGCACG
GACTTGAAAGGTTACACCACATGGAGCCTAATGGATAATGTGGAGTGGGTCGCTGGAACC
AGTGAAAGATTCGGTCTTTATGAAGTAGACTTCGAATCAGAGGATAAAAATAGAACAGCG
AGATTGTCAGCTCTGGTGTATAAACGAATCATAGACAAGAGGATCGTTGAAGACGATTAT
AAACCGAACAATTTAAAAATGTCGATAACTAACAGAAATGTTAAGACGGAACTTTGA
Protein sequence:
MFQVEGWSDLKVRRFPDGFLFGAGTSAYQVEGAWNEDGKGESIWDKYLHDNPDIISDGRN
GDVASNSYHQYKRDVEMLRELGVDYYRFSISWSRVLPRGFSNEINEKGLEYYDKLIDELL
KYNIKPMITLYHFDLPQTLQDFGGWANPLSTEWFEDYAAVIFKAFAHKVPYWITVNQPNS
ICVEGYGQGLMAPAISSSGIGDYMCIKNVLVAHARAYRLYEREYKKKFKGSVGIALALNW
ADPVNNSTKNVEATDVYREFMIGLYMHPIWSKDGGFPKMVKERVHQNSIKQGFKKSRLPA
LSKEEVTLLKGSSDFVGVNHYTTVLVKSTDRGMSAPSFDDDVHVELTYRPEWKNATSSWL
KSVPYGIYRVCVYLNTKYDYPQMFVTEHGWSTRPGLKDDTRVENLRLYLKAILFAIEDGT
DLKGYTTWSLMDNVEWVAGTSERFGLYEVDFESEDKNRTARLSALVYKRIIDKRIVEDDY
KPNNLKMSITNRNVKTEL