DPGLEAN19451 in OGS1.0

New model in OGS2.0  DPOGS204062
Genomic Position  scaffold2819:+ 17330-21883
CDS Length  1497
Paired RNAseq reads  1022
Single RNAseq reads  2528
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010735 (2e-159)
Best Drosophila hit  CG9701 (3e-115)
Best Human hit  lactase-phlorizin hydrolase preproprotein (2e-101)
Best NR hit (blastp)  beta-glucosidase precursor [Spodoptera frugiperda] (2e-142)
Best NR hit (blastx)  beta-glucosidase precursor [Spodoptera frugiperda] (5e-142)
GeneOntology terms
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
InterPro families
IPR017853 Glycoside hydrolase, superfamily
IPR018120 Glycoside hydrolase, family 1, active site
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR001360 Glycoside hydrolase, family 1
Orthology group  MCL40561
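
The genomic position and the CDS length in the record above can be compared with a few lines of Python. This is a minimal sketch, assuming the position string has the form scaffold:strand start-end and that the coordinates are 1-based and inclusive (the record does not state the convention). The function name parse_position is hypothetical.

```python
import re

def parse_position(text):
    """Parse a string like 'scaffold2819:+ 17330-21883' (assumed format)."""
    m = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

scaffold, strand, start, end = parse_position("scaffold2819:+ 17330-21883")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
cds_length = 1497  # "CDS Length" value from the record

print(scaffold, strand, span, span - cds_length)
```

Under these assumptions the locus spans 4554 bp, which is 3057 bp longer than the CDS. That difference would be consistent with non-coding sequence, such as introns, inside the gene model.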

Nucleotide sequence:

ATGTTTCAAGTTGAAGGCTGGTCGGATCTCAAGGTTCGAAGATTCCCCGATGGCTTTTTG
TTTGGCGCGGGGACGTCGGCTTATCAGGTCGAAGGGGCGTGGAATGAAGATGGAAAAGGT
GAAAGCATCTGGGACAAATACCTCCACGATAACCCAGACATTATATCCGATGGCAGAAAT
GGTGATGTAGCATCCAACTCCTACCACCAGTACAAGAGAGATGTGGAAATGTTGAGGGAA
TTGGGTGTGGACTACTACAGGTTCTCAATATCCTGGAGCAGAGTATTGCCTAGAGGATTC
TCGAATGAAATAAATGAAAAAGGTCTCGAATACTACGACAAATTGATAGATGAATTATTG
AAATACAACATAAAGCCAATGATAACTTTATACCACTTTGATTTGCCACAAACTCTCCAA
GACTTTGGAGGTTGGGCCAATCCGCTGTCAACAGAATGGTTTGAAGATTATGCGGCTGTG
ATCTTTAAGGCATTCGCTCACAAGGTTCCTTATTGGATAACCGTCAATCAGCCAAATTCC
ATATGCGTGGAAGGTTATGGTCAAGGTTTGATGGCACCAGCGATCAGCTCGAGTGGAATC
GGTGATTACATGTGTATAAAGAATGTGCTGGTGGCACATGCGAGGGCATACAGGTTATAT
GAGAGGGAATATAAAAAGAAATTTAAGGGATCAGTTGGCATAGCGCTTGCATTAAACTGG
GCAGACCCCGTCAATAACAGCACAAAAAATGTCGAAGCTACGGACGTTTACAGAGAATTT
ATGATCGGTCTCTACATGCATCCCATATGGTCGAAAGATGGTGGGTTCCCTAAAATGGTC
AAAGAAAGAGTCCATCAGAACAGCATAAAGCAAGGATTCAAGAAATCTAGACTGCCTGCC
CTTAGCAAGGAAGAAGTTACTCTTTTGAAAGGGTCCTCAGACTTCGTGGGAGTGAATCAT
TATACAACTGTCCTAGTGAAGAGCACGGACAGGGGGATGTCAGCGCCATCTTTCGATGAC
GACGTTCACGTGGAGCTCACCTACAGGCCGGAGTGGAAGAACGCCACATCTAGCTGGCTG
AAGAGCGTGCCCTACGGTATATACAGGGTGTGCGTATATCTCAATACAAAGTACGACTAC
CCTCAAATGTTTGTGACGGAGCACGGCTGGTCCACGAGGCCAGGGTTGAAGGATGACACG
AGGGTTGAGAACCTGAGGCTGTACCTGAAGGCTATACTGTTTGCTATAGAAGATGGCACG
GACTTGAAAGGTTACACCACATGGAGCCTAATGGATAATGTGGAGTGGGTCGCTGGAACC
AGTGAAAGATTCGGTCTTTATGAAGTAGACTTCGAATCAGAGGATAAAAATAGAACAGCG
AGATTGTCAGCTCTGGTGTATAAACGAATCATAGACAAGAGGATCGTTGAAGACGATTAT
AAACCGAACAATTTAAAAATGTCGATAACTAACAGAAATGTTAAGACGGAACTTTGA
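
A coding sequence such as the one above can be sanity-checked for frame, start codon, and stop codon. The sketch below is illustrative only: the helper name cds_summary is hypothetical, and the example call uses a short excerpt (the first 9 and last 9 bases of the sequence above), not the full CDS.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code

def cds_summary(seq):
    """Report basic properties of a nucleotide coding sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and spaces
    return {
        "length": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }

# Excerpt only: first 9 bases + last 9 bases of the record's sequence.
excerpt = "ATGTTTCAA" + "GAACTTTGA"
print(cds_summary(excerpt))

# The listed CDS length of 1497 is a multiple of 3; excluding the stop
# codon, it corresponds to 498 codons.
print(1497 % 3 == 0, 1497 // 3 - 1)
```

Pasting the full nucleotide sequence into cds_summary should report a length equal to the CDS Length field if the record is internally consistent.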

Protein sequence:

MFQVEGWSDLKVRRFPDGFLFGAGTSAYQVEGAWNEDGKGESIWDKYLHDNPDIISDGRN
GDVASNSYHQYKRDVEMLRELGVDYYRFSISWSRVLPRGFSNEINEKGLEYYDKLIDELL
KYNIKPMITLYHFDLPQTLQDFGGWANPLSTEWFEDYAAVIFKAFAHKVPYWITVNQPNS
ICVEGYGQGLMAPAISSSGIGDYMCIKNVLVAHARAYRLYEREYKKKFKGSVGIALALNW
ADPVNNSTKNVEATDVYREFMIGLYMHPIWSKDGGFPKMVKERVHQNSIKQGFKKSRLPA
LSKEEVTLLKGSSDFVGVNHYTTVLVKSTDRGMSAPSFDDDVHVELTYRPEWKNATSSWL
KSVPYGIYRVCVYLNTKYDYPQMFVTEHGWSTRPGLKDDTRVENLRLYLKAILFAIEDGT
DLKGYTTWSLMDNVEWVAGTSERFGLYEVDFESEDKNRTARLSALVYKRIIDKRIVEDDY
KPNNLKMSITNRNVKTEL
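
The relationship between the nucleotide and protein sequences can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name translate and the "X" placeholder for unrecognized codons are choices made for this example. It translates only the first 60 bases and the last 12 bases of the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq, to_stop=True):
    """Translate a nucleotide sequence in frame 1; 'X' marks unknown codons."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop translating at the first stop codon
        protein.append(aa)
    return "".join(protein)

first_60 = "ATGTTTCAAGTTGAAGGCTGGTCGGATCTCAAGGTTCGAAGATTCCCCGATGGCTTTTTG"
last_12 = "ACGGAACTTTGA"

print(translate(first_60))  # MFQVEGWSDLKVRRFPDGFL
print(translate(last_12))   # TEL
```

Both outputs agree with the start and end of the protein sequence listed above. The full protein is 498 residues, which matches the 1497-base CDS once the stop codon is counted: (498 + 1) x 3 = 1497.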