DPGLEAN07635 in OGS1.0

New model in OGS2.0  DPOGS201330
Genomic Position  scaffold20186:- 898-2953
CDS Length  1455
Paired RNAseq reads  31
Single RNAseq reads  144
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010537 (5e-172)
Best Drosophila hit  CG9701 (7e-129)
Best Human hit  lactase-phlorizin hydrolase preproprotein (1e-101)
Best NR hit (blastp)  beta-glucosidase precursor [Spodoptera frugiperda] (2e-151)
Best NR hit (blastx)  beta-glucosidase precursor [Spodoptera frugiperda] (2e-152)
Gene Ontology terms
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
InterPro families
IPR018120 Glycoside hydrolase, family 1, active site
IPR001360 Glycoside hydrolase, family 1
IPR017853 Glycoside hydrolase, superfamily
IPR013781 Glycoside hydrolase, subgroup, catalytic core
Orthology group  MCL10077

Nucleotide sequence:

ATGAGGAAATTTCCGGCGAGATTAATTTTTGGAACAGCGACAGCCTCTTATCAAGTGGAA
GGTGCCTGGGACGTCGACGGAAAATCGGAAAATATCTGGGACCATTTAACTCATACAAAT
CCTTGCAAAGTGCTGGACTGCTCTAATGGTGATGTCGCTGACAACTCTTATTATCTCTAT
AAAAGAGATGTGGAAATGATGCGCGAGTTAGGACTCGACACTTACAGGTTTTCTATCTCT
TGGACCAGAATCCTTCCTACTGGTTTTCCAGATTACATCAATAAAGCTGGAGTAGCATAT
TACAACAACTTAATTGATGAAATGCTAAAATATAACATTCAGCCGATAGTAACTTTATAC
CATTGGGACCTACCACAGAAAATACAAGAGATGGGAGGCTGGACGAATAGTGAAATTGTT
AATTGGTTTGGAGACTACGCACGAGTTATATTTAATTTTTTTGGTGATAGAGTAAAATAT
TTTATCACTATTAATGAACCTCATCAAATTTGCGAGTTTGGCTATGGAAAAGATATATTT
GCACCAGCATTAAAGATACAAGGTATAGCTGACTATTTATGCATGAAGAATGTACTATTA
GGTCACGCTAGAGCTTATCACATTTATGATAAAGAATTTCGGGTGAATCAAAATGGAAAA
ATATTCATTACAATAAACGCCGAATGGCATCAACCCAAAACAGTAAATGACGAGGAAGCA
GCCCGGGATGCTAGACAATTTTATTGGGAGGTTTATGCTCATCCAATATTTTCAAAAAGT
GGAAATTTTCCTCCGGAAATGATAAAGAGGATAGCGGATAAAAGTGCTGCACAAGGTTTT
CTCAGATCCAGATTACCAGAATTATCTAGAGCGGAAGTTAAATTTGTACATGGAACCTCT
GATTTCTTTGGACTGAATCATTATTCAACAAGTATTGTCTATAGAAATGAGAGCGCACCT
GAAATTCATCCTGTACCATCATTCGGTGACGATCTGGATATAATAGCATATCAGTTACCC
GAATGGAAAATTGGAAGTTCAAATTTTACTAAGTACGTTCCATGGGGCTTTCGGTCATTA
TTTAACTACATCAGCCATCAATACGGAAATCCACCTATCTTGGTGACTGAGAACGGATTT
GCAACAAATGGTGGTATTATCGACGAAGACCGAGTGACATACTTCAGAGGCTACTTGAAC
GCTGTCTTAGATGCCATCGACGATGGTGTTGATATAAGAGGTTATATTGCCTGGAGTCTC
ATGGATAATTTCGAGTGGTCAAAAGGATACACTGAACGCTTCGGTCTGTATGAAGTCGAC
TACAACGACCCAAACCGTACTCGCACGCCTCGCAAGTCCGCTTATGTACTGAAGGAGATT
ATAAGGACACGATCTATTGATCCCAACTATGAACCTGACATGAGCCAACCCCTGACCATT
GATGATGGACTCTAA

Protein sequence:

MRKFPARLIFGTATASYQVEGAWDVDGKSENIWDHLTHTNPCKVLDCSNGDVADNSYYLY
KRDVEMMRELGLDTYRFSISWTRILPTGFPDYINKAGVAYYNNLIDEMLKYNIQPIVTLY
HWDLPQKIQEMGGWTNSEIVNWFGDYARVIFNFFGDRVKYFITINEPHQICEFGYGKDIF
APALKIQGIADYLCMKNVLLGHARAYHIYDKEFRVNQNGKIFITINAEWHQPKTVNDEEA
ARDARQFYWEVYAHPIFSKSGNFPPEMIKRIADKSAAQGFLRSRLPELSRAEVKFVHGTS
DFFGLNHYSTSIVYRNESAPEIHPVPSFGDDLDIIAYQLPEWKIGSSNFTKYVPWGFRSL
FNYISHQYGNPPILVTENGFATNGGIIDEDRVTYFRGYLNAVLDAIDDGVDIRGYIAWSL
MDNFEWSKGYTERFGLYEVDYNDPNRTRTPRKSAYVLKEIIRTRSIDPNYEPDMSQPLTI
DDGL