New model in OGS2.0 | DPOGS204071  |
---|---|
Genomic Position | scaffold6294:+ 1080-3214 |
See gene structure | |
CDS Length | 1353 |
Paired RNAseq reads   | 164 |
Single RNAseq reads   | 522 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010812 (1e-165) |
Best Drosophila hit   | CG9701 (2e-103) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (1e-80) |
Best NR hit (blastp)   | beta-glucosidase precursor [Spodoptera frugiperda] (0.0) |
Best NR hit (blastx)   | beta-glucosidase precursor [Spodoptera frugiperda] (0.0) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR001360 Glycoside hydrolase, family 1 IPR018120 Glycoside hydrolase, family 1, active site IPR017853 Glycoside hydrolase, superfamily IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | MCL19198 |
Nucleotide sequence:
ATGAAGTTGTTATCATTTCTTTGCTTGTTAATCGGCAGCCATGCTCAAGAAAGAAGATTT
CCTGAGGACTTCATGTTCGGGGCTGCCACATCAGCATATCAGATAGAAGGAGGATGGAGC
GCTGATGACAAAGGAGAGAATATATGGGATCGTTTGACTCACACCAAACCTAACGTAATC
AAGGATGTGAGCAATGGTGATGTTGCAGCCGACACATACAATAACTACAAACGTGATGTG
GAGATGATGAGGGAGTTGGGGCTAGATGCTTACAGGTTCTCTCTCTCCTGGTCTAGAATA
CTACCCAATGGCCTGGCCAACAAAGTCAGCGATGCCGGAGTTGAGTTTTACAACAACTAT
ATAGATGAAATGATCAAATACGGTATAAAGCCCATGGTCACTCTGTACCACTGGGACTTG
CCACAGAAGTTGCAAGATTTGGGAGGATTCACGAATCCATTATTCCCAGAGTGGTTTGAA
GACTACGTCCGGGTGGTTTTTGGAAAGTTTGGAGACAGAGTCAAGCACTGGATTACTTTC
AATGAACCCAGAGAAATCTGTTTCGAAGGCTATGGTTCAGACACCAAAGCGCCTATCCTA
AATGCAACCGACGTCGGTGTTTATTACTGTGCCAAAAATCTGGTTATGGGTCACGCTAGA
GCTTATTACGCATATGTCAATGACTTCAAGCCGAGCCAAGAAGGTGTCTGTGGTATCACA
ATAAGTGTGAATTGGTTCGGGGCGTTGACAGATTCCGAGGAAGATCAATTTGCTGCCGAA
ATGAAGAGACAAGCAGAATGGGGGCTCTATGCTGAACCTATTTTCTCTGAAGAGGGTGGT
TTTCCTAAGGAATTAGCAGAAATTGTTGCCAAAAAAAGCGCTGAACAGGGTTATCCTCGG
TCGCGTATGCCAGAATTCTCTGATGAAGAGAAGGATTTCGTAAAAGGCACTGCTGACTTT
TTAGGAGTAAATCATTACACAGCCGGCTTAGTATCTGCAACTGAATATAAGACTCACCAC
CCAGTGCCGTCTTTATATGATGATATTGATGTAGGAAGCTACACTCCGCCGGAGTGGCCA
AAATCTGCTTCATCTTGGTTAAAATTAGCACCAAACAGTATTTACAATGCCCTCACTCAC
CTTCACAAGAAGTACAACGGTCCCATATTCTACATCACGGAGAACGGCTGGTCCTCGCCT
CCGGAAGCTGATATCCTTGATGATGACAGGATTAGATACTACCGAGCGGCTTTGAACAGT
GTGCTCGATACCTTGGAGGCTGGAGTGGATCTACGGGGGTACATGGCATGGAGTCTGATG
GACAACTTTGAGTGGATGGAGGGTTACACGTAA
Protein sequence:
MKLLSFLCLLIGSHAQERRFPEDFMFGAATSAYQIEGGWSADDKGENIWDRLTHTKPNVI
KDVSNGDVAADTYNNYKRDVEMMRELGLDAYRFSLSWSRILPNGLANKVSDAGVEFYNNY
IDEMIKYGIKPMVTLYHWDLPQKLQDLGGFTNPLFPEWFEDYVRVVFGKFGDRVKHWITF
NEPREICFEGYGSDTKAPILNATDVGVYYCAKNLVMGHARAYYAYVNDFKPSQEGVCGIT
ISVNWFGALTDSEEDQFAAEMKRQAEWGLYAEPIFSEEGGFPKELAEIVAKKSAEQGYPR
SRMPEFSDEEKDFVKGTADFLGVNHYTAGLVSATEYKTHHPVPSLYDDIDVGSYTPPEWP
KSASSWLKLAPNSIYNALTHLHKKYNGPIFYITENGWSSPPEADILDDDRIRYYRAALNS
VLDTLEAGVDLRGYMAWSLMDNFEWMEGYT