DPGLEAN19450 in OGS1.0

New model in OGS2.0DPOGS204061 
Genomic Positionscaffold2819:+ 10446-13092
See gene structure
CDS Length1437
Paired RNAseq reads  196
Single RNAseq reads  634
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010812 (0.0)
Best Drosophila hit  CG9701 (2e-109)
Best Human hitlactase-phlorizin hydrolase preproprotein (3e-86)
Best NR hit (blastp)  beta-glucosidase precursor [Spodoptera frugiperda] (0.0)
Best NR hit (blastx)  beta-glucosidase precursor [Spodoptera frugiperda] (0.0)
GeneOntology terms

  
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
InterPro families

  
IPR017853 Glycoside hydrolase, superfamily
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR001360 Glycoside hydrolase, family 1
Orthology groupMCL19198

Nucleotide sequence:

ATGTTCGGGGCTGCCACATCAGCATATCAGATAGAAGGAGGATGGAGCGCTGATGACAAA
GGAGAGAATATATGGGATCGTTTGACTCACACCAAACCTAACGTAATCAAGGATGTGAGC
AATGGTGATGTTGCAGCCGACACATACAATAACTACAAACGTGATGTGGAGATGATGAGG
GAGTTGGGGCTAGATGCTTACAGGTTCTCTCTCTCCTGGTCTAGAATACTACCCAATGGC
CTGGCCAACAAAGTCAGCGATGCCGGGGTTGAGTTTTACAACAACTATATAGATGAAATG
ATCAAATACGGTATAAAGCCCATGGTCACTCTGTACCACTGGGACTTGCCACAGAAGTTA
CAAGATTTGGGAGGATTCATGAATCCATTATTCCCCGAGTGGTTTGAAGATTACGCCCGG
GTGGTCTTTGAAAAGTTTGGAGACAGAGTCAAGCACTGGATTACTTTCAATGAACCCAGA
GAAATCTGTTTCGAAGGCTATGGTTCAGCAACCAAAGCGCCTATCCTAAATGCAACCGAC
GTCGGTGTTTATTACTGTGCCAAAAATCTGGTTATGGGTCACGCTAGAGCTTATTACGCA
TATGTCAATGACTTCAAGCCGAGCCAAGAAGGTGTCTGTGGTATCACAATAAGTGTGAAT
TGGTTCGGGGCGTTGACAGATTCCGAGGAAGATCAATTTGCTGCCGAAATGAAGAGACAA
GCAGAATGGGGGCTCTATGCTGAACCTATTTTCTCTGAAGAGGGTGGGTTTCCTAAGGAA
TTAGCAGAAATTGTGGCCAAAAAAAGCGCTGAACAGGGTTATCCTCAATCTCGTATGCCA
GCATTCTCTGATGAAGAGAAGGATTTCGTAAAGGGCGCTTTTGATTTCTTTGGAGTAAAT
CATTACTCAGGCAGCTTTGTATCTGCAACTGAATATAAGACTAACCACCCAGTGCCGTCT
TTATATGATGATGTTGATGTTGGAAGCTACACTCCGCCGGAGTGGCCAAAATCTGCTTCT
TCGTGGTTAGTTCAAGCACCAAACAGTGTTTACAATGCCCTCACTCACCTTCACAAGAAG
TACAACGGTCCCATACTCTACATCACGGAGAACGGCTGGTCCTCGTCTCCGGAAGCTGAT
ATCCTTGATGATGATAGGATTAGATACTACCGAGCGGCTTTGAACAGTGTGCTCGATACC
TTGGAGGCTGGAGTGGATCTACGAGGGTACATGGCATGGAGTCTGATGGACAACTTTGAG
TGGAATGCTGGTTACACAGAACTTCTTGGCCTGTACCGTGTCAACTTCTCGGACCCAGGT
CGTGAGAGAACTCCTCGTAAGTCAGCCTTCGTTTACAAACAGATCATCAAGAGTCGGATG
ATTGATGAAGAATATGAACCTGATACCCTGGACATGACCATTGATGAAGGGAACTGA

Protein sequence:

MFGAATSAYQIEGGWSADDKGENIWDRLTHTKPNVIKDVSNGDVAADTYNNYKRDVEMMR
ELGLDAYRFSLSWSRILPNGLANKVSDAGVEFYNNYIDEMIKYGIKPMVTLYHWDLPQKL
QDLGGFMNPLFPEWFEDYARVVFEKFGDRVKHWITFNEPREICFEGYGSATKAPILNATD
VGVYYCAKNLVMGHARAYYAYVNDFKPSQEGVCGITISVNWFGALTDSEEDQFAAEMKRQ
AEWGLYAEPIFSEEGGFPKELAEIVAKKSAEQGYPQSRMPAFSDEEKDFVKGAFDFFGVN
HYSGSFVSATEYKTNHPVPSLYDDVDVGSYTPPEWPKSASSWLVQAPNSVYNALTHLHKK
YNGPILYITENGWSSSPEADILDDDRIRYYRAALNSVLDTLEAGVDLRGYMAWSLMDNFE
WNAGYTELLGLYRVNFSDPGRERTPRKSAFVYKQIIKSRMIDEEYEPDTLDMTIDEGN