DPGLEAN07201 in OGS1.0

New model in OGS2.0DPOGS201926 
Genomic Positionscaffold562:+ 536-8296
See gene structure
CDS Length1590
Paired RNAseq reads  1610
Single RNAseq reads  8295
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002660 (0.0)
Best Drosophila hit  CG9701 (6e-125)
Best Human hitlactase-phlorizin hydrolase preproprotein (9e-103)
Best NR hit (blastp)  glycoside hydrolase [Culex quinquefasciatus] (2e-151)
Best NR hit (blastx)  glycoside hydrolase [Culex quinquefasciatus] (8e-146)
GeneOntology terms

  
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
InterPro families

  
IPR017853 Glycoside hydrolase, superfamily
IPR001360 Glycoside hydrolase, family 1
IPR013781 Glycoside hydrolase, subgroup, catalytic core
Orthology groupMCL10077

Nucleotide sequence:

ATGAACGTTAGAATATTATTTAAAAACATTGATTTTTTTATCAGGCAAGCGGAAGTATTA
AATTTAGCTGGAGGCGCAAAGTCTAACTACACTTTTCCGAAGGATTTTCTTTTTGGTGTC
TCAACAGCGGCAATACAAATTGAAGGAGCATGGAATGAAGATGGGAAGACGGAAAGCATA
TGGGATCACTTAGTGCGTGTAAATCCTAACTTCACTAAAGACGGATCTACCCCTGACGTA
GCAGCAGATTCCTATCATTTATACAAGAGAGATGCTGAAATGGTCCACGAACTTGGAGTG
AATATGTATAGGTTTTCAATATCCTGGCCAAGGATACTACCAACTGGTTTAGCCAATCAA
GTTAATCCTCTCGGAATTGAGTACTATAAAAATCTCATAAGCGAGTTGGAAAGGTACAAT
ATTACTCCTATGGTCACCATTTACCATTGGGATCTACCTCAGAAATTACAGGATATTGGG
GGTTGGACGAACGCGCATATCATAGATTATTATACGGACTACGCGAATGTATTGTTTGAA
AATTTCGCGGATAAAGTTAAATATTGGATAACATTCAACGAGCCAATGCAAACCTGCCTG
GAAGGTTACGGCAACACGTACCGAGCGCCTGCACTGAACCGACACGGTATAGCTGAATAT
CTGTGCACACACAATTTGTTAAAAGCGCACGCAAGCGTTTACCATTTGTTCAATAAGCAG
TATCGTCCACTGTATGGAGGGAAAATGGGTATGTCACTGGACTCTAATTGGGCAGAACCC
AAAACAGATACACCAAGAGACAAGAAAGCTGCGGAGTTGTACCTTAAAACTCATCTTGGA
TGGTATGCACATCCTGTATATTCGGAAACTGGAAATTATCCAGAAGAGCTTATCAAACTT
GTTGATGAAAAAAGTAAGAAACAGAACTACACCCACTCTCGACTTCCCAAGTTTACTCCT
GAGGAAATAGCCTATATACGAGGAACTGCAGACTTCTTCGGTTTAAACCATTACACCACG
TATCTTTTGAGCATGGCTGACAGTGAAGTTGGTGAGGTGCCATCACATGCAAACGATGTT
GGTATTGTTAGGGTTCAAGATCCCAAGTGGCCGTCGAAGTCCTCTTCCTCTTGGCTAAAG
GTGGTGCCATTTGGATTTCGTCGCCTCTTAAATTGGATAACTAAAACGTACAATAACGTG
CCAATAATCGTTACGGAGAACGGATATGCTGACTTTAGTGGAGTGAAAGATGAAGCAAGA
GTTTCTTACTATTGCCACTATTTAAATTCTCTCCTCCATTCAATACACGAAGATAAGACA
AACGTTCAAGGGTATTTCGCTTGGAGTCTGATGGATAATTGGGAATGGGACGACGGCTAT
GCGTCCCGCTTCGGTCTTTACTTGGTCGATTTCAATAGTCCCAACAAGACGAGAACTGCT
AAGGAATCGGCGAAATTGTACACGAGCGTAATATCCTCTCGAGGCCTGCCCGCCGACTAC
GACCCAGAAGATTTCACCGCCTTTTCCAGTGCTTCTCTTCTCGTTCCAACTCTACTCTCA
CTCTTACCCTTTTATAGGCTACTTACATGA

Protein sequence:

MNVRILFKNIDFFIRQAEVLNLAGGAKSNYTFPKDFLFGVSTAAIQIEGAWNEDGKTESI
WDHLVRVNPNFTKDGSTPDVAADSYHLYKRDAEMVHELGVNMYRFSISWPRILPTGLANQ
VNPLGIEYYKNLISELERYNITPMVTIYHWDLPQKLQDIGGWTNAHIIDYYTDYANVLFE
NFADKVKYWITFNEPMQTCLEGYGNTYRAPALNRHGIAEYLCTHNLLKAHASVYHLFNKQ
YRPLYGGKMGMSLDSNWAEPKTDTPRDKKAAELYLKTHLGWYAHPVYSETGNYPEELIKL
VDEKSKKQNYTHSRLPKFTPEEIAYIRGTADFFGLNHYTTYLLSMADSEVGEVPSHANDV
GIVRVQDPKWPSKSSSSWLKVVPFGFRRLLNWITKTYNNVPIIVTENGYADFSGVKDEAR
VSYYCHYLNSLLHSIHEDKTNVQGYFAWSLMDNWEWDDGYASRFGLYLVDFNSPNKTRTA
KESAKLYTSVISSRGLPADYDPEDFTAFSSASLLVPTLLSLLPFYRLLT