DPGLEAN13951 in OGS1.0

New model in OGS2.0  DPOGS213645
Genomic Position  scaffold543:- 7570-21332
CDS Length  1845
Paired RNAseq reads  3492
Single RNAseq reads  10776
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004586 (0.0)
Best Drosophila hit  trehalase, isoform F (1e-134)
Best Human hit  trehalase precursor (7e-131)
Best NR hit (blastp)  trehalase-2 [Spodoptera exigua] (0.0)
Best NR hit (blastx)  trehalase-2 [Spodoptera exigua] (0.0)
Gene Ontology terms
GO:0004555 alpha,alpha-trehalase activity
GO:0046658 anchored to plasma membrane
GO:0005991 trehalose metabolic process
GO:0015927 trehalase activity
InterPro families
IPR001661 Glycoside hydrolase, family 37
IPR008928 Six-hairpin glycosidase-like
IPR018232 Glycoside hydrolase, family 37, conserved site
Orthology group  MCL11530

Nucleotide sequence:

ATGTTAGTGGCCCTGGAGCTGGTCGCCGGGGACCGCACCAAGCTGCCGCCCACCTGTGAT
AGTATGATTTACTGCCACGGACCGCTGCTGGACACGGTCCAAATGGCGGGCCTCTATAAG
GACTCCAAGACCTTCGTGGATATGAAACTGAAGCTCTCCGCCAACATCACCATGGAGCAC
TTCCACGAGATGATGGCCAGGACGAGTGACCGACCGACCAAAGCCGACATTCAGGAGTTC
GTAGACAATAACTTTGACCCCGCGGGCTCCGAGTTTGAAGAGTGGAGACCCACGGATTGG
AAGGACAACCCAGCCTTCCTGTCTCGTATCAAGGACCCTCGCTTCCACCAGTGGGCGTCC
GACCTGAACGCTCTGTGGCTTCAGCTCGGCAGGAAGATGAAGGACGACGTCAAAAACAAC
GAGCACCTCTACTCCATTATATACGTACACAACCCGGTTATAGTACCAGGTGGCAGGTTC
CGCGAGTTCTACTACTGGGACTCCTACTGGATTATCAAGGGTCTGCTTCTATCGGAGATG
AGGTCCACCGCCATGGGGATGATCTCCAACTTCCTGGAGATAGTCGACAGGTTCGGATTC
ATACCTAACGGAGGACGGATCTACTACCTCATGAGATCCCAGCCTCCGCTACTGATCCCG
ATGGTGAAGCTGCTGATGGATGATTTTGAGGACCTGGGCTTCCTCCGGTCACACATACAC
ACGCTGGACAGAGAGTTCGAGTTCTGGATGAACAACCACACGGTCAGCGTCGACTATGAT
GGGAAAAAGTATCAGATGGCTCGATACAACGACATGTCTCAAGGTCCGCGACCCGAGAGT
TACAAGGAGGACATAGATTGTGCCAAACATCTGGACAGTCGCGAGTTGAAGGAGGAGCTA
TACGCAGAGCTGAAAGCGGGCGCGGAGTCAGGGTGGGATTTCTCCTCCAGATGGTTCATA
CTCAACGGAACCAATAAAGGTAACTTGACGAACCTGAAGACTCGCTCCATAGTCCCGGTG
GACCTGAACGCCATCTTGTGTTGGAACGCTGGTCTGTTGTCAGAGTTCCACGCGCGGCTC
GGGGACACCTCCCGGGCCGATTACTACCGGGAGGTCAGGGCTCGCCTGCTGGAGGCCATC
GAGAAGGTGTTGTGGCACGAGGAGGTGGGCGCGTGGCTGGACTTCAGCTTGGAGTCGGGG
CGAGCGCGGGACTACTTCTACCCCTCGAACGTGGCGCCGCTCTGGACCGGCGCCTACGAT
CGCGGCCGCGAGGAATACTACGTCAATAGGGTCATCAACTACCTCGATAAAGTCAAGGTG
GACATCTTCGAGGGCGGCATCCCGGCGACGTTCGAGCACTCCGGGGAGCAGTGGGACTAT
CCGAACGCCTGGCCGCCGCTACAGCACATGGTGGTGGAGGGTCTGGCGGGCACCAGGCAC
GCCGCCGCCAACAGGCTGGCCGGGGAGATCGCCGCCAAGTGGGTGCGCTCCAACTACGAG
GTCTGGAGACACAAGACCGCCATGCTGGAGAAGTACGACGCCACGGTGTTCGGTGGGTTC
GGCGGCGGCGGCGAGTACGTGGTGCAGACCGGCTTCGGCTGGACCAACGGAGTGGTGATG
GTGCTGCTCAACGAGTACGGAGATTGGTTATCGGCAGAGGACGCGTTCGGCGGAGGGGAC
GAGGAAGGCGGGGCGAGAGGGGAAGGGGGAGAGGGGCTGCACGTCCAGGGGGCGGTTCCA
GGGGGCGGGGGTGGGGGCGGGAGGGGTCGCCACCGCCCTGCTGGTGGTCATGGCGTCGCT
CGCTGCCGGGACGCTTGGTGTGATGGTGTACAGGAAGCGTCGTGA

Protein sequence:

MLVALELVAGDRTKLPPTCDSMIYCHGPLLDTVQMAGLYKDSKTFVDMKLKLSANITMEH
FHEMMARTSDRPTKADIQEFVDNNFDPAGSEFEEWRPTDWKDNPAFLSRIKDPRFHQWAS
DLNALWLQLGRKMKDDVKNNEHLYSIIYVHNPVIVPGGRFREFYYWDSYWIIKGLLLSEM
RSTAMGMISNFLEIVDRFGFIPNGGRIYYLMRSQPPLLIPMVKLLMDDFEDLGFLRSHIH
TLDREFEFWMNNHTVSVDYDGKKYQMARYNDMSQGPRPESYKEDIDCAKHLDSRELKEEL
YAELKAGAESGWDFSSRWFILNGTNKGNLTNLKTRSIVPVDLNAILCWNAGLLSEFHARL
GDTSRADYYREVRARLLEAIEKVLWHEEVGAWLDFSLESGRARDYFYPSNVAPLWTGAYD
RGREEYYVNRVINYLDKVKVDIFEGGIPATFEHSGEQWDYPNAWPPLQHMVVEGLAGTRH
AAANRLAGEIAAKWVRSNYEVWRHKTAMLEKYDATVFGGFGGGGEYVVQTGFGWTNGVVM
VLLNEYGDWLSAEDAFGGGDEEGGARGEGGEGLHVQGAVPGGGGGGGRGRHRPAGGHGVA
RCRDAWCDGVQEAS
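
Consistency check (illustrative sketch):

The CDS length of 1845 nt corresponds to 615 codons, that is 614 residues plus the terminal TGA stop codon, which matches the length of the protein sequence above. The Python sketch below shows one way to check this kind of agreement between a CDS and its listed translation. It assumes the standard genetic code; the function name `translate_cds` is hypothetical and not part of any database tooling. Only the first 60 nt and the last 15 nt of the record are used as examples.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS (hypothetical helper); drop one trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases are rendered as 'X'.
    protein = "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 60 nt and last 15 nt of the nucleotide sequence in this record.
first_60 = "ATGTTAGTGGCCCTGGAGCTGGTCGCCGGGGACCGCACCAAGCTGCCGCCCACCTGTGAT"
last_15 = "CAGGAAGCGTCGTGA"

print(translate_cds(first_60))  # start of the protein
print(translate_cds(last_15))   # end of the protein, stop codon removed
print(1845 // 3 - 1)            # expected residue count excluding the stop
```

Passing the full nucleotide sequence to the same function and comparing the result with the listed protein sequence would extend the check to the whole record.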