DPGLEAN07542 in OGS1.0

New model in OGS2.0DPOGS212394 
Genomic Positionscaffold10668:+ 114-3738
See gene structure
CDS Length1452
Paired RNAseq reads  513
Single RNAseq reads  2220
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012077 (8e-149)
Best Drosophila hit  CG14476, isoform D (2e-11)
Best Human hitlysosomal alpha-glucosidase preproprotein (2e-81)
Best NR hit (blastp)  PREDICTED: similar to acid alpha-glucosidase [Tribolium castaneum] (3e-131)
Best NR hit (blastx)  PREDICTED: similar to acid alpha-glucosidase [Tribolium castaneum] (5e-125)
GeneOntology terms
























  
GO:0002026 regulation of the force of heart contraction
GO:0002086 diaphragm contraction
GO:0003007 heart morphogenesis
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004558 alpha-glucosidase activity
GO:0005529 sugar binding
GO:0005625 soluble fraction
GO:0005764 lysosome
GO:0005765 lysosomal membrane
GO:0005975 carbohydrate metabolic process
GO:0005977 glycogen metabolic process
GO:0005980 glycogen catabolic process
GO:0006941 striated muscle contraction
GO:0007040 lysosome organization
GO:0007517 muscle organ development
GO:0007626 locomotory behavior
GO:0008152 metabolic process
GO:0009888 tissue development
GO:0016020 membrane
GO:0043181 vacuolar sequestering
GO:0043587 tongue morphogenesis
GO:0046716 muscle cell homeostasis
GO:0050884 neuromuscular process controlling posture
GO:0050885 neuromuscular process controlling balance
GO:0055010 ventricular cardiac muscle tissue morphogenesis
GO:0060048 cardiac muscle contraction
InterPro families



  
IPR000322 Glycoside hydrolase, family 31
IPR000519 P-type trefoil
IPR017957 P-type trefoil, conserved site
IPR011013 Glycoside hydrolase-type carbohydrate-binding
IPR017853 Glycoside hydrolase, superfamily
Orthology groupND

Nucleotide sequence:

ATGTATTTTTTTGTCAGACACTTTCCGTATATCTTCTTGTTGACCCTTTTCTGCTATTGG
TCGTATGATTTAACACTACAGATTATTCCGACTGAGAACCGCTACCACGACCACAACAAT
TTGTCAACCAAGAACAAGAACATCACAGAAGCGTCTTATAAGAAGGATAATAAGAAAAGT
GACTACGGCGATTTGGATCATCAGTGTGGTGATGTATCAGACAGCATGAGGTTCGACTGT
CATCCGCAGGGGGGCGCCAGTGAGGAAGCTTGTACCAAACGGGGTTGTTGTTGGGGGGCG
ACCGCTGTGCAGGGTGCTCCATACTGCTACTACCCTAAACACTACCCGAGCTACCGCTTC
ATGAACAGCACAGAGAACAAGCACAGCATGACCGTGTACTACGCTCACGGGCTGGATACA
GGGTACCCTGGACAGTGGGGAACTGTCATGGTGACCTTCAACTACCTGGCCGATGATGTC
CTGCAGATTAAGATAACCGACGCTAACAACAAAAGGTTCGAACCCCCGTACCCCGAGGTG
CCGGTGGTGTCGGGGCGGGTCACCAGTCTCCAGTACCGAGTGCTGGTGGACAGTGCCGCC
GTCGGCTTCAAGGTCATCAGGACTGAGGACAATGTCACCATAGTCGACACTCAGAACGTG
GGTGGTTTGATATTATCGGAGAAATTCCTTCAGTTATCGTCAGTACTGCCCACGGACCAC
GTGTACGGCTTGGGAGAAAAACAGGCGCCGCTCCTGAACAACTTCAATTGGAACACCTTC
ACGCTGTTCAACAGCGACATGCCGCCCATAGAGAATAAAAGTCTCTATGGGACTCATCCT
TTTTATCTGGCGTTGGAGAGAAATGGGAAGAGTCATGGGATGCTCCTGTTGAATTCGAAC
GCTATGGACATAGTCCTCCAACCGTCTCCGGCTATAACGTACCGCGCCGTGGGCGGCGTC
CTAGACTTCCTGGTGATGATGGGTCCTTCGCCCTCGCAAGTTGTATCTCAACTCACGAGC
CTTATAGGCAGACCCTTCATGCCGCCGTACTGGGCGCTCGGATTCCATCTGTGCAAATAC
GACTACGGCAGCCTCAACACCACCCGCCAGGTCATGCAGAGGAACATCGACGCCGGGATA
CCGTTGGACGCCCAGTGGAATGACCTGGACTACATGAGCACTGCGAACGACTTCACATAC
GACAAGAAGAAATATGAAGGCCTGCCGCAGTTCGTTGACGACTTACATCAGAAAGGAATG
CACTACGTAGTGCTCGTTGATCCCGGGGTGAGTGCATCCGAAACTCCGGGGAGCTATCCG
CCCTTCGACCGGGGATTGGAAATGGACGTGTTCGTGAAGAACTCCACCGACCAGCCCTTC
GTGGGGAAGGTAGAGGGGCTTTGGACACATATAGATGATATTGTTATTGGTCAAAAAAAA
ATAGTAATATAA

Protein sequence:

MYFFVRHFPYIFLLTLFCYWSYDLTLQIIPTENRYHDHNNLSTKNKNITEASYKKDNKKS
DYGDLDHQCGDVSDSMRFDCHPQGGASEEACTKRGCCWGATAVQGAPYCYYPKHYPSYRF
MNSTENKHSMTVYYAHGLDTGYPGQWGTVMVTFNYLADDVLQIKITDANNKRFEPPYPEV
PVVSGRVTSLQYRVLVDSAAVGFKVIRTEDNVTIVDTQNVGGLILSEKFLQLSSVLPTDH
VYGLGEKQAPLLNNFNWNTFTLFNSDMPPIENKSLYGTHPFYLALERNGKSHGMLLLNSN
AMDIVLQPSPAITYRAVGGVLDFLVMMGPSPSQVVSQLTSLIGRPFMPPYWALGFHLCKY
DYGSLNTTRQVMQRNIDAGIPLDAQWNDLDYMSTANDFTYDKKKYEGLPQFVDDLHQKGM
HYVVLVDPGVSASETPGSYPPFDRGLEMDVFVKNSTDQPFVGKVEGLWTHIDDIVIGQKK
IVI