DPGLEAN06401 in OGS1.0

New model in OGS2.0DPOGS212394 
Genomic Positionscaffold219:+ 279116-283553
See gene structure
CDS Length1185
Paired RNAseq reads  1016
Single RNAseq reads  2492
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012077 (5e-110)
Best Drosophila hit  ND
Best Human hitlysosomal alpha-glucosidase preproprotein (1e-10)
Best NR hit (blastp)  PREDICTED: similar to acid alpha-glucosidase [Tribolium castaneum] (9e-27)
Best NR hit (blastx)  sucrase-isomaltase, intestinal, putative [Pediculus humanus corporis] (7e-28)
GeneOntology terms
























  
GO:0002026 regulation of the force of heart contraction
GO:0002086 diaphragm contraction
GO:0003007 heart morphogenesis
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004558 alpha-glucosidase activity
GO:0005529 sugar binding
GO:0005625 soluble fraction
GO:0005764 lysosome
GO:0005765 lysosomal membrane
GO:0005975 carbohydrate metabolic process
GO:0005977 glycogen metabolic process
GO:0005980 glycogen catabolic process
GO:0006941 striated muscle contraction
GO:0007040 lysosome organization
GO:0007517 muscle organ development
GO:0007626 locomotory behavior
GO:0008152 metabolic process
GO:0009888 tissue development
GO:0016020 membrane
GO:0043181 vacuolar sequestering
GO:0043587 tongue morphogenesis
GO:0046716 muscle cell homeostasis
GO:0050884 neuromuscular process controlling posture
GO:0050885 neuromuscular process controlling balance
GO:0055010 ventricular cardiac muscle tissue morphogenesis
GO:0060048 cardiac muscle contraction
InterPro families

  
IPR000519 P-type trefoil
IPR017957 P-type trefoil, conserved site
IPR000322 Glycoside hydrolase, family 31
Orthology groupND

Nucleotide sequence:

ATGCCTAAGATACCTTACAAACCTGAAAAGGGTCGTGAAGAAGACGAGGATTATGAAATT
GTATCTTTTGAAGACTTTTGCGATAAGCCTCCAGGAACGAGTACCGATTTATTGACTTTA
AATGACAATATTAACTATCGACTTCATTATGAAACCGATAAGGCTTCAAATTTTCCTGAG
GCCGGTGAACCATCAACTCGAGATATTGGGATGCACACGGAAACTTCTTTAAACGGTCCC
AAAAATGCGTCGTACAAAAGAAAATTGTCCTTTGCACCGTTCGGCAGAACCGATAAAAAT
AGCCGCGGTCCATTATTCAGTGGAGTTATACCGAAACCTAGAAATGAAGGGGAATCTAGG
GAACACAGATACGAACGTTTCTCTCCGCGTGGCGGTTGGCTAGCGCGTACCTGGGAACAG
CTAGGGAATTTGTTACCTGGTATGCTAGCTACGGCGTTGTTGAGCGCGTTATGCGTGGGC
GCATGGTGGGCGGTGGGCGGAGCTCTAAGCGGCAGCTGGGGAGATGATCATTACAGACGT
CTCTACGAACGCGCTCATCCAGATGACATAAAAAAACCACTGTCGCCTGTAATTGAGAAG
ATTATTCCGACTGAGAACCGCTACCACGACCACAACAATTTGTCAACCAAGAACAAGAAC
ATCACAGAAGCGTCTTATAAGAAGGATAATAAGAAAAGTGACTACGGCGATTTGGATCAT
CAGTGTGGTGATGTATCAGACAGCATGAGGTTCGACTGTCATCCGCAGGGGGGCGCCAGT
GAGGAAGCTTGTACCAAACGGGGTTGTTGTTGGGGGGCGACCGCTGTGCAGGGTGCTCCA
TACTGCTACTACCCTAAACACTACCCGAGCTACCGCTTCATGAACAGCACAGAGAACAAG
CACAGCATGACCGTGTACTACGCTCACGGTCTGGATACAGGGTACCCTGGACAGTGGGGA
ACTGTCATGGTGACCTTCAACTACCTGGCCGATGATGTCCTGCAGATTAAGATGACCGAC
GCTAACAACAAGAGGTTCGAACCCCCGTACCCCGAGGTGCCGGTGGTGTCGGGGCGGGTC
ACCAGTCTCCAGTACCGAGTGCTGGTGGACAGTGCCGCCGTCGGCTTCAAGGTCATCAGG
ACTGAGGACAACGTCACCATGTTAGTTATATACACTGAGATCTAA

Protein sequence:

MPKIPYKPEKGREEDEDYEIVSFEDFCDKPPGTSTDLLTLNDNINYRLHYETDKASNFPE
AGEPSTRDIGMHTETSLNGPKNASYKRKLSFAPFGRTDKNSRGPLFSGVIPKPRNEGESR
EHRYERFSPRGGWLARTWEQLGNLLPGMLATALLSALCVGAWWAVGGALSGSWGDDHYRR
LYERAHPDDIKKPLSPVIEKIIPTENRYHDHNNLSTKNKNITEASYKKDNKKSDYGDLDH
QCGDVSDSMRFDCHPQGGASEEACTKRGCCWGATAVQGAPYCYYPKHYPSYRFMNSTENK
HSMTVYYAHGLDTGYPGQWGTVMVTFNYLADDVLQIKMTDANNKRFEPPYPEVPVVSGRV
TSLQYRVLVDSAAVGFKVIRTEDNVTMLVIYTEI