New model in OGS2.0 | DPOGS212394  |
---|---|
Genomic Position | scaffold10668:+ 114-3738 |
CDS Length | 1452 bp |
Paired RNAseq reads   | 513 |
Single RNAseq reads   | 2220 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012077 (8e-149) |
Best Drosophila hit   | CG14476, isoform D (2e-11) |
Best Human hit | lysosomal alpha-glucosidase preproprotein (2e-81) |
Best NR hit (blastp)   | PREDICTED: similar to acid alpha-glucosidase [Tribolium castaneum] (3e-131) |
Best NR hit (blastx)   | PREDICTED: similar to acid alpha-glucosidase [Tribolium castaneum] (5e-125) |
GeneOntology terms | GO:0002026 regulation of the force of heart contraction; GO:0002086 diaphragm contraction; GO:0003007 heart morphogenesis; GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; GO:0004558 alpha-glucosidase activity; GO:0005529 sugar binding; GO:0005625 soluble fraction; GO:0005764 lysosome; GO:0005765 lysosomal membrane; GO:0005975 carbohydrate metabolic process; GO:0005977 glycogen metabolic process; GO:0005980 glycogen catabolic process; GO:0006941 striated muscle contraction; GO:0007040 lysosome organization; GO:0007517 muscle organ development; GO:0007626 locomotory behavior; GO:0008152 metabolic process; GO:0009888 tissue development; GO:0016020 membrane; GO:0043181 vacuolar sequestering; GO:0043587 tongue morphogenesis; GO:0046716 muscle cell homeostasis; GO:0050884 neuromuscular process controlling posture; GO:0050885 neuromuscular process controlling balance; GO:0055010 ventricular cardiac muscle tissue morphogenesis; GO:0060048 cardiac muscle contraction |
InterPro families | IPR000322 Glycoside hydrolase, family 31; IPR000519 P-type trefoil; IPR017957 P-type trefoil, conserved site; IPR011013 Glycoside hydrolase-type carbohydrate-binding; IPR017853 Glycoside hydrolase, superfamily |
Orthology group | ND |
Nucleotide sequence:
ATGTATTTTTTTGTCAGACACTTTCCGTATATCTTCTTGTTGACCCTTTTCTGCTATTGG
TCGTATGATTTAACACTACAGATTATTCCGACTGAGAACCGCTACCACGACCACAACAAT
TTGTCAACCAAGAACAAGAACATCACAGAAGCGTCTTATAAGAAGGATAATAAGAAAAGT
GACTACGGCGATTTGGATCATCAGTGTGGTGATGTATCAGACAGCATGAGGTTCGACTGT
CATCCGCAGGGGGGCGCCAGTGAGGAAGCTTGTACCAAACGGGGTTGTTGTTGGGGGGCG
ACCGCTGTGCAGGGTGCTCCATACTGCTACTACCCTAAACACTACCCGAGCTACCGCTTC
ATGAACAGCACAGAGAACAAGCACAGCATGACCGTGTACTACGCTCACGGGCTGGATACA
GGGTACCCTGGACAGTGGGGAACTGTCATGGTGACCTTCAACTACCTGGCCGATGATGTC
CTGCAGATTAAGATAACCGACGCTAACAACAAAAGGTTCGAACCCCCGTACCCCGAGGTG
CCGGTGGTGTCGGGGCGGGTCACCAGTCTCCAGTACCGAGTGCTGGTGGACAGTGCCGCC
GTCGGCTTCAAGGTCATCAGGACTGAGGACAATGTCACCATAGTCGACACTCAGAACGTG
GGTGGTTTGATATTATCGGAGAAATTCCTTCAGTTATCGTCAGTACTGCCCACGGACCAC
GTGTACGGCTTGGGAGAAAAACAGGCGCCGCTCCTGAACAACTTCAATTGGAACACCTTC
ACGCTGTTCAACAGCGACATGCCGCCCATAGAGAATAAAAGTCTCTATGGGACTCATCCT
TTTTATCTGGCGTTGGAGAGAAATGGGAAGAGTCATGGGATGCTCCTGTTGAATTCGAAC
GCTATGGACATAGTCCTCCAACCGTCTCCGGCTATAACGTACCGCGCCGTGGGCGGCGTC
CTAGACTTCCTGGTGATGATGGGTCCTTCGCCCTCGCAAGTTGTATCTCAACTCACGAGC
CTTATAGGCAGACCCTTCATGCCGCCGTACTGGGCGCTCGGATTCCATCTGTGCAAATAC
GACTACGGCAGCCTCAACACCACCCGCCAGGTCATGCAGAGGAACATCGACGCCGGGATA
CCGTTGGACGCCCAGTGGAATGACCTGGACTACATGAGCACTGCGAACGACTTCACATAC
GACAAGAAGAAATATGAAGGCCTGCCGCAGTTCGTTGACGACTTACATCAGAAAGGAATG
CACTACGTAGTGCTCGTTGATCCCGGGGTGAGTGCATCCGAAACTCCGGGGAGCTATCCG
CCCTTCGACCGGGGATTGGAAATGGACGTGTTCGTGAAGAACTCCACCGACCAGCCCTTC
GTGGGGAAGGTAGAGGGGCTTTGGACACATATAGATGATATTGTTATTGGTCAAAAAAAA
ATAGTAATATAA
Protein sequence:
MYFFVRHFPYIFLLTLFCYWSYDLTLQIIPTENRYHDHNNLSTKNKNITEASYKKDNKKS
DYGDLDHQCGDVSDSMRFDCHPQGGASEEACTKRGCCWGATAVQGAPYCYYPKHYPSYRF
MNSTENKHSMTVYYAHGLDTGYPGQWGTVMVTFNYLADDVLQIKITDANNKRFEPPYPEV
PVVSGRVTSLQYRVLVDSAAVGFKVIRTEDNVTIVDTQNVGGLILSEKFLQLSSVLPTDH
VYGLGEKQAPLLNNFNWNTFTLFNSDMPPIENKSLYGTHPFYLALERNGKSHGMLLLNSN
AMDIVLQPSPAITYRAVGGVLDFLVMMGPSPSQVVSQLTSLIGRPFMPPYWALGFHLCKY
DYGSLNTTRQVMQRNIDAGIPLDAQWNDLDYMSTANDFTYDKKKYEGLPQFVDDLHQKGM
HYVVLVDPGVSASETPGSYPPFDRGLEMDVFVKNSTDQPFVGKVEGLWTHIDDIVIGQKK
IVI