DPGLEAN15868 in OGS1.0

New model in OGS2.0DPOGS214376 
Genomic Positionscaffold1959:+ 14624-18263
See gene structure
CDS Length1233
Paired RNAseq reads  336
Single RNAseq reads  733
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004000 (4e-128)
Best Drosophila hit  CG7997, isoform A (3e-134)
Best Human hitalpha-N-acetylgalactosaminidase precursor (6e-96)
Best NR hit (blastp)  alpha-N-acetylgalactosaminidase [Bombyx mori] (8e-163)
Best NR hit (blastx)  alpha-N-acetylgalactosaminidase [Bombyx mori] (3e-163)
GeneOntology terms

  
GO:0004557 alpha-galactosidase activity
GO:0005975 carbohydrate metabolic process
GO:0043169 cation binding
InterPro families



  
IPR013785 Aldolase-type TIM barrel
IPR013780 Glycosyl hydrolase, family 13, all-beta
IPR017853 Glycoside hydrolase, superfamily
IPR002241 Glycoside hydrolase, family 27
IPR000111 Glycoside hydrolase, clan GH-D
Orthology groupMCL11034

Nucleotide sequence:

ATGGGAACTCATTTAATCGCCATTTTTGCAATAATACCATATGTCTTGGCTCTCGATAAT
GGACTAGCGCTCACTCCGCCAATGGGGTGGTTGACCTGGCAGCGATTTCGATGTATAACA
GATTGCGATAAATATCCAAATGAGTGTATAAGTGAATCTCTCATTAAACGGATGGCAGAC
ATTATGGTCAACGAGGGATATTCCCACGCTGGGTACAAATACGTCGGCATCGACGACTGT
TGGCTCGAGAAAACACGTGACGCAAACGGTCGATTGGTTCCCGATAGGAAACGGTTTCCG
AACGGTATGAAGGCTGTCGCAGATTATCTGCATGATCTCGGTTTAAAATTCGCGTTATAC
CAGGATTACGGTACAAAAACCTGCGCTGGTTACCCCGGGGTACTAGGGCATGAGGCTGTT
GACGTTCAGACATTCGCCGAATGGGAAGTGGATTATATTAAATTAGACGGATGTAATGTC
AACGTTTCCAAGATGGACACCGGTTATCCGGAATTTGGAAAATTGATGAATGAAAGCGGT
CGGCCCATGGTATACTCATGTAGCTGGCCAGCGTATCAGAATAAACCTGATTATGCATCG
ATATCGAAGCACTGTAACATGTGGCGTAACTGGGACGATATCCAGGACTCGTGGGCTTCA
CTCACCACGATCATGAGCTGGTTTGCGGAAAAACAGGAAGAAATCGCCAAATACGCCGGA
CCCGGAAGATGGAATGACCCGGATATGTTGCTCATAGGAAATTTTGGATTATCACTGGAC
CAGGCGAGAGTTCAAATGGCCGTGTGGTCGATACTGGCCGCCCCACTGCTCATGAGTGTA
GATCTGGCCACCATCCGACCGGAGTTTAAGGAGGTGTTGCTTAACAAAGACATCATAGCC
ATAGATCAAGACGAGCTGGGCAAGCAAGGGTTAATGGTGTGGAATAAAGCGAAATGCGAG
ATCTGGACACGCGAATTAGTGGACGGTATAGCGGTAGCGTTTGTCAGTAAAAGAGATGAT
GGAGCGCCTCACACTGTTGATGTTACAACTGAGGATATGAAAATACCACCGACGACGTAT
CATATACAGGATCTGTACAAAGATGGACATAATTTCAAATTTGATTGCAAAGGAAACTTC
ACAACCAGAATCAATCCGTCAGGCGTCAGATTCTACAAGTTCATCCCCATAAAAGGCAAT
GAGGTTGATAGCCCTTCTATCACCTATATATAG

Protein sequence:

MGTHLIAIFAIIPYVLALDNGLALTPPMGWLTWQRFRCITDCDKYPNECISESLIKRMAD
IMVNEGYSHAGYKYVGIDDCWLEKTRDANGRLVPDRKRFPNGMKAVADYLHDLGLKFALY
QDYGTKTCAGYPGVLGHEAVDVQTFAEWEVDYIKLDGCNVNVSKMDTGYPEFGKLMNESG
RPMVYSCSWPAYQNKPDYASISKHCNMWRNWDDIQDSWASLTTIMSWFAEKQEEIAKYAG
PGRWNDPDMLLIGNFGLSLDQARVQMAVWSILAAPLLMSVDLATIRPEFKEVLLNKDIIA
IDQDELGKQGLMVWNKAKCEIWTRELVDGIAVAFVSKRDDGAPHTVDVTTEDMKIPPTTY
HIQDLYKDGHNFKFDCKGNFTTRINPSGVRFYKFIPIKGNEVDSPSITYI