New model in OGS2.0 | DPOGS214376  |
---|---|
Genomic Position | scaffold1959:+ 14624-18263 |
See gene structure | |
CDS Length | 1233 |
Paired RNAseq reads   | 336 |
Single RNAseq reads   | 733 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004000 (4e-128) |
Best Drosophila hit   | CG7997, isoform A (3e-134) |
Best Human hit | alpha-N-acetylgalactosaminidase precursor (6e-96) |
Best NR hit (blastp)   | alpha-N-acetylgalactosaminidase [Bombyx mori] (8e-163) |
Best NR hit (blastx)   | alpha-N-acetylgalactosaminidase [Bombyx mori] (3e-163) |
GeneOntology terms    | GO:0004557 alpha-galactosidase activity GO:0005975 carbohydrate metabolic process GO:0043169 cation binding |
InterPro families    | IPR013785 Aldolase-type TIM barrel IPR013780 Glycosyl hydrolase, family 13, all-beta IPR017853 Glycoside hydrolase, superfamily IPR002241 Glycoside hydrolase, family 27 IPR000111 Glycoside hydrolase, clan GH-D |
Orthology group | MCL11034 |
Nucleotide sequence:
ATGGGAACTCATTTAATCGCCATTTTTGCAATAATACCATATGTCTTGGCTCTCGATAAT
GGACTAGCGCTCACTCCGCCAATGGGGTGGTTGACCTGGCAGCGATTTCGATGTATAACA
GATTGCGATAAATATCCAAATGAGTGTATAAGTGAATCTCTCATTAAACGGATGGCAGAC
ATTATGGTCAACGAGGGATATTCCCACGCTGGGTACAAATACGTCGGCATCGACGACTGT
TGGCTCGAGAAAACACGTGACGCAAACGGTCGATTGGTTCCCGATAGGAAACGGTTTCCG
AACGGTATGAAGGCTGTCGCAGATTATCTGCATGATCTCGGTTTAAAATTCGCGTTATAC
CAGGATTACGGTACAAAAACCTGCGCTGGTTACCCCGGGGTACTAGGGCATGAGGCTGTT
GACGTTCAGACATTCGCCGAATGGGAAGTGGATTATATTAAATTAGACGGATGTAATGTC
AACGTTTCCAAGATGGACACCGGTTATCCGGAATTTGGAAAATTGATGAATGAAAGCGGT
CGGCCCATGGTATACTCATGTAGCTGGCCAGCGTATCAGAATAAACCTGATTATGCATCG
ATATCGAAGCACTGTAACATGTGGCGTAACTGGGACGATATCCAGGACTCGTGGGCTTCA
CTCACCACGATCATGAGCTGGTTTGCGGAAAAACAGGAAGAAATCGCCAAATACGCCGGA
CCCGGAAGATGGAATGACCCGGATATGTTGCTCATAGGAAATTTTGGATTATCACTGGAC
CAGGCGAGAGTTCAAATGGCCGTGTGGTCGATACTGGCCGCCCCACTGCTCATGAGTGTA
GATCTGGCCACCATCCGACCGGAGTTTAAGGAGGTGTTGCTTAACAAAGACATCATAGCC
ATAGATCAAGACGAGCTGGGCAAGCAAGGGTTAATGGTGTGGAATAAAGCGAAATGCGAG
ATCTGGACACGCGAATTAGTGGACGGTATAGCGGTAGCGTTTGTCAGTAAAAGAGATGAT
GGAGCGCCTCACACTGTTGATGTTACAACTGAGGATATGAAAATACCACCGACGACGTAT
CATATACAGGATCTGTACAAAGATGGACATAATTTCAAATTTGATTGCAAAGGAAACTTC
ACAACCAGAATCAATCCGTCAGGCGTCAGATTCTACAAGTTCATCCCCATAAAAGGCAAT
GAGGTTGATAGCCCTTCTATCACCTATATATAG
Protein sequence:
MGTHLIAIFAIIPYVLALDNGLALTPPMGWLTWQRFRCITDCDKYPNECISESLIKRMAD
IMVNEGYSHAGYKYVGIDDCWLEKTRDANGRLVPDRKRFPNGMKAVADYLHDLGLKFALY
QDYGTKTCAGYPGVLGHEAVDVQTFAEWEVDYIKLDGCNVNVSKMDTGYPEFGKLMNESG
RPMVYSCSWPAYQNKPDYASISKHCNMWRNWDDIQDSWASLTTIMSWFAEKQEEIAKYAG
PGRWNDPDMLLIGNFGLSLDQARVQMAVWSILAAPLLMSVDLATIRPEFKEVLLNKDIIA
IDQDELGKQGLMVWNKAKCEIWTRELVDGIAVAFVSKRDDGAPHTVDVTTEDMKIPPTTY
HIQDLYKDGHNFKFDCKGNFTTRINPSGVRFYKFIPIKGNEVDSPSITYI