DPGLEAN15939 in OGS1.0

New model in OGS2.0  DPOGS201891
Genomic Position  scaffold22:+ 93989-98204
CDS Length  1821
Paired RNAseq reads  174
Single RNAseq reads  427
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006066 (2e-53)
Best Drosophila hit  larval visceral protein H (3e-141)
Best Human hit  neutral and basic amino acid transport protein rBAT (5e-80)
Best NR hit (blastp)  RecName: Full=Maltase 1; Flags: Precursor (6e-159)
Best NR hit (blastx)  maltase 1 [Drosophila virilis] (5e-158)
GeneOntology terms
GO:0004558 alpha-glucosidase activity
GO:0006006 glucose metabolic process
GO:0005975 carbohydrate metabolic process
GO:0043169 cation binding
InterPro families
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR017853 Glycoside hydrolase, superfamily
IPR006589 Glycosyl hydrolase, family 13, subfamily, catalytic domain
IPR006047 Glycosyl hydrolase, family 13, catalytic domain
IPR015902 Alpha amylase
Orthology group  MCL10053

Nucleotide sequence:

ATGGGGTCGTGTAAAGCGTTGGGCATAACTGCTGCCCTTTTCTTGGGCTTAATACTGGTT
GGAGGTATTATCACAGTAGCAGTGTTGTGGTCTCAAGATAATGCAGTTGACCCTCCGATT
ATTATTCCCACTGACTGGTGGGAGCACTGCGTTCTCTATCAGATTTATCCTCGCTCTTTC
AAAGACACCGACGGAGATGGCATCGGAGATTTAAAAGGTATCACCCAAGAGCTCGAGCAT
TTCGTGGATGCCGGAGTGGACGCTATATGGATGTCTCCGATATTTGCCTCCCCGATGGTG
GACTTCGGATATGACATCAGCAACTTTTACGAGATACATTACGAGTACGGAACCATGGAG
GACTTCGAGGCTCTCCTTGAGAAGGCGCATCGTTTAGGAATAAAGGTGTTATTGGATTTC
GTGCCTAACCATGCAAGCAATGAATCGGATTACTTTAAGAAATCAGAAGCCAGAGATCCC
GAATATGAAGATTTTTTCGTTTGGGCGGACGGGATCCCGGATCCAAACAATGCCAGTAAC
ATTTTACCTCCATCTAACTGGGTAAGCCAATTTGATGGATCAGCATGGCAATGGAGCCCA
ATTCGTCAGCAGTTTTACCTTCACCAATTTGCAGTTCAGCAAGCTGACTTTAACTTTAGG
AACGAGTCGGTCAGACAAGAGATGAAGAACATCATGAAATTCTGGCTCGACAAAGGGGCA
GACGGATTCAGAGTCGACGCTCTGCCTTTTCTCATGGAAGCTAATCCTGATGACTACGGC
GGTAGATATCCTGATGATCCCCTTAGCGGAAAAATTGGACTTGAACCTCATCAACTAGGA
TACACCATTCCTCTGTACACTAAAGATCTCATTGAATTATACGATGTAGTTTACGAATGG
CGAGAATACGTGGATCAATATTGGAAGGAAAATGGCGGAGACACTCGAGTGCTGTTGTCC
GAAGGTTACGCAAATATCTCCATGACGATGCTTTACTATGGTAACAAACAGGGAAAATTC
GGTGCCCACTTCCCCTTCAACTTTGATTTCATTACCGATGTCTCTAATAATTCAAATGCA
AGGGACTTCGTTTACACCATTCAGAAATGGCTCACGTACAAGCCCTTCGCAGCAACAGCT
AACTGGGTGTTTGGCAATCATGACAATAATAGGATGGCAACTCGATTCCGAGAAGACATG
GTGGATGGTCTTAACGCCCTGGCAATGATACTACCAGGTGTAGCTGTCACCTACCAGGGA
GAAGAGATCGGTATGCAAGATGGGTACGTGAGCTGGGAGGATACTGTTGATGTAGAAGCC
CTCAACAGAGGCGACAACGAAACCTACATGCTTTACTCGCGAGACCCAGCAAGAACCCCA
TACCAATGGAACGGTTCGCTCAATGCCGGTTTCTCAACCGCCAACAAAACATGGCTACCG
GTGGCTGATAACTATAAGGAACTAAACCTACAAGCTCAAAAGGCAGCTAATGTTAGCCAT
TTTAAAGTTTATCAAAAATTAACAGCTCTTCGCAAGGAGATGTCTATGATCCATGGAGAT
TACGAAGTGAGAGCGTTTTCCGATCGCTCCTTCTACGTAGTACGAAACTTCAGGACCTAC
GACACATTTGTCTTATTGTTCAACGTCGCCGATACAGCAGATATTATCAATCTAACTAGA
ATCCAAGACATAAAAGTGCCCTCCACTGTTGAAGTAGCCAGTATTCATTCCAGTAGGAGA
GCAGGTGACGTCATCGAAGAAAACCTGATACAACTAGAAGCAGGAGAGGCGCTAGTGCTT
CGAGATGCGCCATTGGAATAA

Protein sequence:

MGSCKALGITAALFLGLILVGGIITVAVLWSQDNAVDPPIIIPTDWWEHCVLYQIYPRSF
KDTDGDGIGDLKGITQELEHFVDAGVDAIWMSPIFASPMVDFGYDISNFYEIHYEYGTME
DFEALLEKAHRLGIKVLLDFVPNHASNESDYFKKSEARDPEYEDFFVWADGIPDPNNASN
ILPPSNWVSQFDGSAWQWSPIRQQFYLHQFAVQQADFNFRNESVRQEMKNIMKFWLDKGA
DGFRVDALPFLMEANPDDYGGRYPDDPLSGKIGLEPHQLGYTIPLYTKDLIELYDVVYEW
REYVDQYWKENGGDTRVLLSEGYANISMTMLYYGNKQGKFGAHFPFNFDFITDVSNNSNA
RDFVYTIQKWLTYKPFAATANWVFGNHDNNRMATRFREDMVDGLNALAMILPGVAVTYQG
EEIGMQDGYVSWEDTVDVEALNRGDNETYMLYSRDPARTPYQWNGSLNAGFSTANKTWLP
VADNYKELNLQAQKAANVSHFKVYQKLTALRKEMSMIHGDYEVRAFSDRSFYVVRNFRTY
DTFVLLFNVADTADIINLTRIQDIKVPSTVEVASIHSSRRAGDVIEENLIQLEAGEALVL
RDAPLE
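
Consistency check (illustrative sketch, not part of the original record): the nucleotide sequence above should translate to the protein sequence above if the CDS is read in frame with the standard genetic code. The snippet below is a minimal way to test that. It assumes the standard code (NCBI translation table 1), and the function name `translate_cds` is hypothetical, chosen only for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in T, C, A, G order
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate an in-frame CDS; '*' marks a stop codon, 'X' an unknown codon."""
    # Remove line breaks and other whitespace, then normalise case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First six codons and the final line of the nucleotide sequence above.
print(translate_cds("ATGGGGTCGTGTAAAGCG"))     # MGSCKA
print(translate_cds("CGAGATGCGCCATTGGAATAA"))  # RDAPLE*
```

Applied to the full 1821-nucleotide sequence, this should give 607 codons: 606 residues followed by the terminal stop, which matches the length of the protein sequence listed above.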