DPGLEAN03113 in OGS1.0

Genomic Position  scaffold3930:+ 18842-26768
CDS Length  2673
Paired RNAseq reads  17
Single RNAseq reads  69
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010537 (2e-66)
Best Drosophila hit  CG9701 (1e-117)
Best Human hit  lactase-phlorizin hydrolase preproprotein (3e-133)
Best NR hit (blastp)  glycoside hydrolases [Aedes aegypti] (0.0)
Best NR hit (blastx)  lactase-phlorizin hydrolase [Culex quinquefasciatus] (1e-178)
GeneOntology terms
GO:0043169 cation binding
GO:0045471 response to ethanol
GO:0005887 integral to plasma membrane
GO:0000016 lactase activity
GO:0010045 response to nickel ion
GO:0010288 response to lead ion
GO:0001666 response to hypoxia
GO:0043627 response to estrogen stimulus
GO:0017042 glycosylceramidase activity
GO:0007584 response to nutrient
GO:0009725 response to hormone stimulus
GO:0042594 response to starvation
GO:0005624 membrane fraction
GO:0005886 plasma membrane
GO:0005903 brush border
GO:0016740 transferase activity
GO:0042493 response to drug
GO:0016324 apical plasma membrane
GO:0005975 carbohydrate metabolic process
GO:0009744 response to sucrose stimulus
InterPro families
IPR018120 Glycoside hydrolase, family 1, active site
IPR001360 Glycoside hydrolase, family 1
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR017853 Glycoside hydrolase, superfamily
Orthology group  MCL10077

Nucleotide sequence:

ATGTCCACCCAGTTGCAAGTAGGAGGTAAGTCTGAAAACATATGGGATCGCGTATCACAC
AGGGAACCTTGTGTTGTCGACAACTGCGACACAGGTGACGTTGCCGGTGATTCGTATCAT
CAATATAAGCGTGATGTGGAAATGATGCGGGAGCTAGGTCTCGACTTTTATAGGTTCTCT
CTCTCCTGGTCGAGAATATTACCAACGAGTTTTCCAGACCAAATTAATGAAAAAGCAGTA
CAATATTATAATAATTTGATAAATGAGATGCTCAAATACAACATACAACCCATGGTGACT
CTTTATCACTGGGATTTGCCTCAGAAGCTGCAAGATCTGGGAGGATGGACCAATCCCCAT
ATCGTTGATTGGTTTACCGATTACTCCAGAGTAGTGTTTCGGTTATTTGGAGATAGGGTT
AAGTATTGGATAACTATCAATGAACCGCGAGAGGTTTGTTATCAGGGATATGCAGCACAG
TCTCTAGCTCCTCTTTACAATATTTCTGGATATGCTGATTACATGTGTGCCAAAAATTTA
TTGCTAGTTCATGCCAACGTCTATCATTTATATAACAATGTATTTCGTAAAGCCCAAGGT
GGTCAAATCGGTATAACAATAAGCGCACAATGGTACGAACCTGAATCAGAGGAAGATGTA
GAGGCTGCTGAGGATTACAGACAGTTTGAGTGGGGAATTTACGCAAATCCAATATTTTCG
GAATCTGGAGACTTTCCAGCAGTCATGAAACGTAGGATAGCAGCAAAGAGTAAGGAACAA
GGATTTCCAAGATCACGATTACCACAATTCACTCCGGAGGAGGTTGATTTAATTAAAGGC
AGTTTCGACTTCTTTGGGTTGAATCATTATACTACTTATAGGGTTTACAGAAATGAATCA
GTCTATGGACATTATAATTCACCATCTACTTACGATGATCTCGAAGCAATAAGTTATCAA
GATAGTTCATGGGATTCAGCTGCTTCAAAGTGGTTAAAGCGTGTGCCCTGGGGATTTTAT
AATTTGCTTACAAAAATACGAAAGGACTACAACAACCCGCCAGTTTTCATCACTGAGAAT
GGATTCTCAACCCGAGGTGGTTTAGTTGACGACGACCGCATAAAGTATTACAGAACATAC
ATTGACGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGAGTTTACGCAGCGTGG
AGTTTGATGGACAATTTTGAATGGATGAGGGGATACAGCGAACGTTTCGGACTGTACGAG
GTGGACTACGAGAGTCCTGAACGCACCCGAACTCCTCGCAAATCTGCTTATGTATACAAG
GAGATGCTGCGCACACGAACACTGGACTATCATTATGAACCTGATATGAGCCTGGGAATG
AATGAGCCCACTGCTGAGCAACGGCCTCTTCTTGCATGGAGAAGATTCCTGGCATTTATA
TTGGCTGTACTTGCAAATAGCAAAAAAATCTCTCGACATGAAGCGAGAAAAATACCCGAC
CACTTACTTTTCGGAGCTGCTACGGCATCGTACCAAATAGAAGGCACTTGGAATGAAGAC
GGAAAATCTGAAAATATATGGGATCGCGTATCACACAGGGAACCTTGTGTTGTCGACAAC
TGCGACACAGGTGACCTTGCCGATGATTCGTATCATCAATATAAGCGTGATGTGGAAATG
ATGCGGGAACTAGGTCTCGACTTCTATAGGTTCTCTCTCTCCTGGACGAGAATATTACCA
ACGAGTTTTCCAGACCAAATAAATGAAAAAGGAGTACAATATTATAATAATTTGATAAAT
GAGATGCTCAAATACAACATACAACCCATGGTGACTCTTTATCACTGGGATTTACCTCAG
AAGTTGCAAGATCTGGGAGGATGGACCAATCCCCATATCGTTGATTGGTTTACCGATTAC
TCCAGAGTAGTGTTCCAGTTATTTGGAGATAGGGTTAAGTATTGGTGGGGACTGTATGCA
AATCCAATATTTTCCGAATTTGGGGACTATCCAGCAGTCATGAAAGATAGAATAGCAGCA
AAGAGTAAGAAACAAGGATTTCCAAGATCGCGATTACCACAATTCACTCCTGAAGAAATA
GATTTAATTAAAGGAAGTTCGGATTTCATTGGATTAAATCATTATACTACTAACATTGTT
TATAGGAACGAATCTGTTTATGGATATTATAGTTCGCCATCTTTTTATGATGATATTGAA
GTAATAAGTTATCAAGATAGTTCCTGGGAGTCAGCTGCTTCCAACTGGTTAAAGAGTGTA
CCCTGGGGATTCTATAAGTTATTAACAAAAATACGAGAGGACTACAACAACCCGCCAGTT
TTCATCACTGAGAATGGATTCTCAACCCGAGGTGGTCTAATTGACGACGACCGCGTAAAG
TATTACAGAACATACATAGATGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGA
GTTTATGCAGCGTGGAGTTTGATGGACAATTTCGAATGGATGAGGGGATACAGCGAACGT
TTCGGACTGTACGAGGTGGACTACGAGAGTCCTGACCGCACCCGAACTCCTCGCAAGTCT
GCTTACGTATACAAAGAGATGCTGCGCACACGAACACTGGACTATCATTATGAACCTGAT
ATGAGCTTGGGAATGAATGTCGATGATAATTAA

Protein sequence:

MSTQLQVGGKSENIWDRVSHREPCVVDNCDTGDVAGDSYHQYKRDVEMMRELGLDFYRFS
LSWSRILPTSFPDQINEKAVQYYNNLINEMLKYNIQPMVTLYHWDLPQKLQDLGGWTNPH
IVDWFTDYSRVVFRLFGDRVKYWITINEPREVCYQGYAAQSLAPLYNISGYADYMCAKNL
LLVHANVYHLYNNVFRKAQGGQIGITISAQWYEPESEEDVEAAEDYRQFEWGIYANPIFS
ESGDFPAVMKRRIAAKSKEQGFPRSRLPQFTPEEVDLIKGSFDFFGLNHYTTYRVYRNES
VYGHYNSPSTYDDLEAISYQDSSWDSAASKWLKRVPWGFYNLLTKIRKDYNNPPVFITEN
GFSTRGGLVDDDRIKYYRTYIDAMLDAIEDGSDIRVYAAWSLMDNFEWMRGYSERFGLYE
VDYESPERTRTPRKSAYVYKEMLRTRTLDYHYEPDMSLGMNEPTAEQRPLLAWRRFLAFI
LAVLANSKKISRHEARKIPDHLLFGAATASYQIEGTWNEDGKSENIWDRVSHREPCVVDN
CDTGDLADDSYHQYKRDVEMMRELGLDFYRFSLSWTRILPTSFPDQINEKGVQYYNNLIN
EMLKYNIQPMVTLYHWDLPQKLQDLGGWTNPHIVDWFTDYSRVVFQLFGDRVKYWWGLYA
NPIFSEFGDYPAVMKDRIAAKSKKQGFPRSRLPQFTPEEIDLIKGSSDFIGLNHYTTNIV
YRNESVYGYYSSPSFYDDIEVISYQDSSWESAASNWLKSVPWGFYKLLTKIREDYNNPPV
FITENGFSTRGGLIDDDRVKYYRTYIDAMLDAIEDGSDIRVYAAWSLMDNFEWMRGYSER
FGLYEVDYESPDRTRTPRKSAYVYKEMLRTRTLDYHYEPDMSLGMNVDDN