Genomic Position | scaffold3930:+ 18842-26768 |
---|---|
CDS Length | 2673 |
Paired RNAseq reads   | 17 |
Single RNAseq reads   | 69 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010537 (2e-66) |
Best Drosophila hit   | CG9701 (1e-117) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (3e-133) |
Best NR hit (blastp)   | glycoside hydrolases [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | lactase-phlorizin hydrolase [Culex quinquefasciatus] (1e-178) |
GeneOntology terms    | GO:0043169 cation binding; GO:0045471 response to ethanol; GO:0005887 integral to plasma membrane; GO:0000016 lactase activity; GO:0010045 response to nickel ion; GO:0010288 response to lead ion; GO:0001666 response to hypoxia; GO:0043627 response to estrogen stimulus; GO:0017042 glycosylceramidase activity; GO:0007584 response to nutrient; GO:0009725 response to hormone stimulus; GO:0042594 response to starvation; GO:0005624 membrane fraction; GO:0005886 plasma membrane; GO:0005903 brush border; GO:0016740 transferase activity; GO:0042493 response to drug; GO:0016324 apical plasma membrane; GO:0005975 carbohydrate metabolic process; GO:0009744 response to sucrose stimulus |
InterPro families    | IPR018120 Glycoside hydrolase, family 1, active site; IPR001360 Glycoside hydrolase, family 1; IPR013781 Glycoside hydrolase, subgroup, catalytic core; IPR017853 Glycoside hydrolase, superfamily |
Orthology group | MCL10077 |
Nucleotide sequence:
ATGTCCACCCAGTTGCAAGTAGGAGGTAAGTCTGAAAACATATGGGATCGCGTATCACAC
AGGGAACCTTGTGTTGTCGACAACTGCGACACAGGTGACGTTGCCGGTGATTCGTATCAT
CAATATAAGCGTGATGTGGAAATGATGCGGGAGCTAGGTCTCGACTTTTATAGGTTCTCT
CTCTCCTGGTCGAGAATATTACCAACGAGTTTTCCAGACCAAATTAATGAAAAAGCAGTA
CAATATTATAATAATTTGATAAATGAGATGCTCAAATACAACATACAACCCATGGTGACT
CTTTATCACTGGGATTTGCCTCAGAAGCTGCAAGATCTGGGAGGATGGACCAATCCCCAT
ATCGTTGATTGGTTTACCGATTACTCCAGAGTAGTGTTTCGGTTATTTGGAGATAGGGTT
AAGTATTGGATAACTATCAATGAACCGCGAGAGGTTTGTTATCAGGGATATGCAGCACAG
TCTCTAGCTCCTCTTTACAATATTTCTGGATATGCTGATTACATGTGTGCCAAAAATTTA
TTGCTAGTTCATGCCAACGTCTATCATTTATATAACAATGTATTTCGTAAAGCCCAAGGT
GGTCAAATCGGTATAACAATAAGCGCACAATGGTACGAACCTGAATCAGAGGAAGATGTA
GAGGCTGCTGAGGATTACAGACAGTTTGAGTGGGGAATTTACGCAAATCCAATATTTTCG
GAATCTGGAGACTTTCCAGCAGTCATGAAACGTAGGATAGCAGCAAAGAGTAAGGAACAA
GGATTTCCAAGATCACGATTACCACAATTCACTCCGGAGGAGGTTGATTTAATTAAAGGC
AGTTTCGACTTCTTTGGGTTGAATCATTATACTACTTATAGGGTTTACAGAAATGAATCA
GTCTATGGACATTATAATTCACCATCTACTTACGATGATCTCGAAGCAATAAGTTATCAA
GATAGTTCATGGGATTCAGCTGCTTCAAAGTGGTTAAAGCGTGTGCCCTGGGGATTTTAT
AATTTGCTTACAAAAATACGAAAGGACTACAACAACCCGCCAGTTTTCATCACTGAGAAT
GGATTCTCAACCCGAGGTGGTTTAGTTGACGACGACCGCATAAAGTATTACAGAACATAC
ATTGACGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGAGTTTACGCAGCGTGG
AGTTTGATGGACAATTTTGAATGGATGAGGGGATACAGCGAACGTTTCGGACTGTACGAG
GTGGACTACGAGAGTCCTGAACGCACCCGAACTCCTCGCAAATCTGCTTATGTATACAAG
GAGATGCTGCGCACACGAACACTGGACTATCATTATGAACCTGATATGAGCCTGGGAATG
AATGAGCCCACTGCTGAGCAACGGCCTCTTCTTGCATGGAGAAGATTCCTGGCATTTATA
TTGGCTGTACTTGCAAATAGCAAAAAAATCTCTCGACATGAAGCGAGAAAAATACCCGAC
CACTTACTTTTCGGAGCTGCTACGGCATCGTACCAAATAGAAGGCACTTGGAATGAAGAC
GGAAAATCTGAAAATATATGGGATCGCGTATCACACAGGGAACCTTGTGTTGTCGACAAC
TGCGACACAGGTGACCTTGCCGATGATTCGTATCATCAATATAAGCGTGATGTGGAAATG
ATGCGGGAACTAGGTCTCGACTTCTATAGGTTCTCTCTCTCCTGGACGAGAATATTACCA
ACGAGTTTTCCAGACCAAATAAATGAAAAAGGAGTACAATATTATAATAATTTGATAAAT
GAGATGCTCAAATACAACATACAACCCATGGTGACTCTTTATCACTGGGATTTACCTCAG
AAGTTGCAAGATCTGGGAGGATGGACCAATCCCCATATCGTTGATTGGTTTACCGATTAC
TCCAGAGTAGTGTTCCAGTTATTTGGAGATAGGGTTAAGTATTGGTGGGGACTGTATGCA
AATCCAATATTTTCCGAATTTGGGGACTATCCAGCAGTCATGAAAGATAGAATAGCAGCA
AAGAGTAAGAAACAAGGATTTCCAAGATCGCGATTACCACAATTCACTCCTGAAGAAATA
GATTTAATTAAAGGAAGTTCGGATTTCATTGGATTAAATCATTATACTACTAACATTGTT
TATAGGAACGAATCTGTTTATGGATATTATAGTTCGCCATCTTTTTATGATGATATTGAA
GTAATAAGTTATCAAGATAGTTCCTGGGAGTCAGCTGCTTCCAACTGGTTAAAGAGTGTA
CCCTGGGGATTCTATAAGTTATTAACAAAAATACGAGAGGACTACAACAACCCGCCAGTT
TTCATCACTGAGAATGGATTCTCAACCCGAGGTGGTCTAATTGACGACGACCGCGTAAAG
TATTACAGAACATACATAGATGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGA
GTTTATGCAGCGTGGAGTTTGATGGACAATTTCGAATGGATGAGGGGATACAGCGAACGT
TTCGGACTGTACGAGGTGGACTACGAGAGTCCTGACCGCACCCGAACTCCTCGCAAGTCT
GCTTACGTATACAAAGAGATGCTGCGCACACGAACACTGGACTATCATTATGAACCTGAT
ATGAGCTTGGGAATGAATGTCGATGATAATTAA
Protein sequence:
MSTQLQVGGKSENIWDRVSHREPCVVDNCDTGDVAGDSYHQYKRDVEMMRELGLDFYRFS
LSWSRILPTSFPDQINEKAVQYYNNLINEMLKYNIQPMVTLYHWDLPQKLQDLGGWTNPH
IVDWFTDYSRVVFRLFGDRVKYWITINEPREVCYQGYAAQSLAPLYNISGYADYMCAKNL
LLVHANVYHLYNNVFRKAQGGQIGITISAQWYEPESEEDVEAAEDYRQFEWGIYANPIFS
ESGDFPAVMKRRIAAKSKEQGFPRSRLPQFTPEEVDLIKGSFDFFGLNHYTTYRVYRNES
VYGHYNSPSTYDDLEAISYQDSSWDSAASKWLKRVPWGFYNLLTKIRKDYNNPPVFITEN
GFSTRGGLVDDDRIKYYRTYIDAMLDAIEDGSDIRVYAAWSLMDNFEWMRGYSERFGLYE
VDYESPERTRTPRKSAYVYKEMLRTRTLDYHYEPDMSLGMNEPTAEQRPLLAWRRFLAFI
LAVLANSKKISRHEARKIPDHLLFGAATASYQIEGTWNEDGKSENIWDRVSHREPCVVDN
CDTGDLADDSYHQYKRDVEMMRELGLDFYRFSLSWTRILPTSFPDQINEKGVQYYNNLIN
EMLKYNIQPMVTLYHWDLPQKLQDLGGWTNPHIVDWFTDYSRVVFQLFGDRVKYWWGLYA
NPIFSEFGDYPAVMKDRIAAKSKKQGFPRSRLPQFTPEEIDLIKGSSDFIGLNHYTTNIV
YRNESVYGYYSSPSFYDDIEVISYQDSSWESAASNWLKSVPWGFYKLLTKIREDYNNPPV
FITENGFSTRGGLIDDDRVKYYRTYIDAMLDAIEDGSDIRVYAAWSLMDNFEWMRGYSER
FGLYEVDYESPDRTRTPRKSAYVYKEMLRTRTLDYHYEPDMSLGMNVDDN
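
Consistency check (illustrative sketch, not part of the original record): the Python below parses the genomic position string and relates the reported CDS length to the expected protein length. The helper names `parse_position` and `expected_protein_length` are hypothetical. The sketch assumes that coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the nucleotide sequence above ends in TAA).

```python
import re


def parse_position(text):
    """Split a position string such as 'scaffold3930:+ 18842-26768' into parts."""
    match = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


def expected_protein_length(cds_length, has_stop_codon=True):
    """Number of residues implied by a CDS length given in nucleotides."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


scaffold, strand, start, end = parse_position("scaffold3930:+ 18842-26768")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

cds_length = 2673  # "CDS Length" value from the table above

print(scaffold, strand, genomic_span)       # scaffold3930 + 7927
print(expected_protein_length(cds_length))  # 890
```

Under these assumptions the genomic span (7927 bp) is longer than the CDS (2673 nt), which would be consistent with the gene model containing introns, and the CDS length implies 890 residues before the stop codon.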