New model in OGS2.0 | DPOGS202008  |
---|---|
Genomic Position | scaffold790:+ 62920-71448 |
See gene structure | |
CDS Length | 1569 |
Paired RNAseq reads   | 567 |
Single RNAseq reads   | 1417 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002450 (5e-166) |
Best Drosophila hit   | CG9701 (6e-131) |
Best Human hit | lactase-phlorizin hydrolase preproprotein (2e-98) |
Best NR hit (blastp)   | similar to CG9701-PA [Papilio xuthus] (0.0) |
Best NR hit (blastx)   | similar to CG9701-PA [Papilio xuthus] (0.0) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR001360 Glycoside hydrolase, family 1 IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR018120 Glycoside hydrolase, family 1, active site IPR017853 Glycoside hydrolase, superfamily |
Orthology group | MCL39663 |
Nucleotide sequence:
ATGCAGAGCAAGAGTAGAGTCACCCATAGTGCTGCCATGTTCGCGCTGTTTAGCCTGATG
TTATTAACAAATGGAGTAAACGCGCTTATAAGCAGTAATGGGTTGAGCAACTATTCGTTT
CCAGACAATTTTATTTTTGGAGTAGCGACGGCTGCATTTCAAATAGAAGGAGGTTGGAAT
GAGGGTGGTAAAGGTGAAAGCATGTGGGATACATATTTACACAAACACCCTAAATTCACG
GTGGACCAATCGAACGGAGACGTGGCCGCCGATTCATATCACAAATACAAACAAGATTTA
ATAATGATCAAGTCAATCGAAGTAAAATATTACCGACTTTCAATATCATGGCCAAGAATA
TTACCACATGGAACTGACAACTACATCAGCAAAGATGGAGTTAGATACTATCGAAAGCTT
TTCGAAGAACTAATAAATGCCAATATAACTCCCGTTGTGACACTGTATCATTGGGATATG
CCAACAGCTCTAATGGATTTAGGCGGATGGACTAATCCCAAAATGGTGGATTACTTTGAG
GACTACGCGAGAGTAGCGTTCACACTGTTCGGAGATATTGTGAAAACGTGGACCACTATG
AACGAATTGCATCAACATTGCTTTAACGGCTATGGCGGTAATTTTTTCGTCCCTGCCCTA
AAATCACATGGTGTTGGTGCATATTTATGTTCACATTACATGCTGTTGGCGCACGCACGA
GCTTATCGGTTGTATGACAAGCAATTTAGACCACATCAGAAAGGAAAAGTTGGTATAACT
TTAGACGCATTTTGGGCTGAACCTAAAGATTATAATAAAGAGGAAGATCATGAAGCAGCA
GAACGGTATCTTCAGATGCATGTGGGTTTATTCGCTCATCCAATTTATTCAGACGAAGGA
GACTATCCTCTTCTCGTTCGAAACAGGATTGATGATATGAGCCGCAATCAAGGTTTTGCC
AGATCTCGATTACCATTTTTTACCCCTGAAGAAGTGGCCATGGTTCGAGGTAGTTCAGAT
TTCTTTGGCATCAATCACTACACCACATACTTAATGTCAAACTCATCTATGGAACCTGAA
TGGGTTATTCCCTCTGTGGACCATGACACTGGAGTAAAAATTGAACAGAGCAAAGAATGG
CCTATACCAGGCGCCGAATGGCTCTCAGTTTATCCCCCCGGATTTCGAAAACTCATTAAT
TGGATAACCAAGAGTTATGGTAAAAGAGTGCCTATCATTGTAACAGAAAATGGGGTATCG
GATTTCGGTGGTAAGAACGATTACTCTCGAGTGTCATATTTTAATAACTATTTGGAACAA
CTTTTATTGGCGATTCACGAAGACGGTTGTAATGTATCCGGATACTTCGCTTGGACTTTA
ATGGACGATTTTGAATGGAACGATGGATACAAGGTGAAATTTGGTCTATTTCACGTGGAC
TTCAACAGCCCGGGTAAAGAAAGGACTCCAAAATTATCAGCGCTCAATTACGGCGAAATA
GTTCGCACGAGGCGAGTCAATTTCAACTACATAAAGATGCCATCGTATAAATATAATACT
CTATTGTAA
Protein sequence:
MQSKSRVTHSAAMFALFSLMLLTNGVNALISSNGLSNYSFPDNFIFGVATAAFQIEGGWN
EGGKGESMWDTYLHKHPKFTVDQSNGDVAADSYHKYKQDLIMIKSIEVKYYRLSISWPRI
LPHGTDNYISKDGVRYYRKLFEELINANITPVVTLYHWDMPTALMDLGGWTNPKMVDYFE
DYARVAFTLFGDIVKTWTTMNELHQHCFNGYGGNFFVPALKSHGVGAYLCSHYMLLAHAR
AYRLYDKQFRPHQKGKVGITLDAFWAEPKDYNKEEDHEAAERYLQMHVGLFAHPIYSDEG
DYPLLVRNRIDDMSRNQGFARSRLPFFTPEEVAMVRGSSDFFGINHYTTYLMSNSSMEPE
WVIPSVDHDTGVKIEQSKEWPIPGAEWLSVYPPGFRKLINWITKSYGKRVPIIVTENGVS
DFGGKNDYSRVSYFNNYLEQLLLAIHEDGCNVSGYFAWTLMDDFEWNDGYKVKFGLFHVD
FNSPGKERTPKLSALNYGEIVRTRRVNFNYIKMPSYKYNTLL