New model in OGS2.0 | DPOGS210178  |
---|---|
Genomic Position | scaffold1224:+ 7193-10752 |
See gene structure | |
CDS Length | 1845 |
Paired RNAseq reads   | 152 |
Single RNAseq reads   | 510 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014144 (1e-97) |
Best Drosophila hit   | CG9701 (9e-70) |
Best Human hit | cytosolic beta-glucosidase isoform a (2e-64) |
Best NR hit (blastp)   | seminal fluid protein HACP047 [Heliconius erato] (2e-87) |
Best NR hit (blastx)   | seminal fluid protein HACP047 [Heliconius erato] (2e-90) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds |
InterPro families    | IPR001360 Glycoside hydrolase, family 1 IPR017853 Glycoside hydrolase, superfamily IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | MCL10002 |
Nucleotide sequence:
ATGCAGAACGAGTGGTTAGACATAGGAGATTTTTGCATCCCATTAGCACTAAAATGGCGC
ACCTTAATCCATGATTGGTCGCCAGCATTGCTAAAATTCTATCTCAATGCGTTCCAGATG
ACTCTCCCAGATCAGAGTAATTTAGTAAGATGGGGTAAAGGTACCGAAAAGACTTGCTAT
ATCTGTGGGAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAGGTACTCCTC
GATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTAGAAATCATACGTGAAGCGGTT
AGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCAGTAGGTTTT
GTGAGAGAGGGCACTAGGGCTATAAAAACAAATGTTAAGCCTTACTCCATCCTTAAAGCG
GCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATCCCCGAGGAT
ATTTGTGCGTCGGCCTCCAGACCGGACATATTCATGTATTCGCGAATCTTAAAGCGCGTT
GTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCATACCATCAAG
GTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTCGTGGATTTA
TACGCGGTAGAAGTGGGAGCGAGAGGTAAGGGTCCCAGCGTTTGGGACGATTACGTCCAC
GAGAATCGTGTGAAAATTAAGGATAATTCAAACGGAGATGTCGCGGCTGATTCCTACCAT
TTGTGGAAAGAAGACATAAAGATAACAAAGGAATTGGGTCTGCACTTTTATCGTTTCTCA
ATAAACTGGCCAAGGATTCTGCCAACTGGTTTTTCAAATAAAATAAACAAAGCTGGTGTG
AAATATTACAATGAACTTATAGATGGTCTTGTGAGTGCTGGTGTTGAACCTGTCGTCACT
CTCTATCATTGGGAGACGCCTATTATAATCCACAAACTTGGTGGGTGGACAAATCCTTTG
ATAGTGAAATGGTTTGCACATTACGCCAGAATCGTGTTTTCCCTTTTCGGTGACAGAGTT
AAAACCTGGATAACAATAAATGAAGCGAACGTTCAATGTGATTATTTTTACAACTCTGGA
ATATTCATTACTGCTAAGGAAGATGTCTTTGCACCATTTCTGTGCAATAAACACATTTTA
ATGGCGCATGCGCATGCGTACAGGATATATGAAAAAGAGTTTAAACCTAAGTATGGAGGG
AGTGTATCTTTGGCTAATAATTTTCTGTGGCTGGACCCATACATCTCGAATCACGAAGAA
CTTGCTGAGCTCGGCAGAGAACACGCGATTGGGAGATATTCCCATCCAATCTATTCCAAA
AAGGGTGGTTGGCCTCCCCTACTAGAAAAAGTCCTACTGGAGTATAGTTTGAAACAAGGA
TACAAGGAATCCAGATTACCAACATTTACGAAACAAGAGAAGGAATTTGTAAGAGGCACG
GCTGATTTTTACGGCGTGAACTATTATACGTCTAATTTGATCAGGCCAATTAAACCCGGC
GAAGATCCCGGATATTTCTTCATAACAGGAGTACCGGAACTGAACGCCATTTTGGTACAT
CCGAATAACACTTGGTATGGGGCTCTAGATATATTACCGGTGTATCCGCTAGGTCTACGC
CGCTCATTGTCTTGGTTGAAGAAAAGCTACGGTGATATCGATATTCTTATAACAGAATGT
GGATTCTCAACCGCAGGATACGATCTCAAAGATTACAAAAGAACTAACTTCTACAGAGAC
CACTTAGAACAGGCGAGTCATAATATATTCTGTAAAAAAAAATGA
Protein sequence:
MQNEWLDIGDFCIPLALKWRTLIHDWSPALLKFYLNAFQMTLPDQSNLVRWGKGTEKTCY
ICGKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIREAVSLSVARAQKGITTNERSVGF
VREGTRAIKTNVKPYSILKAATDWTIMMDTCEKQYKIPEDICASASRPDIFMYSRILKRV
VMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVVDLYAVEVGARGKGPSVWDDYVH
ENRVKIKDNSNGDVAADSYHLWKEDIKITKELGLHFYRFSINWPRILPTGFSNKINKAGV
KYYNELIDGLVSAGVEPVVTLYHWETPIIIHKLGGWTNPLIVKWFAHYARIVFSLFGDRV
KTWITINEANVQCDYFYNSGIFITAKEDVFAPFLCNKHILMAHAHAYRIYEKEFKPKYGG
SVSLANNFLWLDPYISNHEELAELGREHAIGRYSHPIYSKKGGWPPLLEKVLLEYSLKQG
YKESRLPTFTKQEKEFVRGTADFYGVNYYTSNLIRPIKPGEDPGYFFITGVPELNAILVH
PNNTWYGALDILPVYPLGLRRSLSWLKKSYGDIDILITECGFSTAGYDLKDYKRTNFYRD
HLEQASHNIFCKKK