New model in OGS2.0 | DPOGS207810  |
---|---|
Genomic Position | scaffold114:- 63644-76168 |
See gene structure | |
CDS Length | 2013 |
Paired RNAseq reads   | 773 |
Single RNAseq reads   | 2170 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005500 (0.0) |
Best Drosophila hit   | CG15117, isoform C (0.0) |
Best Human hit | beta-glucuronidase precursor (2e-134) |
Best NR hit (blastp)   | PREDICTED: similar to CG15117 CG15117-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | GL11760 [Drosophila persimilis] (0.0) |
GeneOntology terms    | GO:0004566 beta-glucuronidase activity GO:0005975 carbohydrate metabolic process GO:0043169 cation binding |
InterPro families    | IPR006101 Glycoside hydrolase, family 2 IPR006103 Glycoside hydrolase, family 2, TIM barrel IPR006104 Glycoside hydrolase, family 2, N-terminal domain IPR006102 Glycoside hydrolase, family 2, immunoglobulin-like beta-sandwich IPR013812 Glycoside hydrolase, family 2/20, immunoglobulin-like beta-sandwich domain IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR017853 Glycoside hydrolase, superfamily IPR008979 Galactose-binding domain-like IPR023232 Glycoside hydrolase, family 2, active site IPR023230 Glycoside hydrolase, family 2, conserved site |
Orthology group | MCL10553 |
Nucleotide sequence:
ATGGCTGCGTTTCGTCCGATGATGTTATCAACCAGATCTCGTCTGTTCCTCATCGCAGCG
GTTTTAACAGTAGTGGAGGCGGAGTTGCCTGGTACATCCAGCACGAACGAGATTAACCAA
ACCCCAAAGAGACGATCCACATTTGTCGGGGGTACATTATACCCCCAAGCATCGGAGACG
AGAGACCTAAAAAGACTAGACGGTATATGGAAATTCAGAAAATCACCCACCGACCCTGAA
TACGGTCAACGTAATGGCTGGTACGAACAGGATCTTGAAAAGACTGGTCCCGTGATCGAT
ATGCCGGTCCCTTCTTCATACAATGACGTGGGAGAGGATCCTTCGCTGAGGGATCACGTT
GGTCTAGTTTGGTACGATCGCCGTTTCTACGTCCCTCACTGGTGGAAAACCGCGGGACAA
CGAGTGTGGCTGAGATTCAGCAGCGTACATTACGCGGCTCTAGTTTATGTCAACGGTCAA
GCTGCCACGTATCACGAGGTGGGACACCTTCCATTCGAAGTGGAGATCACTGATATTGTC
TCATACAATACGAGCAATCTACTCACCGTCGTTGTTGACAACACTTTGCTTAGTGACACC
GTACCACAGGGCAATATCAAGGACATATTTGTGGGAAACTCCAAAATCCGTCAAGAGCAG
ACGTACACCTTCGATTTCTTCAACTACGCCGGCATTCACCGCTCCGTGTTCCTGTACTCG
ACACCACAGACATACATAGATGACGTCATCGTGAATACAGACATACAAGGACTCACAGGC
TTCGTTGTTTACAACATAACATACAAGGGTACCCCGCGAGCGCAATGTTTCGTTCAATTA
TACGACAAACTTGGCAACCAAGTGACAGCGGCTAATGAGTGCGCTGGTCTACTGGAGATC
GGGAACGCTAACTTCTGGTGGCCTTATCTGATGCACCCGGAACCAGGTTACCTCTATACT
TTGAAGACCACATTAATAGGCTCGCTCGGTGAAACTATAGACACTTACAGTCTTAAAGTT
GGCATTAGAACTGTCACGTGGACGAACACCTCAATCTACCTCAACGATAAGCCCATCTAC
CTCAGAGGGTTCGGGATGCACGAAGACTCAGACTTGCGTGGTAAAGGTTGGGACCCGGTG
TTGTGGGTGAAGAATTTCAACTTGATAAAGTGGACCGGCGGTAACGCATTCCGAACCTCG
CACTATCCTTACGCCGAAGAAATATACCAGCTGGCCGACGAGCACGGCATCATGATCATT
GACGAATGCCCCAGTGTCGATACCGACATTTTCACGGATTCACTGCTGGAGAAGCACAAA
CAGTCCCTCACTGAGCTCATAAGACGTGATAAGAACCACGCCAGCGTCATCATGTGGTCC
ATCGCCAACGAGCCGCGGTCCGCTAACATCAGAGCCGACGCGTATTTCCAAAAAGTTGTT
AAACATGTCAAATCAATGGATCTCTCTAGACCGGTCACTATAGCTATAGCTCAGAGCCAT
ATCGCTGATAGATCGGGTCAACATCTAGATGTGATATCGTTCAACCGCTACAACGGCTGG
TACTCTAACACCGGTTCGTTATTAAACATCGCCGCTAACGTCGCGGACGAGGCCACGGCC
TTCAACATCAGATACAACAAACCCATCATCATGATGGAGTACGGAGCTGACACTATCGCT
GGTCTCCATTTGTTGCCAGAATACGTATGGTCTGAGGAGTACCAAGTATCGTTGATGTCG
GAACACTTCAAGGCTTTCGATCGTCTGCGACAGGCGGGCTTCTTCGTGGGAGAGTTCATA
TGGAACTTCGCTGACTTTAAAACAGCTCAGACAATAACCCGAGTTGGCGGGAACAAGAAA
GGTATATTCACACGTTCGCGCCAACCGAAAGCGTCCGCTCATCACCTCCGCGAGCGTTAC
CTCGCGCTCGCCGCCGCCGACACTAACTCGCCACCACCCGAATCACCGTACTACGTCAGC
GACCATCTACCATTTAAACACGAAGAATTATAA
Protein sequence:
MAAFRPMMLSTRSRLFLIAAVLTVVEAELPGTSSTNEINQTPKRRSTFVGGTLYPQASET
RDLKRLDGIWKFRKSPTDPEYGQRNGWYEQDLEKTGPVIDMPVPSSYNDVGEDPSLRDHV
GLVWYDRRFYVPHWWKTAGQRVWLRFSSVHYAALVYVNGQAATYHEVGHLPFEVEITDIV
SYNTSNLLTVVVDNTLLSDTVPQGNIKDIFVGNSKIRQEQTYTFDFFNYAGIHRSVFLYS
TPQTYIDDVIVNTDIQGLTGFVVYNITYKGTPRAQCFVQLYDKLGNQVTAANECAGLLEI
GNANFWWPYLMHPEPGYLYTLKTTLIGSLGETIDTYSLKVGIRTVTWTNTSIYLNDKPIY
LRGFGMHEDSDLRGKGWDPVLWVKNFNLIKWTGGNAFRTSHYPYAEEIYQLADEHGIMII
DECPSVDTDIFTDSLLEKHKQSLTELIRRDKNHASVIMWSIANEPRSANIRADAYFQKVV
KHVKSMDLSRPVTIAIAQSHIADRSGQHLDVISFNRYNGWYSNTGSLLNIAANVADEATA
FNIRYNKPIIMMEYGADTIAGLHLLPEYVWSEEYQVSLMSEHFKAFDRLRQAGFFVGEFI
WNFADFKTAQTITRVGGNKKGIFTRSRQPKASAHHLRERYLALAAADTNSPPPESPYYVS
DHLPFKHEEL