DPGLEAN13066 in OGS1.0

New model in OGS2.0  DPOGS207810
Genomic Position  scaffold114:- 63644-76168
CDS Length  2013
Paired RNAseq reads  773
Single RNAseq reads  2170
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005500 (0.0)
Best Drosophila hit  CG15117, isoform C (0.0)
Best Human hit  beta-glucuronidase precursor (2e-134)
Best NR hit (blastp)  PREDICTED: similar to CG15117 CG15117-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  GL11760 [Drosophila persimilis] (0.0)
Gene Ontology terms
GO:0004566 beta-glucuronidase activity
GO:0005975 carbohydrate metabolic process
GO:0043169 cation binding
InterPro families
IPR006101 Glycoside hydrolase, family 2
IPR006103 Glycoside hydrolase, family 2, TIM barrel
IPR006104 Glycoside hydrolase, family 2, N-terminal domain
IPR006102 Glycoside hydrolase, family 2, immunoglobulin-like beta-sandwich
IPR013812 Glycoside hydrolase, family 2/20, immunoglobulin-like beta-sandwich domain
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR017853 Glycoside hydrolase, superfamily
IPR008979 Galactose-binding domain-like
IPR023232 Glycoside hydrolase, family 2, active site
IPR023230 Glycoside hydrolase, family 2, conserved site
Orthology group  MCL10553

Nucleotide sequence:

ATGGCTGCGTTTCGTCCGATGATGTTATCAACCAGATCTCGTCTGTTCCTCATCGCAGCG
GTTTTAACAGTAGTGGAGGCGGAGTTGCCTGGTACATCCAGCACGAACGAGATTAACCAA
ACCCCAAAGAGACGATCCACATTTGTCGGGGGTACATTATACCCCCAAGCATCGGAGACG
AGAGACCTAAAAAGACTAGACGGTATATGGAAATTCAGAAAATCACCCACCGACCCTGAA
TACGGTCAACGTAATGGCTGGTACGAACAGGATCTTGAAAAGACTGGTCCCGTGATCGAT
ATGCCGGTCCCTTCTTCATACAATGACGTGGGAGAGGATCCTTCGCTGAGGGATCACGTT
GGTCTAGTTTGGTACGATCGCCGTTTCTACGTCCCTCACTGGTGGAAAACCGCGGGACAA
CGAGTGTGGCTGAGATTCAGCAGCGTACATTACGCGGCTCTAGTTTATGTCAACGGTCAA
GCTGCCACGTATCACGAGGTGGGACACCTTCCATTCGAAGTGGAGATCACTGATATTGTC
TCATACAATACGAGCAATCTACTCACCGTCGTTGTTGACAACACTTTGCTTAGTGACACC
GTACCACAGGGCAATATCAAGGACATATTTGTGGGAAACTCCAAAATCCGTCAAGAGCAG
ACGTACACCTTCGATTTCTTCAACTACGCCGGCATTCACCGCTCCGTGTTCCTGTACTCG
ACACCACAGACATACATAGATGACGTCATCGTGAATACAGACATACAAGGACTCACAGGC
TTCGTTGTTTACAACATAACATACAAGGGTACCCCGCGAGCGCAATGTTTCGTTCAATTA
TACGACAAACTTGGCAACCAAGTGACAGCGGCTAATGAGTGCGCTGGTCTACTGGAGATC
GGGAACGCTAACTTCTGGTGGCCTTATCTGATGCACCCGGAACCAGGTTACCTCTATACT
TTGAAGACCACATTAATAGGCTCGCTCGGTGAAACTATAGACACTTACAGTCTTAAAGTT
GGCATTAGAACTGTCACGTGGACGAACACCTCAATCTACCTCAACGATAAGCCCATCTAC
CTCAGAGGGTTCGGGATGCACGAAGACTCAGACTTGCGTGGTAAAGGTTGGGACCCGGTG
TTGTGGGTGAAGAATTTCAACTTGATAAAGTGGACCGGCGGTAACGCATTCCGAACCTCG
CACTATCCTTACGCCGAAGAAATATACCAGCTGGCCGACGAGCACGGCATCATGATCATT
GACGAATGCCCCAGTGTCGATACCGACATTTTCACGGATTCACTGCTGGAGAAGCACAAA
CAGTCCCTCACTGAGCTCATAAGACGTGATAAGAACCACGCCAGCGTCATCATGTGGTCC
ATCGCCAACGAGCCGCGGTCCGCTAACATCAGAGCCGACGCGTATTTCCAAAAAGTTGTT
AAACATGTCAAATCAATGGATCTCTCTAGACCGGTCACTATAGCTATAGCTCAGAGCCAT
ATCGCTGATAGATCGGGTCAACATCTAGATGTGATATCGTTCAACCGCTACAACGGCTGG
TACTCTAACACCGGTTCGTTATTAAACATCGCCGCTAACGTCGCGGACGAGGCCACGGCC
TTCAACATCAGATACAACAAACCCATCATCATGATGGAGTACGGAGCTGACACTATCGCT
GGTCTCCATTTGTTGCCAGAATACGTATGGTCTGAGGAGTACCAAGTATCGTTGATGTCG
GAACACTTCAAGGCTTTCGATCGTCTGCGACAGGCGGGCTTCTTCGTGGGAGAGTTCATA
TGGAACTTCGCTGACTTTAAAACAGCTCAGACAATAACCCGAGTTGGCGGGAACAAGAAA
GGTATATTCACACGTTCGCGCCAACCGAAAGCGTCCGCTCATCACCTCCGCGAGCGTTAC
CTCGCGCTCGCCGCCGCCGACACTAACTCGCCACCACCCGAATCACCGTACTACGTCAGC
GACCATCTACCATTTAAACACGAAGAATTATAA

Protein sequence:

MAAFRPMMLSTRSRLFLIAAVLTVVEAELPGTSSTNEINQTPKRRSTFVGGTLYPQASET
RDLKRLDGIWKFRKSPTDPEYGQRNGWYEQDLEKTGPVIDMPVPSSYNDVGEDPSLRDHV
GLVWYDRRFYVPHWWKTAGQRVWLRFSSVHYAALVYVNGQAATYHEVGHLPFEVEITDIV
SYNTSNLLTVVVDNTLLSDTVPQGNIKDIFVGNSKIRQEQTYTFDFFNYAGIHRSVFLYS
TPQTYIDDVIVNTDIQGLTGFVVYNITYKGTPRAQCFVQLYDKLGNQVTAANECAGLLEI
GNANFWWPYLMHPEPGYLYTLKTTLIGSLGETIDTYSLKVGIRTVTWTNTSIYLNDKPIY
LRGFGMHEDSDLRGKGWDPVLWVKNFNLIKWTGGNAFRTSHYPYAEEIYQLADEHGIMII
DECPSVDTDIFTDSLLEKHKQSLTELIRRDKNHASVIMWSIANEPRSANIRADAYFQKVV
KHVKSMDLSRPVTIAIAQSHIADRSGQHLDVISFNRYNGWYSNTGSLLNIAANVADEATA
FNIRYNKPIIMMEYGADTIAGLHLLPEYVWSEEYQVSLMSEHFKAFDRLRQAGFFVGEFI
WNFADFKTAQTITRVGGNKKGIFTRSRQPKASAHHLRERYLALAAADTNSPPPESPYYVS
DHLPFKHEEL
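
Consistency check (illustrative sketch): the CDS length of 2013 nt corresponds to 2013 / 3 = 671 codons, i.e. 670 residues plus one stop codon, which can be compared against the protein sequence above. The Python below is a minimal sketch of such a check, assuming the standard nuclear genetic code (NCBI translation table 1). The names `translate`, `first_line` and `last_line` are hypothetical helpers introduced here, not part of the record or of any database tool. Only the first and last lines of the nucleotide sequence are translated, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position. Assumed here; the record does not
# state which table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCTGCGTTTCGTCCGATGATGTTATCAACCAGATCTCGTCTGTTCCTCATCGCAGCG"
last_line = "GACCATCTACCATTTAAACACGAAGAATTATAA"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end; '*' is the stop codon
```

To check the whole record, the full nucleotide sequence can be passed to `translate` as one string (line breaks are ignored) and the result, minus the trailing `*`, compared with the protein sequence.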