DPGLEAN09238 in OGS1.0

New model in OGS2.0  DPOGS205068
Genomic Position  scaffold681:- 46815-52919
CDS Length  1683
Paired RNAseq reads  348
Single RNAseq reads  1019
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006812 (7e-51)
Best Drosophila hit  CG6106 (2e-39)
Best Human hit  ND
Best NR hit (blastp)  allantoinase [Culex quinquefasciatus] (5e-72)
Best NR hit (blastx)  allantoinase [Culex quinquefasciatus] (4e-68)
GeneOntology terms
GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:0004038 allantoinase activity
GO:0016812 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amides
GO:0008270 zinc ion binding
GO:0016787 hydrolase activity
GO:0000256 allantoin catabolic process
GO:0005575 cellular_component
GO:0046872 metal ion binding
InterPro families
IPR017593 Allantoinase
IPR011059 Metal-dependent hydrolase, composite domain
IPR002195 Dihydroorotase, conserved site
IPR006680 Amidohydrolase 1
Orthology group  ND

Nucleotide sequence:

ATGACGGGACAGTATAATTTGTTTTTAAGTAAACGCTTAGTAACAGAGGAAAAGATTTTC
GATGGTGGGGTTTTGGTTAACGAGTTCGGAAAGATTGAAGACGTGCTTGACCGATACGAT
GCTGATAATTTGATAGCCGAGAGCAAGGATAAAATTACTGTAGTAGATGGTGGAGACTGT
CCTCTCCTGGCAGGTGTGGTTGACTCCCATGTCCACGTCAACGAGCCCGGCCGCACCGCC
TGGGAGGGTTTCCGTACAGCGACCAGTGCAGCTGCTGCTGGCGGCATAACTACCATCGTC
GATAACGGCGACACTCACACACCATTTCAAATTGTCTTTCAAAACTCCATCCCTCCGACA
ACGACCTTGGAGAATCTTAAAATAAAAGCCAATTCTGCGAAGGAAAACGTATTCGTAGAT
GTTGCATTTTGGGGCGGCGTGATTGTTGGGAATCAGGATTCTCTTCGAGATTTAGTAAAA
GCTGGAGTGGTGGGTTTTAAATGTTTTCTTTGTCCAAGTGGCGTTGAAGAGTTTCCTAAT
GTAGGAATCGAAGATTTAAATTTAGCATTTGATGCCCTTGAAGGAACCGGCTCGGTTTTG
GCTTTTCACGCTGAATTTGAAGAGGAGACATCTAGCGGGAAATGTATGAAATTAGATCCT
GAAGATTATAACACGTACCTAGAGTCGCGACCATCCCAAATGGAATTAAGTGCAGTCTCA
CTCATAACCAACTTCCTACAAAAAACGGACGTTCGAGTTCATATTGTTCACGTGTCATCA
GCAAATGTGGTGCCGCTTCTGGTGAAGGCTCGTGAGGATCGTCTAGCCAAAGGCCATAAT
GCCTGGCGTGGAGGAGTGACCGCTGAGACATGCCACCACTATCTAACGCTGAGCGCAGAT
GAAGTGCCACGAGGACACAGCGAGTACAAATGTGCTCCTCCAATACGAGATGCTAATAAT
AAGGCGCCAAACGAGCAGACGGCCTACCTCATGCAAAGTAGTAATCGTCGCCCATGGCCA
CGCATCTGCAACATAAGGGATATTGCAGATGCGTGGCCGAAATTGATTGTGGGAGAGGGA
GAGAAAGGGGTAAGAAAAGACAAAAGGCATGAATGGGTGGAGAAAGGAAAAGGGGTACCG
AATCTCTCACTCTTAGGACGAAATGCAGCCATTACAGGCTACTTCACGCCGATCTTCTGT
GAGAGGGTGGCACGGGAAAAATTGTGGAAGTTTTTGCTAGATGATAAATTGGATATGGTT
GTATCAGATCACTCGCCCTGTACCCCGGAACTCAAGTGTAGCAATAACTTAAAAGCTTGG
GGTGGAATATCTTCGGTCCAGTTTGGTTTGCCATTATTTTGGACTCAAGCAAGTGCTCGT
GGATTAGATTTAAGATCAATAACCAAATATCTAAGTTCTGGTCCAGCCCATCTTTGTGGG
TTGCAAAATCGAAAAGGAGCACTTAAAAAAGGATTGGATGCCGATCTTATTTTCTTCGAT
TGCGACGCAAATTTCACTGTAACCCAAGAAATCATACGACATAAAAATAAGCTGACGCCC
TATATTGGTAAAGAATTGAAAGGCATAGTTAGGAAGACCTATTTAAGAGGACATCTGATA
TATGACGGGGGCGATTTAATTGGTTCACCTCAAGGAGAACTGTTACTCAACGATATTAAA
TAA

Protein sequence:

MTGQYNLFLSKRLVTEEKIFDGGVLVNEFGKIEDVLDRYDADNLIAESKDKITVVDGGDC
PLLAGVVDSHVHVNEPGRTAWEGFRTATSAAAAGGITTIVDNGDTHTPFQIVFQNSIPPT
TTLENLKIKANSAKENVFVDVAFWGGVIVGNQDSLRDLVKAGVVGFKCFLCPSGVEEFPN
VGIEDLNLAFDALEGTGSVLAFHAEFEEETSSGKCMKLDPEDYNTYLESRPSQMELSAVS
LITNFLQKTDVRVHIVHVSSANVVPLLVKAREDRLAKGHNAWRGGVTAETCHHYLTLSAD
EVPRGHSEYKCAPPIRDANNKAPNEQTAYLMQSSNRRPWPRICNIRDIADAWPKLIVGEG
EKGVRKDKRHEWVEKGKGVPNLSLLGRNAAITGYFTPIFCERVAREKLWKFLLDDKLDMV
VSDHSPCTPELKCSNNLKAWGGISSVQFGLPLFWTQASARGLDLRSITKYLSSGPAHLCG
LQNRKGALKKGLDADLIFFDCDANFTVTQEIIRHKNKLTPYIGKELKGIVRKTYLRGHLI
YDGGDLIGSPQGELLLNDIK