DPGLEAN09239 in OGS1.0

New model in OGS2.0DPOGS205069 
Genomic Positionscaffold681:- 38636-43841
See gene structure
CDS Length1491
Paired RNAseq reads  155
Single RNAseq reads  413
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006812 (8e-136)
Best Drosophila hit  CG6106 (4e-70)
Best Human hitdihydropyrimidinase-related protein 5 (5e-09)
Best NR hit (blastp)  allantoinase [Ctenocephalides felis] (1e-122)
Best NR hit (blastx)  PREDICTED: hypothetical protein [Nasonia vitripennis] (2e-112)
GeneOntology terms






  
GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:0004038 allantoinase activity
GO:0016812 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amides
GO:0008270 zinc ion binding
GO:0016787 hydrolase activity
GO:0000256 allantoin catabolic process
GO:0005575 cellular_component
GO:0046872 metal ion binding
InterPro families


  
IPR017593 Allantoinase
IPR011059 Metal-dependent hydrolase, composite domain
IPR002195 Dihydroorotase, conserved site
IPR006680 Amidohydrolase 1
Orthology groupMCL17892

Nucleotide sequence:

ATGGCTGCGTTTTATTATTTATTGATAACGTTTATTGGGACAGAAGTACTTGGCAGAGGA
TTACAGCCGGATAAAAGTGAAAACCAGTTATTTTTAAGTAAACGAGTTGTGACAGATTCT
GGCGAAATTGATGGTGGAGTGCTTGTGAACGAAAATGGTATAATTGAAGGTATATTTACG
AGGGACACTATAGATAAATTATTCAATGGATATGATAAGAATTTGCAGGTAATAGACGGT
GGCGATTGGGCCTTAATGGCTGGTGTCGTGGATTCTCATGTACATGTTAATGAGCCAGGC
CGTACATCATGGGAAGGGTTTGTGACTGCCACCAGGGCAGGAGCGGCAGGAGGCATCACC
ACCATTGTTGATATGCCCTTGAATTCTGTTCCACCGACCACTTCAATTGAAAATTTAAAA
ACTAAAGTCTCAGCTGCCAAAGAAGAGGTGTACGTAGATGTGGCATTTTGGGGAGGATTG
GTTCCTGGCAATGAGGAGGAATTACAGAAGCTAGTTAAAGCGGGTGTTGTTGGTTTCAAA
GGTTTTTTAATCGATAGTGGAGTGTCGGAGTTTCCTAATGTTGAAGGTGATGATTTAGAT
AAAATATTTACAACGCTGAATGGTTCTGACATTGCGGTAGCGTTCCACGCCGAATTACCC
ATTAGTGACGGCAACAACAGCAGTCTATGTGACAAATGCGAAAATCTAGATCCGGTCCTA
TATAGCACGTATCTATCATCTCGACCACCTCAAATGGAAATTGATGCTGCAACATTACTT
GCGAAATATATTGCTAAATATGACGTCCACGTGCACGTCGTTCACGTGTCAGCTGAAGGT
GTAATACCGATTTTAGAAAAAGCTAGAGAGTTTAGGATTCAAAATGGATCGAAACGTTGG
AGAGGGGGTGTCACAGCTGAAACCTGTCATCATTATCTTACATTGAGCTCGGAACAAATC
CCGCCAGGACGCACGGAGTATAAATGTTCACCTCCCATAAGAGACATCAACAATAAGTTG
CGACTGTGGGAATATATAAAGCAGAGAAGAATTGATTTGATAGCGTCCGATCACTCCCCG
TCAGTCGCTGGCCTCAAGAGTCCTAATTTCATGACCGCTTGGGGTGGTGTATCGTCCGTG
CAATTCGGCTTATCTCTATTTTGGACTGAAGCAAAAGCTCGTGGTTATAGTCTTAGCACT
GTCAGCCATTTCTTGTCGTCGGGACCCGCTCGGCTTGCTGGGTTACACGACAAAAAGGGG
GCCTTGAAACCAGGCCTTGACGCCGACCTGGTTTTCTTTGATCCTGACGCTTCATTCGTG
CTTACACCAGATAAAATATTCTACAAGAACAAGCTAAGTCCGTATATGTACAAAGTCCTA
ACCGGGAAAGTGATGCAAACTTACGTGAGAGGTCGTCTCGTGTTTAACGATGGTCAAGTG
TATGGCAACCCACAAGGAAAGTTATTGATAAACGAAGACGAGTTATATTGA

Protein sequence:

MAAFYYLLITFIGTEVLGRGLQPDKSENQLFLSKRVVTDSGEIDGGVLVNENGIIEGIFT
RDTIDKLFNGYDKNLQVIDGGDWALMAGVVDSHVHVNEPGRTSWEGFVTATRAGAAGGIT
TIVDMPLNSVPPTTSIENLKTKVSAAKEEVYVDVAFWGGLVPGNEEELQKLVKAGVVGFK
GFLIDSGVSEFPNVEGDDLDKIFTTLNGSDIAVAFHAELPISDGNNSSLCDKCENLDPVL
YSTYLSSRPPQMEIDAATLLAKYIAKYDVHVHVVHVSAEGVIPILEKAREFRIQNGSKRW
RGGVTAETCHHYLTLSSEQIPPGRTEYKCSPPIRDINNKLRLWEYIKQRRIDLIASDHSP
SVAGLKSPNFMTAWGGVSSVQFGLSLFWTEAKARGYSLSTVSHFLSSGPARLAGLHDKKG
ALKPGLDADLVFFDPDASFVLTPDKIFYKNKLSPYMYKVLTGKVMQTYVRGRLVFNDGQV
YGNPQGKLLINEDELY