DPGLEAN09841 in OGS1.0

New model in OGS2.0DPOGS208592 
Genomic Positionscaffold1804:- 10-12118
See gene structure
CDS Length1242
Paired RNAseq reads  1842
Single RNAseq reads  6457
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005717 (6e-171)
Best Drosophila hit  collapsin response mediator protein, isoform E (3e-126)
Best Human hitdihydropyrimidinase (3e-127)
Best NR hit (blastp)  PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum] (8e-155)
Best NR hit (blastx)  PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum] (7e-143)
GeneOntology terms
















  
GO:0002058 uracil binding
GO:0002059 thymine binding
GO:0004157 dihydropyrimidinase activity
GO:0005575 cellular_component
GO:0005625 soluble fraction
GO:0005829 cytosol
GO:0006208 pyrimidine base catabolic process
GO:0006210 thymine catabolic process
GO:0006212 uracil catabolic process
GO:0008150 biological_process
GO:0008270 zinc ion binding
GO:0016597 amino acid binding
GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:0019482 beta-alanine metabolic process
GO:0019860 uracil metabolic process
GO:0046872 metal ion binding
GO:0051260 protein homooligomerization
GO:0051289 protein homotetramerization
InterPro families

  
IPR011778 D-hydantoinase
IPR011059 Metal-dependent hydrolase, composite domain
IPR006680 Amidohydrolase 1
Orthology groupMCL10440

Nucleotide sequence:

ATGGAAGACGCTGATGTTTACATCGAAGATGGTGTCATCAAGCAAGTGGGTAGGAATTTA
ATAATTCCTGGTGGGACTCGCACCATAGACGCCACCGGCAAGCTGGTCATGCCTGGTGGT
ATCGATCCCCATACACACTTCGAGTTAGAGATGATGGGCGCCAAGACCGCTGACGACTTC
TATAAAGGCACGCGAGCGGCCGTGGCTGGTGGCACCACCACTATCATTGACTTTGTGCTG
CCTCAGAAAGGACAGTCGTTGATAGAAGCCTACGGGAATTGGAGGCAGAAGGCTGACAAT
AAGGCGTGTTGCGATTACGCCTTGCACGTGGGTGTGACTTGGTGGTCAGCTTCCGTTAAG
AAGGAGATCTCCCAGTTGGTGCACGATCACGGCGTGAACTCCTTCAAGATGTTCATGGCG
TACAAAGACGTTTGGATGCTGGATGACTATAACATGTGCCTGGCGATGGAGGCGTGCGCC
GAGCTGAAGGCACTACCCATGGTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCG
GAAAAGTTGCTGGAAGCTGGGGTCAAAGGTCCCGAGGGTCATGCAGCGTCCAGAGACGAC
CAGGTCGAGGCCGAGGCCGTCAACCGCGCCTGTGTCATCGCTAACCAGATGGACGCTCCG
CTGTATATAGTACACATGATGTCTGCCGCCGCCGTGCAGTCACTGCGTAACGCGCGTCGC
GTCGCCAAACATCCAATATTCGGTGAAACTTTGGCCGCGACGGTTGGCACTGATGGTTCG
CACTACAAGAACGCGTGTTTCCGCCACGCCGCCGCCCACGTCCTCTCCCCGCCACTCCGC
GACCCCAGCACACCGGAAGCCATCATCGACGCCCTCGCACAGTCGCTGCTGGTCCAAGTG
ATAGCCAGCGACAACTGCACCTTCAATGAAAAAGATAAGGAATTGGGGAAAAACGACTTC
ACCAAGATACCTAACGGCGTGAACGGGGTCGAGGACCGCATGGCTATACTGTGGCAGAAA
GCGGTCAACACTGGTGTCATGGACCCTTGTCGTTTCGTGGCCGTGACTAGTACCAACGCT
GCGAATATCTTCAACCTACCATCCAAGGGCCGCGTGGCGGTGGGCGCGGACGGTGACGTC
ATCGTTTGGGACCCTCGCCTCGAGAAGACCATTTCCGCCGCGACCCACCACCACGCCGTA
GATTTTAATATATTTGAGGTTCATCATACGCTGGTCTCATAA

Protein sequence:

MEDADVYIEDGVIKQVGRNLIIPGGTRTIDATGKLVMPGGIDPHTHFELEMMGAKTADDF
YKGTRAAVAGGTTTIIDFVLPQKGQSLIEAYGNWRQKADNKACCDYALHVGVTWWSASVK
KEISQLVHDHGVNSFKMFMAYKDVWMLDDYNMCLAMEACAELKALPMVHAENGGIIARNS
EKLLEAGVKGPEGHAASRDDQVEAEAVNRACVIANQMDAPLYIVHMMSAAAVQSLRNARR
VAKHPIFGETLAATVGTDGSHYKNACFRHAAAHVLSPPLRDPSTPEAIIDALAQSLLVQV
IASDNCTFNEKDKELGKNDFTKIPNGVNGVEDRMAILWQKAVNTGVMDPCRFVAVTSTNA
ANIFNLPSKGRVAVGADGDVIVWDPRLEKTISAATHHHAVDFNIFEVHHTLVS