New model in OGS2.0 | DPOGS208592  |
---|---|
Genomic Position | scaffold1804:- 10-12118 |
CDS Length | 1242 |
Paired RNAseq reads   | 1842 |
Single RNAseq reads   | 6457 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005717 (6e-171) |
Best Drosophila hit   | collapsin response mediator protein, isoform E (3e-126) |
Best Human hit | dihydropyrimidinase (3e-127) |
Best NR hit (blastp)   | PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum] (8e-155) |
Best NR hit (blastx)   | PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum] (7e-143) |
GeneOntology terms    | GO:0002058 uracil binding; GO:0002059 thymine binding; GO:0004157 dihydropyrimidinase activity; GO:0005575 cellular_component; GO:0005625 soluble fraction; GO:0005829 cytosol; GO:0006208 pyrimidine base catabolic process; GO:0006210 thymine catabolic process; GO:0006212 uracil catabolic process; GO:0008150 biological_process; GO:0008270 zinc ion binding; GO:0016597 amino acid binding; GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds; GO:0019482 beta-alanine metabolic process; GO:0019860 uracil metabolic process; GO:0046872 metal ion binding; GO:0051260 protein homooligomerization; GO:0051289 protein homotetramerization |
InterPro families    | IPR011778 D-hydantoinase; IPR011059 Metal-dependent hydrolase, composite domain; IPR006680 Amidohydrolase 1 |
Orthology group | MCL10440 |
Nucleotide sequence:
ATGGAAGACGCTGATGTTTACATCGAAGATGGTGTCATCAAGCAAGTGGGTAGGAATTTA
ATAATTCCTGGTGGGACTCGCACCATAGACGCCACCGGCAAGCTGGTCATGCCTGGTGGT
ATCGATCCCCATACACACTTCGAGTTAGAGATGATGGGCGCCAAGACCGCTGACGACTTC
TATAAAGGCACGCGAGCGGCCGTGGCTGGTGGCACCACCACTATCATTGACTTTGTGCTG
CCTCAGAAAGGACAGTCGTTGATAGAAGCCTACGGGAATTGGAGGCAGAAGGCTGACAAT
AAGGCGTGTTGCGATTACGCCTTGCACGTGGGTGTGACTTGGTGGTCAGCTTCCGTTAAG
AAGGAGATCTCCCAGTTGGTGCACGATCACGGCGTGAACTCCTTCAAGATGTTCATGGCG
TACAAAGACGTTTGGATGCTGGATGACTATAACATGTGCCTGGCGATGGAGGCGTGCGCC
GAGCTGAAGGCACTACCCATGGTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCG
GAAAAGTTGCTGGAAGCTGGGGTCAAAGGTCCCGAGGGTCATGCAGCGTCCAGAGACGAC
CAGGTCGAGGCCGAGGCCGTCAACCGCGCCTGTGTCATCGCTAACCAGATGGACGCTCCG
CTGTATATAGTACACATGATGTCTGCCGCCGCCGTGCAGTCACTGCGTAACGCGCGTCGC
GTCGCCAAACATCCAATATTCGGTGAAACTTTGGCCGCGACGGTTGGCACTGATGGTTCG
CACTACAAGAACGCGTGTTTCCGCCACGCCGCCGCCCACGTCCTCTCCCCGCCACTCCGC
GACCCCAGCACACCGGAAGCCATCATCGACGCCCTCGCACAGTCGCTGCTGGTCCAAGTG
ATAGCCAGCGACAACTGCACCTTCAATGAAAAAGATAAGGAATTGGGGAAAAACGACTTC
ACCAAGATACCTAACGGCGTGAACGGGGTCGAGGACCGCATGGCTATACTGTGGCAGAAA
GCGGTCAACACTGGTGTCATGGACCCTTGTCGTTTCGTGGCCGTGACTAGTACCAACGCT
GCGAATATCTTCAACCTACCATCCAAGGGCCGCGTGGCGGTGGGCGCGGACGGTGACGTC
ATCGTTTGGGACCCTCGCCTCGAGAAGACCATTTCCGCCGCGACCCACCACCACGCCGTA
GATTTTAATATATTTGAGGTTCATCATACGCTGGTCTCATAA
Protein sequence:
MEDADVYIEDGVIKQVGRNLIIPGGTRTIDATGKLVMPGGIDPHTHFELEMMGAKTADDF
YKGTRAAVAGGTTTIIDFVLPQKGQSLIEAYGNWRQKADNKACCDYALHVGVTWWSASVK
KEISQLVHDHGVNSFKMFMAYKDVWMLDDYNMCLAMEACAELKALPMVHAENGGIIARNS
EKLLEAGVKGPEGHAASRDDQVEAEAVNRACVIANQMDAPLYIVHMMSAAAVQSLRNARR
VAKHPIFGETLAATVGTDGSHYKNACFRHAAAHVLSPPLRDPSTPEAIIDALAQSLLVQV
IASDNCTFNEKDKELGKNDFTKIPNGVNGVEDRMAILWQKAVNTGVMDPCRFVAVTSTNA
ANIFNLPSKGRVAVGADGDVIVWDPRLEKTISAATHHHAVDFNIFEVHHTLVS