DPGLEAN00139 in OGS1.0

New model in OGS2.0  DPOGS208592
Genomic Position  scaffold249:- 92373-96431
CDS Length  1248
Paired RNAseq reads  3425
Single RNAseq reads  9239
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005717 (3e-102)
Best Drosophila hit  collapsin response mediator protein, isoform E (2e-104)
Best Human hit  dihydropyrimidinase (1e-82)
Best NR hit (blastp)  PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum] (8e-117)
Best NR hit (blastx)  GJ21611 [Drosophila virilis] (1e-114)
GeneOntology terms
  
GO:0004157 dihydropyrimidinase activity
GO:0006207 'de novo' pyrimidine base biosynthetic process
InterPro families
  
IPR011059 Metal-dependent hydrolase, composite domain
IPR006680 Amidohydrolase 1
Orthology group  ND
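
The genomic position and CDS length listed in this record can be compared with a short script. This is a minimal sketch: the function name `parse_position` is hypothetical, the "scaffold:strand start-end" layout is inferred from this record only, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its parts.

    The layout is an assumption based on this record, not a documented format.
    """
    m = re.fullmatch(r"\s*(\S+):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised position: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

scaffold, strand, start, end = parse_position("scaffold249:- 92373-96431")

# Genomic span, assuming 1-based inclusive coordinates.
span = end - start + 1
cds_length = 1248  # "CDS Length" field of this record

# Bases inside the span that are not part of the CDS.
non_coding = span - cds_length

print(scaffold, strand, span, non_coding)
```

Under those assumptions the model spans more bases on the scaffold than the CDS contains, which would be consistent with a multi-exon gene structure.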

Nucleotide sequence:

GTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCGGAAAAGTTGCTGGAAGCTGGG
GTCAAAGGTCCCGAGGGTCATGCAGCGTCCAGAGACGACCAGGTGGAGGCCGAAGCCGTC
AACCGCGCCTGTGTCATCGCTAACCAGATGGACGCTCCGCTGTATATAGTACACATGATG
TCTGCCGCCGCCGTGCAGTCACTGCGTAACGCGCGTCGCGTCGCCAAACATCCAATATTC
GGTGAAACTTTGGCCGCGACGGTTGGCACTGATGGTTCGCACTACAAGAACGCGTGTTTC
CGCCACGCCGCCGCCCACGTCCTCTCCCCGCCACTCCGCGACCCCAGCACACCGGAAGCC
ATCATCGACGCCCTCGCACACGACGACCTCCAAGTGATAGCCAGCGACAACTGCACCTTC
AATGAAAAAGATAAGGAATTGGGGAAAAACGACTTCACCAAGATACCTAACGGCGTGAAC
GGGGTCGAGGACCGCATGGCTATACTGTGGCAGAAAGCGGTCAACACTGGTGTCATGGAC
CCTTGTCGTTTCGTGGCCGTGACGAGTACCAACGCTGCGAATATCTTCAACCTACCGTCC
AAGGGCCGCGTGGCGGTGGGCGCGGACGGTGACGTCATCGTTTGGGACCCTCGCCTCGAG
AAGACCATTTCCGCCGCGACCCACCACCACGCCGTAGATTTTAATATATTTGAGGGTCAG
CGCGTGGTCGGTGGACCTCAATACGTTATTGTGAACGGTCGAGTGTGTCTCGATGACGGT
GACCTTAGGGTCGCTGAAGGTTACGGTAAATTCTTACCCACACCACCAAATTCTCCGTAC
GTGTACGGTGAAGTACCCACCACGCCGCAACCGGAAAGGGTTGAATACTTGCCCTCACCC
GCCAGGGTCACTAACGGGACTCCCACAGAACTGCAGATATCTCACAAACTAGAAGCTACT
TCCGTATCCGGCTGCAGCACGCCCACCGGCCGGAAGATGAGGGAGCCCGGACAGAGAAAC
CTTCAGAATTCCACCTTCTCCATCAGCCAACTGCAGATATCTCACAAACTAGAAGCTACT
TCCGTATCCGGCTGCAGCACGCCCACCGGCCGGAAGATGAGGGAGCCCGGACAGAGAAAC
CTTCAGAATTCCACCTTCTCCATCAGCCAGGAAATGGAGGGACTCGACACGAAGACGTCA
GTGCGCGTACGGAACCCACCCGGCGGGAAGTCATCCGGTTTGTGGTAA

Protein sequence:

VHAENGGIIARNSEKLLEAGVKGPEGHAASRDDQVEAEAVNRACVIANQMDAPLYIVHMM
SAAAVQSLRNARRVAKHPIFGETLAATVGTDGSHYKNACFRHAAAHVLSPPLRDPSTPEA
IIDALAHDDLQVIASDNCTFNEKDKELGKNDFTKIPNGVNGVEDRMAILWQKAVNTGVMD
PCRFVAVTSTNAANIFNLPSKGRVAVGADGDVIVWDPRLEKTISAATHHHAVDFNIFEGQ
RVVGGPQYVIVNGRVCLDDGDLRVAEGYGKFLPTPPNSPYVYGEVPTTPQPERVEYLPSP
ARVTNGTPTELQISHKLEATSVSGCSTPTGRKMREPGQRNLQNSTFSISQLQISHKLEAT
SVSGCSTPTGRKMREPGQRNLQNSTFSISQEMEGLDTKTSVRVRNPPGGKSSGLW
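
The protein sequence above corresponds to a codon-by-codon translation of the nucleotide sequence. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of the record. Only the first and last 20 codons of the CDS are checked here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 60 bases of the nucleotide sequence in this record.
first_line = "GTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCGGAAAAGTTGCTGGAAGCTGGG"
# Last 60 bases of the nucleotide sequence in this record.
last_bases = "AAGACGTCAGTGCGCGTACGGAACCCACCCGGCGGGAAGTCATCCGGTTTGTGGTAA"[-60:]

head = translate(first_line)
tail = translate(last_bases[-48:])

print(head)  # first 20 residues of the protein
print(tail)  # last 15 residues followed by '*' for the stop codon

# A 1248-base CDS holds 416 codons: 415 residues plus one stop codon.
codons = 1248 // 3
```

The terminal `TAA` codon translates to a stop, which is why the listed protein is one residue shorter than the codon count.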