New model in OGS2.0 | DPOGS208592  |
---|---|
Genomic Position | scaffold249:- 92373-96431 |
See gene structure | |
CDS Length | 1248 |
Paired RNAseq reads   | 3425 |
Single RNAseq reads   | 9239 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005717 (3e-102) |
Best Drosophila hit   | collapsin response mediator protein, isoform E (2e-104) |
Best Human hit | dihydropyrimidinase (1e-82) |
Best NR hit (blastp)   | PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum] (8e-117) |
Best NR hit (blastx)   | GJ21611 [Drosophila virilis] (1e-114) |
GeneOntology terms    | GO:0004157 dihydropyrimidinase activity GO:0006207 'de novo' pyrimidine base biosynthetic process |
InterPro families    | IPR011059 Metal-dependent hydrolase, composite domain IPR006680 Amidohydrolase 1 |
Orthology group | ND |
Nucleotide sequence:
GTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCGGAAAAGTTGCTGGAAGCTGGG
GTCAAAGGTCCCGAGGGTCATGCAGCGTCCAGAGACGACCAGGTGGAGGCCGAAGCCGTC
AACCGCGCCTGTGTCATCGCTAACCAGATGGACGCTCCGCTGTATATAGTACACATGATG
TCTGCCGCCGCCGTGCAGTCACTGCGTAACGCGCGTCGCGTCGCCAAACATCCAATATTC
GGTGAAACTTTGGCCGCGACGGTTGGCACTGATGGTTCGCACTACAAGAACGCGTGTTTC
CGCCACGCCGCCGCCCACGTCCTCTCCCCGCCACTCCGCGACCCCAGCACACCGGAAGCC
ATCATCGACGCCCTCGCACACGACGACCTCCAAGTGATAGCCAGCGACAACTGCACCTTC
AATGAAAAAGATAAGGAATTGGGGAAAAACGACTTCACCAAGATACCTAACGGCGTGAAC
GGGGTCGAGGACCGCATGGCTATACTGTGGCAGAAAGCGGTCAACACTGGTGTCATGGAC
CCTTGTCGTTTCGTGGCCGTGACGAGTACCAACGCTGCGAATATCTTCAACCTACCGTCC
AAGGGCCGCGTGGCGGTGGGCGCGGACGGTGACGTCATCGTTTGGGACCCTCGCCTCGAG
AAGACCATTTCCGCCGCGACCCACCACCACGCCGTAGATTTTAATATATTTGAGGGTCAG
CGCGTGGTCGGTGGACCTCAATACGTTATTGTGAACGGTCGAGTGTGTCTCGATGACGGT
GACCTTAGGGTCGCTGAAGGTTACGGTAAATTCTTACCCACACCACCAAATTCTCCGTAC
GTGTACGGTGAAGTACCCACCACGCCGCAACCGGAAAGGGTTGAATACTTGCCCTCACCC
GCCAGGGTCACTAACGGGACTCCCACAGAACTGCAGATATCTCACAAACTAGAAGCTACT
TCCGTATCCGGCTGCAGCACGCCCACCGGCCGGAAGATGAGGGAGCCCGGACAGAGAAAC
CTTCAGAATTCCACCTTCTCCATCAGCCAACTGCAGATATCTCACAAACTAGAAGCTACT
TCCGTATCCGGCTGCAGCACGCCCACCGGCCGGAAGATGAGGGAGCCCGGACAGAGAAAC
CTTCAGAATTCCACCTTCTCCATCAGCCAGGAAATGGAGGGACTCGACACGAAGACGTCA
GTGCGCGTACGGAACCCACCCGGCGGGAAGTCATCCGGTTTGTGGTAA
Protein sequence:
VHAENGGIIARNSEKLLEAGVKGPEGHAASRDDQVEAEAVNRACVIANQMDAPLYIVHMM
SAAAVQSLRNARRVAKHPIFGETLAATVGTDGSHYKNACFRHAAAHVLSPPLRDPSTPEA
IIDALAHDDLQVIASDNCTFNEKDKELGKNDFTKIPNGVNGVEDRMAILWQKAVNTGVMD
PCRFVAVTSTNAANIFNLPSKGRVAVGADGDVIVWDPRLEKTISAATHHHAVDFNIFEGQ
RVVGGPQYVIVNGRVCLDDGDLRVAEGYGKFLPTPPNSPYVYGEVPTTPQPERVEYLPSP
ARVTNGTPTELQISHKLEATSVSGCSTPTGRKMREPGQRNLQNSTFSISQLQISHKLEAT
SVSGCSTPTGRKMREPGQRNLQNSTFSISQEMEGLDTKTSVRVRNPPGGKSSGLW