New model in OGS2.0 | DPOGS205069  |
---|---|
Genomic Position | scaffold681:- 38636-43841 |
See gene structure | |
CDS Length | 1491 |
Paired RNAseq reads   | 155 |
Single RNAseq reads   | 413 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006812 (8e-136) |
Best Drosophila hit   | CG6106 (4e-70) |
Best Human hit | dihydropyrimidinase-related protein 5 (5e-09) |
Best NR hit (blastp)   | allantoinase [Ctenocephalides felis] (1e-122) |
Best NR hit (blastx)   | PREDICTED: hypothetical protein [Nasonia vitripennis] (2e-112) |
GeneOntology terms    | GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds GO:0004038 allantoinase activity GO:0016812 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amides GO:0008270 zinc ion binding GO:0016787 hydrolase activity GO:0000256 allantoin catabolic process GO:0005575 cellular_component GO:0046872 metal ion binding |
InterPro families    | IPR017593 Allantoinase IPR011059 Metal-dependent hydrolase, composite domain IPR002195 Dihydroorotase, conserved site IPR006680 Amidohydrolase 1 |
Orthology group | MCL17892 |
Nucleotide sequence:
ATGGCTGCGTTTTATTATTTATTGATAACGTTTATTGGGACAGAAGTACTTGGCAGAGGA
TTACAGCCGGATAAAAGTGAAAACCAGTTATTTTTAAGTAAACGAGTTGTGACAGATTCT
GGCGAAATTGATGGTGGAGTGCTTGTGAACGAAAATGGTATAATTGAAGGTATATTTACG
AGGGACACTATAGATAAATTATTCAATGGATATGATAAGAATTTGCAGGTAATAGACGGT
GGCGATTGGGCCTTAATGGCTGGTGTCGTGGATTCTCATGTACATGTTAATGAGCCAGGC
CGTACATCATGGGAAGGGTTTGTGACTGCCACCAGGGCAGGAGCGGCAGGAGGCATCACC
ACCATTGTTGATATGCCCTTGAATTCTGTTCCACCGACCACTTCAATTGAAAATTTAAAA
ACTAAAGTCTCAGCTGCCAAAGAAGAGGTGTACGTAGATGTGGCATTTTGGGGAGGATTG
GTTCCTGGCAATGAGGAGGAATTACAGAAGCTAGTTAAAGCGGGTGTTGTTGGTTTCAAA
GGTTTTTTAATCGATAGTGGAGTGTCGGAGTTTCCTAATGTTGAAGGTGATGATTTAGAT
AAAATATTTACAACGCTGAATGGTTCTGACATTGCGGTAGCGTTCCACGCCGAATTACCC
ATTAGTGACGGCAACAACAGCAGTCTATGTGACAAATGCGAAAATCTAGATCCGGTCCTA
TATAGCACGTATCTATCATCTCGACCACCTCAAATGGAAATTGATGCTGCAACATTACTT
GCGAAATATATTGCTAAATATGACGTCCACGTGCACGTCGTTCACGTGTCAGCTGAAGGT
GTAATACCGATTTTAGAAAAAGCTAGAGAGTTTAGGATTCAAAATGGATCGAAACGTTGG
AGAGGGGGTGTCACAGCTGAAACCTGTCATCATTATCTTACATTGAGCTCGGAACAAATC
CCGCCAGGACGCACGGAGTATAAATGTTCACCTCCCATAAGAGACATCAACAATAAGTTG
CGACTGTGGGAATATATAAAGCAGAGAAGAATTGATTTGATAGCGTCCGATCACTCCCCG
TCAGTCGCTGGCCTCAAGAGTCCTAATTTCATGACCGCTTGGGGTGGTGTATCGTCCGTG
CAATTCGGCTTATCTCTATTTTGGACTGAAGCAAAAGCTCGTGGTTATAGTCTTAGCACT
GTCAGCCATTTCTTGTCGTCGGGACCCGCTCGGCTTGCTGGGTTACACGACAAAAAGGGG
GCCTTGAAACCAGGCCTTGACGCCGACCTGGTTTTCTTTGATCCTGACGCTTCATTCGTG
CTTACACCAGATAAAATATTCTACAAGAACAAGCTAAGTCCGTATATGTACAAAGTCCTA
ACCGGGAAAGTGATGCAAACTTACGTGAGAGGTCGTCTCGTGTTTAACGATGGTCAAGTG
TATGGCAACCCACAAGGAAAGTTATTGATAAACGAAGACGAGTTATATTGA
Protein sequence:
MAAFYYLLITFIGTEVLGRGLQPDKSENQLFLSKRVVTDSGEIDGGVLVNENGIIEGIFT
RDTIDKLFNGYDKNLQVIDGGDWALMAGVVDSHVHVNEPGRTSWEGFVTATRAGAAGGIT
TIVDMPLNSVPPTTSIENLKTKVSAAKEEVYVDVAFWGGLVPGNEEELQKLVKAGVVGFK
GFLIDSGVSEFPNVEGDDLDKIFTTLNGSDIAVAFHAELPISDGNNSSLCDKCENLDPVL
YSTYLSSRPPQMEIDAATLLAKYIAKYDVHVHVVHVSAEGVIPILEKAREFRIQNGSKRW
RGGVTAETCHHYLTLSSEQIPPGRTEYKCSPPIRDINNKLRLWEYIKQRRIDLIASDHSP
SVAGLKSPNFMTAWGGVSSVQFGLSLFWTEAKARGYSLSTVSHFLSSGPARLAGLHDKKG
ALKPGLDADLVFFDPDASFVLTPDKIFYKNKLSPYMYKVLTGKVMQTYVRGRLVFNDGQV
YGNPQGKLLINEDELY