New model in OGS2.0 | DPOGS205068  |
---|---|
Genomic Position | scaffold681:- 46815-52919 |
See gene structure | |
CDS Length | 1683 |
Paired RNAseq reads   | 348 |
Single RNAseq reads   | 1019 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006812 (7e-51) |
Best Drosophila hit   | CG6106 (2e-39) |
Best Human hit | ND |
Best NR hit (blastp)   | allantoinase [Culex quinquefasciatus] (5e-72) |
Best NR hit (blastx)   | allantoinase [Culex quinquefasciatus] (4e-68) |
GeneOntology terms    | GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds GO:0004038 allantoinase activity GO:0016812 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amides GO:0008270 zinc ion binding GO:0016787 hydrolase activity GO:0000256 allantoin catabolic process GO:0005575 cellular_component GO:0046872 metal ion binding |
InterPro families    | IPR017593 Allantoinase IPR011059 Metal-dependent hydrolase, composite domain IPR002195 Dihydroorotase, conserved site IPR006680 Amidohydrolase 1 |
Orthology group | ND |
Nucleotide sequence:
ATGACGGGACAGTATAATTTGTTTTTAAGTAAACGCTTAGTAACAGAGGAAAAGATTTTC
GATGGTGGGGTTTTGGTTAACGAGTTCGGAAAGATTGAAGACGTGCTTGACCGATACGAT
GCTGATAATTTGATAGCCGAGAGCAAGGATAAAATTACTGTAGTAGATGGTGGAGACTGT
CCTCTCCTGGCAGGTGTGGTTGACTCCCATGTCCACGTCAACGAGCCCGGCCGCACCGCC
TGGGAGGGTTTCCGTACAGCGACCAGTGCAGCTGCTGCTGGCGGCATAACTACCATCGTC
GATAACGGCGACACTCACACACCATTTCAAATTGTCTTTCAAAACTCCATCCCTCCGACA
ACGACCTTGGAGAATCTTAAAATAAAAGCCAATTCTGCGAAGGAAAACGTATTCGTAGAT
GTTGCATTTTGGGGCGGCGTGATTGTTGGGAATCAGGATTCTCTTCGAGATTTAGTAAAA
GCTGGAGTGGTGGGTTTTAAATGTTTTCTTTGTCCAAGTGGCGTTGAAGAGTTTCCTAAT
GTAGGAATCGAAGATTTAAATTTAGCATTTGATGCCCTTGAAGGAACCGGCTCGGTTTTG
GCTTTTCACGCTGAATTTGAAGAGGAGACATCTAGCGGGAAATGTATGAAATTAGATCCT
GAAGATTATAACACGTACCTAGAGTCGCGACCATCCCAAATGGAATTAAGTGCAGTCTCA
CTCATAACCAACTTCCTACAAAAAACGGACGTTCGAGTTCATATTGTTCACGTGTCATCA
GCAAATGTGGTGCCGCTTCTGGTGAAGGCTCGTGAGGATCGTCTAGCCAAAGGCCATAAT
GCCTGGCGTGGAGGAGTGACCGCTGAGACATGCCACCACTATCTAACGCTGAGCGCAGAT
GAAGTGCCACGAGGACACAGCGAGTACAAATGTGCTCCTCCAATACGAGATGCTAATAAT
AAGGCGCCAAACGAGCAGACGGCCTACCTCATGCAAAGTAGTAATCGTCGCCCATGGCCA
CGCATCTGCAACATAAGGGATATTGCAGATGCGTGGCCGAAATTGATTGTGGGAGAGGGA
GAGAAAGGGGTAAGAAAAGACAAAAGGCATGAATGGGTGGAGAAAGGAAAAGGGGTACCG
AATCTCTCACTCTTAGGACGAAATGCAGCCATTACAGGCTACTTCACGCCGATCTTCTGT
GAGAGGGTGGCACGGGAAAAATTGTGGAAGTTTTTGCTAGATGATAAATTGGATATGGTT
GTATCAGATCACTCGCCCTGTACCCCGGAACTCAAGTGTAGCAATAACTTAAAAGCTTGG
GGTGGAATATCTTCGGTCCAGTTTGGTTTGCCATTATTTTGGACTCAAGCAAGTGCTCGT
GGATTAGATTTAAGATCAATAACCAAATATCTAAGTTCTGGTCCAGCCCATCTTTGTGGG
TTGCAAAATCGAAAAGGAGCACTTAAAAAAGGATTGGATGCCGATCTTATTTTCTTCGAT
TGCGACGCAAATTTCACTGTAACCCAAGAAATCATACGACATAAAAATAAGCTGACGCCC
TATATTGGTAAAGAATTGAAAGGCATAGTTAGGAAGACCTATTTAAGAGGACATCTGATA
TATGACGGGGGCGATTTAATTGGTTCACCTCAAGGAGAACTGTTACTCAACGATATTAAA
TAA
Protein sequence:
MTGQYNLFLSKRLVTEEKIFDGGVLVNEFGKIEDVLDRYDADNLIAESKDKITVVDGGDC
PLLAGVVDSHVHVNEPGRTAWEGFRTATSAAAAGGITTIVDNGDTHTPFQIVFQNSIPPT
TTLENLKIKANSAKENVFVDVAFWGGVIVGNQDSLRDLVKAGVVGFKCFLCPSGVEEFPN
VGIEDLNLAFDALEGTGSVLAFHAEFEEETSSGKCMKLDPEDYNTYLESRPSQMELSAVS
LITNFLQKTDVRVHIVHVSSANVVPLLVKAREDRLAKGHNAWRGGVTAETCHHYLTLSAD
EVPRGHSEYKCAPPIRDANNKAPNEQTAYLMQSSNRRPWPRICNIRDIADAWPKLIVGEG
EKGVRKDKRHEWVEKGKGVPNLSLLGRNAAITGYFTPIFCERVAREKLWKFLLDDKLDMV
VSDHSPCTPELKCSNNLKAWGGISSVQFGLPLFWTQASARGLDLRSITKYLSSGPAHLCG
LQNRKGALKKGLDADLIFFDCDANFTVTQEIIRHKNKLTPYIGKELKGIVRKTYLRGHLI
YDGGDLIGSPQGELLLNDIK