New model in OGS2.0 | DPOGS208170  |
---|---|
Genomic Position | scaffold12438:- 43-5132 |
See gene structure | |
CDS Length | 1503 |
Paired RNAseq reads   | 241 |
Single RNAseq reads   | 1421 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010258 (3e-48) |
Best Drosophila hit   | CG18143 (2e-42) |
Best Human hit | guanine deaminase (1e-40) |
Best NR hit (blastp)   | guanine deaminase [Aedes aegypti] (2e-85) |
Best NR hit (blastx)   | chlorohydrolase family protein [Aspergillus clavatus NRRL 1] (4e-54) |
GeneOntology terms    | GO:0016787 hydrolase activity GO:0008270 zinc ion binding GO:0008892 guanine deaminase activity GO:0005575 cellular_component GO:0008150 biological_process |
InterPro families    | IPR000276 GPCR, rhodopsin-like, 7TM IPR017452 GPCR, rhodopsin-like superfamily IPR006680 Amidohydrolase 1 IPR014311 Guanine deaminase |
Orthology group | MCL14391 |
Nucleotide sequence:
ATGACGGGAAAGTTTGTTTTTTCTGGAAACTTAGTAACTTCAGATGTTTCTATGAAGTTA
AAGTGTTTTCGTGGATATATAGAAGTCGTTGATGGACTGATAACAAAAATTGGTAATATA
GAAGAGTTTCAAAAACAGATTGATAATTTTAGTGATTCAAAAGTAGTTACACTTGGAGAG
AATGAGTTTTTAATGCCGGGCTTTGTGGACTGTCATACTCATGCCCCTCAATATCCCAAC
ATTGGTCTAGGATTGGACCTCCCATTGCTCGAATGGCTCAATAAATATACTTTTCCACTG
GAACGTCACTACAGTGATGCAGAATTTGCTGCTAATGTTTATGACATTGTTGTGAGAAGA
TTAATTAACAATGGCACCACAACGGCCTGTTATTTTGGATCCTTACACTTAGAAGGTACG
ATGAAGCTAGTGAAATATGTTGTTGAGCACAAACAAAGAGCATTGGTGGGAAAAGTCAGT
ATGAATGTAGAAAATGATGCTGGATACTATAACAAGACAGAAAAGGAACTGCAAGAGGTT
GAACAGTTCATCAAAAGAGTTCTAAGCTATAAGGGTGCGCCAAGTCCATCGCATTGCAGG
GAGAGGAGCTGCAACAGTGCCTGGGACTTGCGACCCAGAGCATCGTACGTGTCCGTGTTG
ACTATAGTAGCGTTCACACTCGAACGTTACCTGGCCATATGTCATCCCTTACACATCTAC
GCTGTGGCTGGTCTGCGCAGAGCTTTACGTATAGTACTCTCTCTGTGGGTGTTGTCTCTG
TTGGCTGCATCACCCTTCGCTCATTACACTACAGTCAACTATCACGAATATCCTCCGAAG
TCAAAATATATATTTCAGAGTCACATATCGGAAAATAAAAAAGAAATCGAATATGTGCTA
GAGACGAACCCGGAATGTGCGTCATATACTGATGTTTATCAACGATGCGGCATATTTAAT
GGTCCATGTATAATGGCCCACGCCGTCCACCTCACTGAGGACGAAATCAGTGAGTTCAGT
ATTCGCCAAGTGTCGGTGGCGCACTGTCCCGCCTCCAACACTAGACTGAACTCGGGGCTT
TGCCCCGTCAGGAAGCTGCTAGACAGAGGCATCAGAGTGGGCCTCGGTACCGATATTTCT
GGTGGTGACAGCGCTAGCATGTTAGACGCCATGCGGCGAGCCATGGATGTGTCGCTGCAC
TTGGCCATGATGGGCCAACCTCATACCACTTTAGACTGGAAGGAAGCTTTCTACCTCGCG
ACGCTAGGCGGCGCTAGAGCTTTACGACTGGAAGATAAAATTGGCAGTTTCGACGTTGGT
AAACAGTTCGATGCTTTACTAATAGACGTTTACGCGAAGGGCGGACCGATCGATAAATAC
GCGTACAATAGTGGCGAGCACGCGCTGGTCGAGTTGGTGCAGAGATTTGTTTATTTGGGG
GACGACAGGAACATAAGGCAGGTGTACGTGAACGGGGAAACTATAAAGGATTTTATTATA
TGA
Protein sequence:
MTGKFVFSGNLVTSDVSMKLKCFRGYIEVVDGLITKIGNIEEFQKQIDNFSDSKVVTLGE
NEFLMPGFVDCHTHAPQYPNIGLGLDLPLLEWLNKYTFPLERHYSDAEFAANVYDIVVRR
LINNGTTTACYFGSLHLEGTMKLVKYVVEHKQRALVGKVSMNVENDAGYYNKTEKELQEV
EQFIKRVLSYKGAPSPSHCRERSCNSAWDLRPRASYVSVLTIVAFTLERYLAICHPLHIY
AVAGLRRALRIVLSLWVLSLLAASPFAHYTTVNYHEYPPKSKYIFQSHISENKKEIEYVL
ETNPECASYTDVYQRCGIFNGPCIMAHAVHLTEDEISEFSIRQVSVAHCPASNTRLNSGL
CPVRKLLDRGIRVGLGTDISGGDSASMLDAMRRAMDVSLHLAMMGQPHTTLDWKEAFYLA
TLGGARALRLEDKIGSFDVGKQFDALLIDVYAKGGPIDKYAYNSGEHALVELVQRFVYLG
DDRNIRQVYVNGETIKDFII