DPGLEAN15138 in OGS1.0

New model in OGS2.0  DPOGS208170
Genomic Position  scaffold12438:- 43-5132
CDS Length  1503
Paired RNAseq reads  241
Single RNAseq reads  1421
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010258 (3e-48)
Best Drosophila hit  CG18143 (2e-42)
Best Human hit  guanine deaminase (1e-40)
Best NR hit (blastp)  guanine deaminase [Aedes aegypti] (2e-85)
Best NR hit (blastx)  chlorohydrolase family protein [Aspergillus clavatus NRRL 1] (4e-54)
GeneOntology terms
GO:0016787 hydrolase activity
GO:0008270 zinc ion binding
GO:0008892 guanine deaminase activity
GO:0005575 cellular_component
GO:0008150 biological_process
InterPro families
IPR000276 GPCR, rhodopsin-like, 7TM
IPR017452 GPCR, rhodopsin-like superfamily
IPR006680 Amidohydrolase 1
IPR014311 Guanine deaminase
Orthology group  MCL14391

Nucleotide sequence:

ATGACGGGAAAGTTTGTTTTTTCTGGAAACTTAGTAACTTCAGATGTTTCTATGAAGTTA
AAGTGTTTTCGTGGATATATAGAAGTCGTTGATGGACTGATAACAAAAATTGGTAATATA
GAAGAGTTTCAAAAACAGATTGATAATTTTAGTGATTCAAAAGTAGTTACACTTGGAGAG
AATGAGTTTTTAATGCCGGGCTTTGTGGACTGTCATACTCATGCCCCTCAATATCCCAAC
ATTGGTCTAGGATTGGACCTCCCATTGCTCGAATGGCTCAATAAATATACTTTTCCACTG
GAACGTCACTACAGTGATGCAGAATTTGCTGCTAATGTTTATGACATTGTTGTGAGAAGA
TTAATTAACAATGGCACCACAACGGCCTGTTATTTTGGATCCTTACACTTAGAAGGTACG
ATGAAGCTAGTGAAATATGTTGTTGAGCACAAACAAAGAGCATTGGTGGGAAAAGTCAGT
ATGAATGTAGAAAATGATGCTGGATACTATAACAAGACAGAAAAGGAACTGCAAGAGGTT
GAACAGTTCATCAAAAGAGTTCTAAGCTATAAGGGTGCGCCAAGTCCATCGCATTGCAGG
GAGAGGAGCTGCAACAGTGCCTGGGACTTGCGACCCAGAGCATCGTACGTGTCCGTGTTG
ACTATAGTAGCGTTCACACTCGAACGTTACCTGGCCATATGTCATCCCTTACACATCTAC
GCTGTGGCTGGTCTGCGCAGAGCTTTACGTATAGTACTCTCTCTGTGGGTGTTGTCTCTG
TTGGCTGCATCACCCTTCGCTCATTACACTACAGTCAACTATCACGAATATCCTCCGAAG
TCAAAATATATATTTCAGAGTCACATATCGGAAAATAAAAAAGAAATCGAATATGTGCTA
GAGACGAACCCGGAATGTGCGTCATATACTGATGTTTATCAACGATGCGGCATATTTAAT
GGTCCATGTATAATGGCCCACGCCGTCCACCTCACTGAGGACGAAATCAGTGAGTTCAGT
ATTCGCCAAGTGTCGGTGGCGCACTGTCCCGCCTCCAACACTAGACTGAACTCGGGGCTT
TGCCCCGTCAGGAAGCTGCTAGACAGAGGCATCAGAGTGGGCCTCGGTACCGATATTTCT
GGTGGTGACAGCGCTAGCATGTTAGACGCCATGCGGCGAGCCATGGATGTGTCGCTGCAC
TTGGCCATGATGGGCCAACCTCATACCACTTTAGACTGGAAGGAAGCTTTCTACCTCGCG
ACGCTAGGCGGCGCTAGAGCTTTACGACTGGAAGATAAAATTGGCAGTTTCGACGTTGGT
AAACAGTTCGATGCTTTACTAATAGACGTTTACGCGAAGGGCGGACCGATCGATAAATAC
GCGTACAATAGTGGCGAGCACGCGCTGGTCGAGTTGGTGCAGAGATTTGTTTATTTGGGG
GACGACAGGAACATAAGGCAGGTGTACGTGAACGGGGAAACTATAAAGGATTTTATTATA
TGA

Protein sequence:

MTGKFVFSGNLVTSDVSMKLKCFRGYIEVVDGLITKIGNIEEFQKQIDNFSDSKVVTLGE
NEFLMPGFVDCHTHAPQYPNIGLGLDLPLLEWLNKYTFPLERHYSDAEFAANVYDIVVRR
LINNGTTTACYFGSLHLEGTMKLVKYVVEHKQRALVGKVSMNVENDAGYYNKTEKELQEV
EQFIKRVLSYKGAPSPSHCRERSCNSAWDLRPRASYVSVLTIVAFTLERYLAICHPLHIY
AVAGLRRALRIVLSLWVLSLLAASPFAHYTTVNYHEYPPKSKYIFQSHISENKKEIEYVL
ETNPECASYTDVYQRCGIFNGPCIMAHAVHLTEDEISEFSIRQVSVAHCPASNTRLNSGL
CPVRKLLDRGIRVGLGTDISGGDSASMLDAMRRAMDVSLHLAMMGQPHTTLDWKEAFYLA
TLGGARALRLEDKIGSFDVGKQFDALLIDVYAKGGPIDKYAYNSGEHALVELVQRFVYLG
DDRNIRQVYVNGETIKDFII
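
Consistency check (sketch): the CDS length of 1503 nt corresponds to 501 codons, that is, 500 residues plus the terminal TGA stop codon, which matches the protein sequence above. The snippet below is a minimal illustration of that arithmetic and is not part of the database record. The function names `clean`, `residues_from_cds_length` and `expected_protein_length` are hypothetical, and the stop-codon set assumes the standard genetic code.

```python
# Minimal sketch: relate a CDS to the length of its translated protein.
# Assumes the standard genetic code; all names here are illustrative.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Join a multi-line sequence block into one uppercase string."""
    return "".join(seq.split()).upper()


def residues_from_cds_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS that ends with a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the last codon is the stop codon


def expected_protein_length(cds: str) -> int:
    """Check the reading frame and stop codon, then return the residue count."""
    cds = clean(cds)
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("CDS does not end with a stop codon")
    return residues_from_cds_length(len(cds))


# CDS Length field of this record: 1503 nt.
print(residues_from_cds_length(1503))

# First four codons of the nucleotide sequence above plus the final TGA,
# split across two lines the way the record wraps its sequences.
print(expected_protein_length("ATGACGGGAAAG\nTGA"))
```

Passing the full nucleotide block from this record to `expected_protein_length` should give the same count as the number of letters in the protein block once line breaks are removed.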