DPGLEAN03745 in OGS1.0

New model in OGS2.0DPOGS200463 
Genomic Positionscaffold3735:+ 4139-8601
See gene structure
CDS Length1350
Paired RNAseq reads  381
Single RNAseq reads  1099
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011188 (1e-157)
Best Drosophila hit  nitrilase and fragile histidine triad fusion protein (7e-106)
Best Human hitnitrilase homolog 1 isoform 3 (2e-71)
Best NR hit (blastp)  nitrilase and fragile histidine triad fusion protein NitFhit [Culex quinquefasciatus] (3e-124)
Best NR hit (blastx)  nitrilase and fragile histidine triad fusion protein NitFhit [Culex quinquefasciatus] (3e-119)
GeneOntology terms



  
GO:0047710 bis(5'-adenosyl)-triphosphatase activity
GO:0006139 nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:0000257 nitrilase activity
GO:0005575 cellular_component
InterPro families


  
IPR003010 Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
IPR001310 Histidine triad (HIT) protein
IPR011146 Histidine triad-like motif
IPR011151 Histidine triad motif
Orthology groupMCL12280

Nucleotide sequence:

ATGACTTCAGTTGCAGATAAATCAGCAAATTTAAATGTTGTCAGTCAGTTAATAAGCGAT
GCTGCAAAAGATGATGTTAAGATGTTATTTTTCCCTGAATGCTGCGATTATATTTGTGAG
AACAAAGACGAAACAATTAGATCGGCTGAAAATCTTTTGACGGGTGAAACTGTTAAGAAA
TACAGGGAATTGGCCGCTACGCACAATGTGTGGTTGTCAATGGGCGGATTACATGAAAAG
GATGAAGCGAGCGTAGATAAGATATTCAATACACATATAATAATTAATGATAAAGGCGAC
ATAGTACAAACATACAGAAAATTACACTTGTTTGATGTTGACATACCGGAGAGAAATATA
CGTCTGAAGGAGAGCGACTTCTGTAACCCCGGAGGGCATATAGTTGCGCCTGTTGACACA
CCGATTGGCAAGATTGGCCTTTCAATATGTTATGACCTTCGATTCCCCGAGCTCAGTACA
TCTCTAAGTATGATGAAAGCTGAAATACTAACCTATCCTTCTGCCTTTACTTATGCTACT
GGCTTGGCTCATTGGCATATACTATTAAGAGCAAGGGCAATAGAGAATCAATGCTACGTG
GTAGCGGCGGCTCAAACGGGGCAGCACAATGCTAAAAGACGCTCCTTCGGACATGCGCTC
GTAGTGGACCCGTGGGGCGAAGTCCTAGCCGACTGCGGAGACTCCGCTCCTTGTTACAAG
GTTGTCGAAATTACTGATAGATTGCAAGAAGTGAGGAAAAACATGCCCGTGTTCCAACAC
AGACGGCCGGATGTGTACTCCCTGTATTCTTTAAGTATCCGCAACAAACCGTTCAATGAG
CCTCCGCCCCCGCCGCCCCGGACTCCGCCCCTCGCCACGACCGGGAACGTGTTCGGTCAC
GTATCCGTTCCGGAAACGTGCGTCTTCCACAAGTCGGAACTGACTTACGCGTTTGTCAAC
TTACGTTGTGTGACCCCGGGCCATGTATTGGTAGCGCCTATAAGGTTGGCAGAGAGGAAT
AAAGATTTGACAGACGAAGAAGCAAGTGACTTCTTTAAAACCGTGAGATTAATACAAAAC
CTAATGGAACGAGTTCACAATACAGAGTCGTGTACCGTCACTATACAGGACGGACCAGAC
GCGGGGCAAACCGTGAAGCATCTGCACTGCCATATAATGCCAAGGAAGAAAGGAGATTTC
ATTGAAAATGATTTGATATACTTGGAGCTAGCGAAACATGATCAGATGAGGTCAGGTCAC
CCAGCGAAGCCAGCCAGGAGTTTGGAAGAAATGGAAGCAGAAGCGAAATACCTCAGAGAA
GAGTTGAAGAAGATGACAGAGACCAGCTAG

Protein sequence:

MTSVADKSANLNVVSQLISDAAKDDVKMLFFPECCDYICENKDETIRSAENLLTGETVKK
YRELAATHNVWLSMGGLHEKDEASVDKIFNTHIIINDKGDIVQTYRKLHLFDVDIPERNI
RLKESDFCNPGGHIVAPVDTPIGKIGLSICYDLRFPELSTSLSMMKAEILTYPSAFTYAT
GLAHWHILLRARAIENQCYVVAAAQTGQHNAKRRSFGHALVVDPWGEVLADCGDSAPCYK
VVEITDRLQEVRKNMPVFQHRRPDVYSLYSLSIRNKPFNEPPPPPPRTPPLATTGNVFGH
VSVPETCVFHKSELTYAFVNLRCVTPGHVLVAPIRLAERNKDLTDEEASDFFKTVRLIQN
LMERVHNTESCTVTIQDGPDAGQTVKHLHCHIMPRKKGDFIENDLIYLELAKHDQMRSGH
PAKPARSLEEMEAEAKYLREELKKMTETS