New model in OGS2.0 | DPOGS200463  |
---|---|
Genomic Position | scaffold3735:+ 4139-8601 |
See gene structure | |
CDS Length | 1350 |
Paired RNAseq reads   | 381 |
Single RNAseq reads   | 1099 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011188 (1e-157) |
Best Drosophila hit   | nitrilase and fragile histidine triad fusion protein (7e-106) |
Best Human hit | nitrilase homolog 1 isoform 3 (2e-71) |
Best NR hit (blastp)   | nitrilase and fragile histidine triad fusion protein NitFhit [Culex quinquefasciatus] (3e-124) |
Best NR hit (blastx)   | nitrilase and fragile histidine triad fusion protein NitFhit [Culex quinquefasciatus] (3e-119) |
GeneOntology terms    | GO:0047710 bis(5'-adenosyl)-triphosphatase activity GO:0006139 nucleobase, nucleoside, nucleotide and nucleic acid metabolic process GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds GO:0000257 nitrilase activity GO:0005575 cellular_component |
InterPro families    | IPR003010 Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase IPR001310 Histidine triad (HIT) protein IPR011146 Histidine triad-like motif IPR011151 Histidine triad motif |
Orthology group | MCL12280 |
Nucleotide sequence:
ATGACTTCAGTTGCAGATAAATCAGCAAATTTAAATGTTGTCAGTCAGTTAATAAGCGAT
GCTGCAAAAGATGATGTTAAGATGTTATTTTTCCCTGAATGCTGCGATTATATTTGTGAG
AACAAAGACGAAACAATTAGATCGGCTGAAAATCTTTTGACGGGTGAAACTGTTAAGAAA
TACAGGGAATTGGCCGCTACGCACAATGTGTGGTTGTCAATGGGCGGATTACATGAAAAG
GATGAAGCGAGCGTAGATAAGATATTCAATACACATATAATAATTAATGATAAAGGCGAC
ATAGTACAAACATACAGAAAATTACACTTGTTTGATGTTGACATACCGGAGAGAAATATA
CGTCTGAAGGAGAGCGACTTCTGTAACCCCGGAGGGCATATAGTTGCGCCTGTTGACACA
CCGATTGGCAAGATTGGCCTTTCAATATGTTATGACCTTCGATTCCCCGAGCTCAGTACA
TCTCTAAGTATGATGAAAGCTGAAATACTAACCTATCCTTCTGCCTTTACTTATGCTACT
GGCTTGGCTCATTGGCATATACTATTAAGAGCAAGGGCAATAGAGAATCAATGCTACGTG
GTAGCGGCGGCTCAAACGGGGCAGCACAATGCTAAAAGACGCTCCTTCGGACATGCGCTC
GTAGTGGACCCGTGGGGCGAAGTCCTAGCCGACTGCGGAGACTCCGCTCCTTGTTACAAG
GTTGTCGAAATTACTGATAGATTGCAAGAAGTGAGGAAAAACATGCCCGTGTTCCAACAC
AGACGGCCGGATGTGTACTCCCTGTATTCTTTAAGTATCCGCAACAAACCGTTCAATGAG
CCTCCGCCCCCGCCGCCCCGGACTCCGCCCCTCGCCACGACCGGGAACGTGTTCGGTCAC
GTATCCGTTCCGGAAACGTGCGTCTTCCACAAGTCGGAACTGACTTACGCGTTTGTCAAC
TTACGTTGTGTGACCCCGGGCCATGTATTGGTAGCGCCTATAAGGTTGGCAGAGAGGAAT
AAAGATTTGACAGACGAAGAAGCAAGTGACTTCTTTAAAACCGTGAGATTAATACAAAAC
CTAATGGAACGAGTTCACAATACAGAGTCGTGTACCGTCACTATACAGGACGGACCAGAC
GCGGGGCAAACCGTGAAGCATCTGCACTGCCATATAATGCCAAGGAAGAAAGGAGATTTC
ATTGAAAATGATTTGATATACTTGGAGCTAGCGAAACATGATCAGATGAGGTCAGGTCAC
CCAGCGAAGCCAGCCAGGAGTTTGGAAGAAATGGAAGCAGAAGCGAAATACCTCAGAGAA
GAGTTGAAGAAGATGACAGAGACCAGCTAG
Protein sequence:
MTSVADKSANLNVVSQLISDAAKDDVKMLFFPECCDYICENKDETIRSAENLLTGETVKK
YRELAATHNVWLSMGGLHEKDEASVDKIFNTHIIINDKGDIVQTYRKLHLFDVDIPERNI
RLKESDFCNPGGHIVAPVDTPIGKIGLSICYDLRFPELSTSLSMMKAEILTYPSAFTYAT
GLAHWHILLRARAIENQCYVVAAAQTGQHNAKRRSFGHALVVDPWGEVLADCGDSAPCYK
VVEITDRLQEVRKNMPVFQHRRPDVYSLYSLSIRNKPFNEPPPPPPRTPPLATTGNVFGH
VSVPETCVFHKSELTYAFVNLRCVTPGHVLVAPIRLAERNKDLTDEEASDFFKTVRLIQN
LMERVHNTESCTVTIQDGPDAGQTVKHLHCHIMPRKKGDFIENDLIYLELAKHDQMRSGH
PAKPARSLEEMEAEAKYLREELKKMTETS