DPGLEAN12724 in OGS1.0

Genomic Positionscaffold927:+ 138176-148933
See gene structure
CDS Length1839
Paired RNAseq reads  384
Single RNAseq reads  1038
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010352 (1e-47)
Best Drosophila hit  CG6465 (1e-41)
Best Human hitaminoacylase-1 isoform a (2e-34)
Best NR hit (blastp)  aminoacylase-1 [Glossina morsitans morsitans] (3e-52)
Best NR hit (blastx)  aminoacylase-1 [Glossina morsitans morsitans] (4e-50)
GeneOntology terms




  
GO:0004046 aminoacylase activity
GO:0005515 protein binding
GO:0005737 cytoplasm
GO:0008237 metallopeptidase activity
GO:0006520 cellular amino acid metabolic process
GO:0006508 proteolysis
InterPro families




  
IPR001261 ArgE/DapE/ACY1/CPG2/YscS, conserved site
IPR011333 BTB/POZ fold
IPR013069 BTB/POZ
IPR002933 Peptidase M20
IPR007588 Zinc finger, FLYWCH-type
IPR000210 BTB/POZ-like
Orthology groupND

Nucleotide sequence:

ATGGTACAGTCTCATTTTTCACTTTCGTGGGATTCGTATAAGTCAAATCTGTCAACTGGA
TTTAGTGGTTTGCAACAGAATGGAGAACTCGTTGATATGACGTTGGCTGCTGATGGGCAC
TTCGTTAAAGTGCATCAGGTTCTAATAGCTTTGTCCAGCTCCTATTTAAAACAACTTATT
CTATCGGCTCCGTGTCAACACCCTGTCATCTTCCTCAATAATGTCTCCAATACAACTCTA
ACATTCCTGTTAGAATACATATACACAGGACAGGTGTCTGTGCCTTCTAGTAATTTGTCA
GCATTTATTGAAGCCGCGAAAGCTTTACACATAAAAGGGCTTGAGAATGTTGAGGACAAC
AGCAAAGAAAAGCAAACACCAATTAATGTGGATGAATGTAATATAAGCTCTAATAGAATT
GCCGTTAAAAGAAAATCCAGTGCACCCGATCAGTTTCATCTACCGGCCACAGCAAGAAAA
GTTCTAGTTAAATATGGAGGAGCCACATCCACGGTAATCGTGGAAAAGAAGAATCTTAAT
GAATCCATGGATACCGATGAGGTACACAACACAATGGACCATACTGCACTCGAAGCGGAC
ACTCACTTGACTGATGAAACACAGAAGGATAAAGACAAAGGAGCAGTTCAAATGGCGTCG
TCCAATTTACAATATACGGTTTCGATACGGGGCTCCTTGCAAGTTATATTAAACAGATAT
ATATACAATTTACACTCGTCTCAGTCAACTGGTGTCCGTCGTTGGAGATGTTGCGATTAT
CGCAATAAGAAATGCAGCGCGTTTGTCGTGACACAGGATAATGTTGTCTTAAACAGAGCG
AATCCCCACAACCATTCGTTTCACGATAAGAAGATTCTGGATAAAATTGAAAAGAACGCT
ATCTACACGGCCATAGACGACGTCAAAGGCGTCATAGACAAAGAGAAGCCCAAAGACAAT
CTGCCAATGGAAAGCGAATTCGTAGGACTCTCTTTTAATACGGAATTAGATGATTTAACG
AATGATCCGAAAACAGTGTCGTCGTATAAATGTAACAACGCGATAGATCAAGACGCAATG
CTTGGCTTTGTGTACCATTTATATGTAATGGTGAGCGTGGTTTTGTGCAATCCTATACAT
TATAATTATACATTAAAAGATTTCAATAATAATCCCGCTGTCAAGAAATTACAGGAATAT
ATAACGATAGACTCGAGCCGTGTAGAAAATATCGAATTAGTAGTTGACTTCTGGAAGAGG
CAAGCAGCGGATGTCGGCCTGTCTTTTGCGGTGTATAGACCAGCTGTGTTGCCAATATGT
GTACTTACCTTAATAGGTCGTCAGCCGGACCTGCCTAGCATTATGCTAAATCATCACGGA
GATGTGGTCCCAGCATACCACAGCATGTGGAAGTATCCTCCATATTCGGCACATATTGAT
GAAAACGGCGATTTATACGGACGAGGGGCCCAAGACACTAAAAGTGTTGGAATACAATAT
ATAGAAGCTGTTAGAAGACTAATAAAAAATAACGTAACATTAGAAAGGACGTTACATCTT
ACTGTTATGCCAGATGAGGAATACGGCGGTAGCAAAGGTATCAAAGCTTTTATTTTGACG
GATGTTTTTAAATCATTAAACATTGGATTTGCATTAGATGAAGGATTTACATCTGAAGAT
GACGTGATGCTCGCGTCTTACCAGGATAAGAGACCAGTTCAGGTGCGATTCAATATTATC
GGTCAAGGTGGTCATGGTTCATCATTGGTGAACGGAAGTGCTATAGAAAAGGTGCAATAC
CTATTGAACACCGCCTTAGAATTTAGAAAGAGAAAATGA

Protein sequence:

MVQSHFSLSWDSYKSNLSTGFSGLQQNGELVDMTLAADGHFVKVHQVLIALSSSYLKQLI
LSAPCQHPVIFLNNVSNTTLTFLLEYIYTGQVSVPSSNLSAFIEAAKALHIKGLENVEDN
SKEKQTPINVDECNISSNRIAVKRKSSAPDQFHLPATARKVLVKYGGATSTVIVEKKNLN
ESMDTDEVHNTMDHTALEADTHLTDETQKDKDKGAVQMASSNLQYTVSIRGSLQVILNRY
IYNLHSSQSTGVRRWRCCDYRNKKCSAFVVTQDNVVLNRANPHNHSFHDKKILDKIEKNA
IYTAIDDVKGVIDKEKPKDNLPMESEFVGLSFNTELDDLTNDPKTVSSYKCNNAIDQDAM
LGFVYHLYVMVSVVLCNPIHYNYTLKDFNNNPAVKKLQEYITIDSSRVENIELVVDFWKR
QAADVGLSFAVYRPAVLPICVLTLIGRQPDLPSIMLNHHGDVVPAYHSMWKYPPYSAHID
ENGDLYGRGAQDTKSVGIQYIEAVRRLIKNNVTLERTLHLTVMPDEEYGGSKGIKAFILT
DVFKSLNIGFALDEGFTSEDDVMLASYQDKRPVQVRFNIIGQGGHGSSLVNGSAIEKVQY
LLNTALEFRKRK