Genomic Position | scaffold927:+ 138176-148933 |
---|---|
See gene structure | |
CDS Length | 1839 |
Paired RNAseq reads   | 384 |
Single RNAseq reads   | 1038 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010352 (1e-47) |
Best Drosophila hit   | CG6465 (1e-41) |
Best Human hit | aminoacylase-1 isoform a (2e-34) |
Best NR hit (blastp)   | aminoacylase-1 [Glossina morsitans morsitans] (3e-52) |
Best NR hit (blastx)   | aminoacylase-1 [Glossina morsitans morsitans] (4e-50) |
GeneOntology terms    | GO:0004046 aminoacylase activity GO:0005515 protein binding GO:0005737 cytoplasm GO:0008237 metallopeptidase activity GO:0006520 cellular amino acid metabolic process GO:0006508 proteolysis |
InterPro families    | IPR001261 ArgE/DapE/ACY1/CPG2/YscS, conserved site IPR011333 BTB/POZ fold IPR013069 BTB/POZ IPR002933 Peptidase M20 IPR007588 Zinc finger, FLYWCH-type IPR000210 BTB/POZ-like |
Orthology group | ND |
Nucleotide sequence:
ATGGTACAGTCTCATTTTTCACTTTCGTGGGATTCGTATAAGTCAAATCTGTCAACTGGA
TTTAGTGGTTTGCAACAGAATGGAGAACTCGTTGATATGACGTTGGCTGCTGATGGGCAC
TTCGTTAAAGTGCATCAGGTTCTAATAGCTTTGTCCAGCTCCTATTTAAAACAACTTATT
CTATCGGCTCCGTGTCAACACCCTGTCATCTTCCTCAATAATGTCTCCAATACAACTCTA
ACATTCCTGTTAGAATACATATACACAGGACAGGTGTCTGTGCCTTCTAGTAATTTGTCA
GCATTTATTGAAGCCGCGAAAGCTTTACACATAAAAGGGCTTGAGAATGTTGAGGACAAC
AGCAAAGAAAAGCAAACACCAATTAATGTGGATGAATGTAATATAAGCTCTAATAGAATT
GCCGTTAAAAGAAAATCCAGTGCACCCGATCAGTTTCATCTACCGGCCACAGCAAGAAAA
GTTCTAGTTAAATATGGAGGAGCCACATCCACGGTAATCGTGGAAAAGAAGAATCTTAAT
GAATCCATGGATACCGATGAGGTACACAACACAATGGACCATACTGCACTCGAAGCGGAC
ACTCACTTGACTGATGAAACACAGAAGGATAAAGACAAAGGAGCAGTTCAAATGGCGTCG
TCCAATTTACAATATACGGTTTCGATACGGGGCTCCTTGCAAGTTATATTAAACAGATAT
ATATACAATTTACACTCGTCTCAGTCAACTGGTGTCCGTCGTTGGAGATGTTGCGATTAT
CGCAATAAGAAATGCAGCGCGTTTGTCGTGACACAGGATAATGTTGTCTTAAACAGAGCG
AATCCCCACAACCATTCGTTTCACGATAAGAAGATTCTGGATAAAATTGAAAAGAACGCT
ATCTACACGGCCATAGACGACGTCAAAGGCGTCATAGACAAAGAGAAGCCCAAAGACAAT
CTGCCAATGGAAAGCGAATTCGTAGGACTCTCTTTTAATACGGAATTAGATGATTTAACG
AATGATCCGAAAACAGTGTCGTCGTATAAATGTAACAACGCGATAGATCAAGACGCAATG
CTTGGCTTTGTGTACCATTTATATGTAATGGTGAGCGTGGTTTTGTGCAATCCTATACAT
TATAATTATACATTAAAAGATTTCAATAATAATCCCGCTGTCAAGAAATTACAGGAATAT
ATAACGATAGACTCGAGCCGTGTAGAAAATATCGAATTAGTAGTTGACTTCTGGAAGAGG
CAAGCAGCGGATGTCGGCCTGTCTTTTGCGGTGTATAGACCAGCTGTGTTGCCAATATGT
GTACTTACCTTAATAGGTCGTCAGCCGGACCTGCCTAGCATTATGCTAAATCATCACGGA
GATGTGGTCCCAGCATACCACAGCATGTGGAAGTATCCTCCATATTCGGCACATATTGAT
GAAAACGGCGATTTATACGGACGAGGGGCCCAAGACACTAAAAGTGTTGGAATACAATAT
ATAGAAGCTGTTAGAAGACTAATAAAAAATAACGTAACATTAGAAAGGACGTTACATCTT
ACTGTTATGCCAGATGAGGAATACGGCGGTAGCAAAGGTATCAAAGCTTTTATTTTGACG
GATGTTTTTAAATCATTAAACATTGGATTTGCATTAGATGAAGGATTTACATCTGAAGAT
GACGTGATGCTCGCGTCTTACCAGGATAAGAGACCAGTTCAGGTGCGATTCAATATTATC
GGTCAAGGTGGTCATGGTTCATCATTGGTGAACGGAAGTGCTATAGAAAAGGTGCAATAC
CTATTGAACACCGCCTTAGAATTTAGAAAGAGAAAATGA
Protein sequence:
MVQSHFSLSWDSYKSNLSTGFSGLQQNGELVDMTLAADGHFVKVHQVLIALSSSYLKQLI
LSAPCQHPVIFLNNVSNTTLTFLLEYIYTGQVSVPSSNLSAFIEAAKALHIKGLENVEDN
SKEKQTPINVDECNISSNRIAVKRKSSAPDQFHLPATARKVLVKYGGATSTVIVEKKNLN
ESMDTDEVHNTMDHTALEADTHLTDETQKDKDKGAVQMASSNLQYTVSIRGSLQVILNRY
IYNLHSSQSTGVRRWRCCDYRNKKCSAFVVTQDNVVLNRANPHNHSFHDKKILDKIEKNA
IYTAIDDVKGVIDKEKPKDNLPMESEFVGLSFNTELDDLTNDPKTVSSYKCNNAIDQDAM
LGFVYHLYVMVSVVLCNPIHYNYTLKDFNNNPAVKKLQEYITIDSSRVENIELVVDFWKR
QAADVGLSFAVYRPAVLPICVLTLIGRQPDLPSIMLNHHGDVVPAYHSMWKYPPYSAHID
ENGDLYGRGAQDTKSVGIQYIEAVRRLIKNNVTLERTLHLTVMPDEEYGGSKGIKAFILT
DVFKSLNIGFALDEGFTSEDDVMLASYQDKRPVQVRFNIIGQGGHGSSLVNGSAIEKVQY
LLNTALEFRKRK