New model in OGS2.0 | DPOGS204431  |
---|---|
Genomic Position | scaffold1355:- 3285-8763 |
See gene structure | |
CDS Length | 1302 |
Paired RNAseq reads   | 2033 |
Single RNAseq reads   | 5421 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007728 (0.0) |
Best Drosophila hit   | CG17337 (3e-152) |
Best Human hit | cytosolic non-specific dipeptidase isoform 1 (1e-142) |
Best NR hit (blastp)   | glutamate carboxypeptidase [Aedes aegypti] (4e-170) |
Best NR hit (blastx)   | glutamate carboxypeptidase [Aedes aegypti] (7e-160) |
GeneOntology terms    | GO:0005515 protein binding GO:0006508 proteolysis GO:0008237 metallopeptidase activity |
InterPro families    | IPR001261 ArgE/DapE/ACY1/CPG2/YscS, conserved site IPR002933 Peptidase M20 IPR011650 Peptidase M20, dimerisation |
Orthology group | MCL14127 |
Nucleotide sequence:
ATGGTATACTGGATGAGGGACAAGTTAAAAGAAGTCGGTGCATCGACTGAAATAAGAGAT
GTAGGTTATCAGAATTTCGATGGCAAGGAAGTAAAACTACCACCGGTTCTGGTTGGCGTT
CTTGAAAATGATCCCAAAAAAAATACAATCTGCATTTATGGCCATTTAGATGTCCAACCT
GCTCTTAAATCTGATGGCTGGGAATCTGAACCATTTGATTTAGTGGAGCGTGATGGAAAG
TTGTTTGGTAGAGGTGCTACAGATGATAAAGGACCAGTACTCGGTTGGCTTCATGCTATC
AATGCATACAAAGCTACTGGCGAGGAGCTGCCAGTGAATCTCAAATTCGTATTTGAATGT
ATGGAAGAATCTGGTTCAGAGGGTCTTGATGAGTTGCTAATGCAGAAATTGAAGCCGGAA
GGTTTCTTTGATTCCGTGGACTTTGTCTGTATTTCTGACAACTATTGGCTGGGAACCACT
AAACCTTGCATCACTTACGGTCTGAGAGGCATTAGCTATTATTTCTTGGAGGTTGAATGC
GCTAAAATGGATCTCCACAGTGGTGTATATGGAGGAACTGTACATGAAGCCATGTCCGAT
CTCATATACCTTATGAACACTCTGGTTGATAAAGATGGTAAGATCTTAATCACCGACATA
TACAAGTCGGTAGCACCGCTCACAGATAATGAACAGAAACTGTACAATACAATCGACTTC
AACCCAGAGGCCTACAGACAATCAATAAGCGCCCATAAACTGGCCCACAATGGTGTAAAG
GAACAACTACTGATGCACCGATGGAGGTATCCAAGCCTGTCACTCCATGGAATTGAAGGC
GCTGCCTTCCAGCCTGGTGCGAAGACTGTCATCCCCGGGAAGGTCATTGGCAAATTCTCA
ATTCGTATCGTCCCTAACCAGGAGCCGGAGGAAGTCGAGAAACTTGTGTTTGACTATGTT
CACAAGAAGTGGGAAGAACGCGGGTCTCCCAACAAGATGCGTATAACTGCTCAGTCCGGA
CGCGCTTGGACCGAGAACCCTGAACATCCACACTACCAGGCCGCTGCTAGAGCCACACGA
CTCATATACAAGACTGAGCCGGACATGTCTCGTGAGGGTGGATCCATACCAGTGACGATC
ACGCTCCAAGAGGCCAGCGCCAAGAACGTGCTGCTGCTGCCCATGGGCGCGGGAGACGAT
ATGGCGCACTCACAGAACGAGAAGATCAACGTCCGGAACTATATAGAGGGGATCAAACTC
TTCGCTGCATACTTATATGAAGTCGGTAAACTACCTAAATAG
Protein sequence:
MVYWMRDKLKEVGASTEIRDVGYQNFDGKEVKLPPVLVGVLENDPKKNTICIYGHLDVQP
ALKSDGWESEPFDLVERDGKLFGRGATDDKGPVLGWLHAINAYKATGEELPVNLKFVFEC
MEESGSEGLDELLMQKLKPEGFFDSVDFVCISDNYWLGTTKPCITYGLRGISYYFLEVEC
AKMDLHSGVYGGTVHEAMSDLIYLMNTLVDKDGKILITDIYKSVAPLTDNEQKLYNTIDF
NPEAYRQSISAHKLAHNGVKEQLLMHRWRYPSLSLHGIEGAAFQPGAKTVIPGKVIGKFS
IRIVPNQEPEEVEKLVFDYVHKKWEERGSPNKMRITAQSGRAWTENPEHPHYQAAARATR
LIYKTEPDMSREGGSIPVTITLQEASAKNVLLLPMGAGDDMAHSQNEKINVRNYIEGIKL
FAAYLYEVGKLPK