New model in OGS2.0 | DPOGS215632  |
---|---|
Genomic Position | scaffold1007:+ 15475-59025 |
See gene structure | |
CDS Length | 3684 |
Paired RNAseq reads   | 399 |
Single RNAseq reads   | 1205 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003532 (2e-19) |
Best Drosophila hit   | CG32627, isoform C (6e-82) |
Best Human hit | cytosolic carboxypeptidase 1 (6e-132) |
Best NR hit (blastp)   | PREDICTED: similar to ATP/GTP binding protein 1 [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to ATP/GTP binding protein 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0008270 zinc ion binding GO:0008233 peptidase activity GO:0008237 metallopeptidase activity GO:0005634 nucleus GO:0006508 proteolysis GO:0004181 metallocarboxypeptidase activity GO:0046872 metal ion binding GO:0005488 binding GO:0005737 cytoplasm |
InterPro families   | IPR000834 Peptidase M14, carboxypeptidase A |
Orthology group | MCL16762 |
Nucleotide sequence:
ATGAATGAGAAATGTCAGGATTTGGTTATCGGAGCCGACTGCTCGGCGCAAACACAGGAA
ATCAAAAAACCAAAGAAAAGAGCGTATCGCATGGCGGACGACGCGGGCGACTGCCTGTTC
GAGCGGCTCCGACTGCACCAGCAACGCGCCCCGGACGCGACTGAGGTCGCGCGTGCTATC
ACAGCTAGAATCAACTCGCGTCTCACATCACACGACAAACACATCCGACAGAGCACTCTT
GACAAACTATGGAACAAGCAAACTGGTGCGATACAAATGCTTCTATCTATATTAGAGAAT
TCAAGAGATACAGCGACATCAACCTATATAACATCAATATTTAGAGAAGCTCTCTGCCTT
AAACAAGGAAAAGGAAAAAAATGTTCAGTAGCGAATGAAGCCTTGGGGTCGAAAAAGAAG
GAAAGCAAAAAAGGCAAAGAAAATAAAACACCACTGAATAAAAAGGCAAACAACGTAGCG
CGACAGCAGTGCTCACAACAATTCATCGCCCTAAACGGCACGCAAGCGGTGATACGTACA
CTACTTGCCACACATAGCAAACGCGACAGCAATGTTAGCACGGAACTAATGCTACAAGAC
CTCGTCTGGATACTTGCAGCACTTGCTCCAAGAGCAAACAACATACAACGGCAGCAATGT
GCGCAGGATTTTGTGGAAATGAAAGGAACACAGACCATTATTAGACGAGTGATATCAACG
GTCCATGATAAGAGGGAGCATCATGCTGGGGTGGAATTAGTATTAAATGATCTGGTGTGG
ATTCTCGCTCCGTTGGGGTCAAAAGATCCAAAATTTGCAATGAAGGTGAGAATGTTAGGA
TGCGTTAGGACGATGCATCTAATTTTGAAAGGACATTTCACTGACAATAAGTTGATATTT
CCACTATTAGTCATCATGAAACAGCTAGCCAAAAATTCCGTAACAACTTCCATTTTAATA
CGCGACGGAGTCATCGCGACTTACGATCGAGTGTTAATAAGTCTAGGCTTCATTCCTACA
GCGAGACTGAGACTCTGTTTGGACGCTATAGACTACTTTAGCAAGAACAAGGTGTGCTGC
ATGCAGATTGTGAAGACAGGACTGTGTGGAGTGCTAATAAGAGTGTTCGACCGCTGGGAC
CGCTACGAGGGGCGCATGAGGCTCAAGATATGCGCACACATCCTCCAGACATTACAGCAC
CTCTGCAACATCAAGGCTGGACGTCGTGCTCTTTGCACCAAGAAGCACGTGCAGACCCTC
CACAGGTTCTGTTCCCAGTGTCCTGATGAAATCGAGTTCGACGGACTGTTGGCTAGAGTC
TGTTCCGTCATAACATTGTGTCTCAAGCATCAGGCATTACCAGTGCCATCATCTAGTCCA
GCTACCTTCAACCTGAACCCTATACTTAAAGGAACAAATACCACATGGCCATGTCACGAA
GATGATGATGACGGCGGAAACTCAGACTCGAAGACAATCAATTCGGATTTGGAAGACGAT
GCTCCTGACAGTGATAACGAGGTCATTGATGACTTCCCAGATATTGATTTCGAAGAAAAT
GACTTAAAGAATAATGAAGATTTGGAGAAATCACATACAAAAAGTGGGGAGAGCATACAG
AGTGCACTGTGGATCAATCCCAATGAGAGAGATATCGAAGACTTAAAGAGATACTATATT
TTCTTCAAGGAGTTCGGTTCCTATAACAAGCAAATAAGGTTGGTAAAAAGCCGGTCCAAT
TCCCGGGGGTCCATACTAGATGACATTTTTATATCTCAAAATAGCGCTAACCGAAGCCAG
CCGTCGCCGACAAATCTTTCTTTAACTGCAGTACTTGGGAATACCGACTACGACAGCGCT
TTAGGATCATCTCAAACGCTTTCATTCCTGCAAGGATATCATAAGATACACGAAAGCACT
TCGACAACATCTTGTTCTTCCTTAAAAATCCATAAAGATATATCAAAATACAGTCCGCTC
GAGTCAGTTTATTCAATAATATCATCGAGAGTTAAAAGCATCATTCCATTTGTAAAAGTT
GCCTACCCAGATATGACAGGCGGGCAGGGTGCAACACAACCAGAGCCATTAAATAAAATG
GAGAGAACAGCTTGCAGAAATAAATTACTCGCTTGCGTCGAGAGAGCAATTAATCCGGAA
GCGTATATGAATGAAGTTGTGTATGATCTGGATGCTTTGAACAGTTCGAGCTCAAACGCA
GACACGACTTCGCAGAAAAGCAACGAGAGTTTATTTTTAATTAACACCGACGAACAAGAA
ATAACAAAAGTCAATAGTTTCTCATCGAGACTAAATTTTGAATCGAGATTTGAGTCGGGA
AATTTAAGAAAAGCCATACAGGTAGGTCCAAGAGAATATGAATTAATTTTAATGCCAGAC
GTAAATTCTCCAAAACGGCATCAGTGGTTTTACTTCGAAGTGCGTAATATGCAACAGGGA
CGGCCCTATATATTTAATATTGTGAATTGTGAGAAATCAGATAGCCAATTCAACTTCGGC
ATGAAGCCTGTTATGTATTCTGTGAAGGAAGCCGTCCTTGGAAGACCCGGGTGGGTGAGA
GCCGGTTCGGACATTTGCTATTACAGGAACAGCTACCACTATTCCAATCAAAGAAACAAC
AAGTGCTACCTAACAGTTACGTTCAACATCGACTTTCCCCACACAAACGACGTCTGCTAC
CTCGCTTACCACTTCCCATTCACTTACTCCATGATGATGACTAGAATTTTCCAATGGAGT
TCTCAATTGCCTCCTGGCGCTTATCTACGAGCTGAGCCCTTATGTTATACACTTAACAAC
AACGAAGTTCCTCTGTTGACTATATCAGCTGATGATACTCCGTCCAATCCCATAGTTGAC
AGGGAGATAGTATTCCTTACGGCTCGAGTCCACCCTGGTGAAAGCAACGCGTCCTGGGTA
ATGGATGGAACGCTGCGTTTCCTGCTCACAGACACTTCATCCGCAGCGGCCCTCCGTAAC
AAGTACGTGTTCAAAATCGTGCCGATGCTCAACGTCGAAGGTGTCGTTAATGGCTGCCAT
CGATGCGGCTTAACTAATGAAGATTTAAATCGACGCTGGTGCAAGCCGAGCCCCGTTTTG
CATCCTTCTATTTACCATACCAAGGGCTTAATAGAATATTTGGTGCGTGTTTGGAAGAAA
CCTCCGGTAGTTTATTGCGACTACCACGGTCATTCGCGCAAGAAGAACGTGTTCTTTTAC
GGTTGCGCCGGCGCAGAGAGCTGGTGCAGCAACGACCGGCTTGTCCCGGACGAGCCTGTT
AAATATCTCATGCTTCCAGCTTTAATGCACCGGCTATCACCGGCGTTCGCTCTTGGTTCG
TGTTCCTTTCGTGTTGAACGTGAGCGTGAGAGCACAGCGCGAGTCACTGTGTGGCGCCAC
CTAGGAGTCACACGGTCCTACACTATGGAAGCATCATTTTGTGGATTTGATAGGGGACCG
TTTAAAGGATTTCATCTCAACACCCAGCATCTGCAGAGCGTGGGCAGTGACTTTTGCGAA
GCTCTCAACGGTCTCGGAGATACAGCCAACAATGTTGACATACAACTCACTAAAGATCTC
AATGGCGAAATAGCAATAGACAGTGAAGCTGGCTCGGGGTCGGACAGCGTGTTGAAAACA
GATTCGGATGAAGATTTCGATTAG
Protein sequence:
MNEKCQDLVIGADCSAQTQEIKKPKKRAYRMADDAGDCLFERLRLHQQRAPDATEVARAI
TARINSRLTSHDKHIRQSTLDKLWNKQTGAIQMLLSILENSRDTATSTYITSIFREALCL
KQGKGKKCSVANEALGSKKKESKKGKENKTPLNKKANNVARQQCSQQFIALNGTQAVIRT
LLATHSKRDSNVSTELMLQDLVWILAALAPRANNIQRQQCAQDFVEMKGTQTIIRRVIST
VHDKREHHAGVELVLNDLVWILAPLGSKDPKFAMKVRMLGCVRTMHLILKGHFTDNKLIF
PLLVIMKQLAKNSVTTSILIRDGVIATYDRVLISLGFIPTARLRLCLDAIDYFSKNKVCC
MQIVKTGLCGVLIRVFDRWDRYEGRMRLKICAHILQTLQHLCNIKAGRRALCTKKHVQTL
HRFCSQCPDEIEFDGLLARVCSVITLCLKHQALPVPSSSPATFNLNPILKGTNTTWPCHE
DDDDGGNSDSKTINSDLEDDAPDSDNEVIDDFPDIDFEENDLKNNEDLEKSHTKSGESIQ
SALWINPNERDIEDLKRYYIFFKEFGSYNKQIRLVKSRSNSRGSILDDIFISQNSANRSQ
PSPTNLSLTAVLGNTDYDSALGSSQTLSFLQGYHKIHESTSTTSCSSLKIHKDISKYSPL
ESVYSIISSRVKSIIPFVKVAYPDMTGGQGATQPEPLNKMERTACRNKLLACVERAINPE
AYMNEVVYDLDALNSSSSNADTTSQKSNESLFLINTDEQEITKVNSFSSRLNFESRFESG
NLRKAIQVGPREYELILMPDVNSPKRHQWFYFEVRNMQQGRPYIFNIVNCEKSDSQFNFG
MKPVMYSVKEAVLGRPGWVRAGSDICYYRNSYHYSNQRNNKCYLTVTFNIDFPHTNDVCY
LAYHFPFTYSMMMTRIFQWSSQLPPGAYLRAEPLCYTLNNNEVPLLTISADDTPSNPIVD
REIVFLTARVHPGESNASWVMDGTLRFLLTDTSSAAALRNKYVFKIVPMLNVEGVVNGCH
RCGLTNEDLNRRWCKPSPVLHPSIYHTKGLIEYLVRVWKKPPVVYCDYHGHSRKKNVFFY
GCAGAESWCSNDRLVPDEPVKYLMLPALMHRLSPAFALGSCSFRVERERESTARVTVWRH
LGVTRSYTMEASFCGFDRGPFKGFHLNTQHLQSVGSDFCEALNGLGDTANNVDIQLTKDL
NGEIAIDSEAGSGSDSVLKTDSDEDFD