DPBloomfilter: Securing Bloom Filters with Differential Privacy
Yekun Ke, Yingyu Liang, Zhizhou Sha, Zhenmei Shi, Zhao Song
TL;DR
This work targets privacy leakage in Bloom filters used for membership queries by integrating a differential privacy mechanism. It introduces DPBloomfilter, which applies random response to every Bloom filter bit, achieving $(\epsilon,\delta)$-DP while preserving the standard Bloom filter’s running time. The authors provide per-bit privacy proofs, a quantified DP budgeting via a quantified bit-change variable $W$, and utility analyses that bound the impact on query accuracy. They accompany the theory with extensive simulations showing high utility under practical DP budgets and clear behavior as DP parameters vary. Overall, DPBloomfilter is the first approach to furnish differential privacy guarantees for Bloom filter membership queries without sacrificing efficiency, with meaningful implications for privacy-preserving large-scale data processing.
Abstract
The Bloom filter is a simple yet space-efficient probabilistic data structure that supports membership queries for dramatically large datasets. It is widely utilized and implemented across various industrial scenarios, often handling massive datasets that include sensitive user information necessitating privacy preservation. To address the challenge of maintaining privacy within the Bloom filter, we have developed the DPBloomfilter. This innovation integrates the classical differential privacy mechanism, specifically the Random Response technique, into the Bloom filter, offering robust privacy guarantees under the same running complexity as the standard Bloom filter. Through rigorous simulation experiments, we have demonstrated that our DPBloomfilter algorithm maintains high utility while ensuring privacy protections. To the best of our knowledge, this is the first work to provide differential privacy guarantees for the Bloom filter for membership query problems.
