Purifying Approximate Differential Privacy with Randomized Post-processing
Yingyu Lin, Erchi Wang, Yi-An Ma, Yu-Xiang Wang
TL;DR
This work introduces purification, a framework that converts $(\varepsilon,\delta)$-DP mechanisms into $(\varepsilon+\varepsilon')$-pure DP mechanisms via randomized post-processing with calibrated Laplace noise, under domain-geometry assumptions. The approach relies on a conversion from total variation (TV) distance to $W_{\infty}$ distance and on a uniform-mixing step, which together yield pure-DP guarantees with near-optimal utility across tasks such as DP-ERM, propose-test-release (PTR), mode release, and query release. The authors provide concrete purified algorithms (DP-SGD, DP-Frank-Wolfe, PTR purification, pure-DP mode release, AdaSSP-based regression, and MWEM-based query release) whose rates match or nearly match the best known pure-DP rates, and show how purification can be used to derive lower bounds for approximate DP via contrapositive arguments. Overall, purification offers a practical blueprint for designing pure-DP algorithms: first run an approximate-DP routine, then apply structured post-processing to remove $\delta$ while preserving utility. It thereby broadens the toolkit for pure-DP algorithm design and gives a principled way to translate established approximate-DP methods into $\delta = 0$ guarantees with quantifiable utility.
Abstract
We propose a framework to convert $(\varepsilon, \delta)$-approximate Differential Privacy (DP) mechanisms into $(\varepsilon', 0)$-pure DP mechanisms under certain conditions, a process we call "purification." This algorithmic technique leverages randomized post-processing with calibrated noise to eliminate the $\delta$ parameter while achieving a near-optimal privacy-utility tradeoff for pure DP. It enables a new design strategy for pure DP algorithms: first run an approximate DP algorithm that satisfies these conditions, and then purify. This approach allows one to leverage techniques that require $\delta > 0$, such as strong composition and propose-test-release, when designing pure-DP methods with $\delta = 0$. We apply this framework in various settings, including Differentially Private Empirical Risk Minimization (DP-ERM), stability-based release, and query release tasks. To the best of our knowledge, this is the first work with a statistically and computationally efficient reduction from approximate DP to pure DP. Finally, we illustrate the use of this reduction for proving lower bounds under approximate DP constraints with explicit dependence on $\delta$, avoiding the sophisticated fingerprinting code construction.
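The shape of the post-processing described above (mix the approximate-DP output with a uniform draw from the domain, then add Laplace noise) can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the function name `purify_sketch`, the box-shaped domain `[lo, hi]^d`, the order of the two steps, and the parameters `omega` and `laplace_scale` are all assumptions made for illustration. In particular, the calibration of the noise scale that yields a pure-DP guarantee is not stated in this summary, so it is left as a caller-supplied placeholder, and the sketch on its own carries no privacy guarantee.

```python
import numpy as np


def purify_sketch(x, lo, hi, omega, laplace_scale, rng):
    """Illustrative mix-then-noise post-processing (hypothetical helper).

    x             : output of an approximate-DP mechanism, assumed to lie in [lo, hi]^d
    lo, hi        : bounds of an assumed box-shaped domain
    omega         : assumed mixing probability in [0, 1]
    laplace_scale : placeholder noise scale; its calibration is NOT derived here
    rng           : a numpy.random.Generator
    """
    x = np.asarray(x, dtype=float)

    # Uniform-mixing step: with probability omega, replace the output
    # by a uniform draw from the (assumed) box domain.
    if rng.random() < omega:
        x = rng.uniform(lo, hi, size=x.shape)

    # Additive Laplace noise, independent per coordinate.
    noise = rng.laplace(loc=0.0, scale=laplace_scale, size=x.shape)
    return x + noise


rng = np.random.default_rng(0)
approx_dp_output = np.array([0.2, -0.4, 0.7])  # stand-in for an approximate-DP release
released = purify_sketch(approx_dp_output, lo=-1.0, hi=1.0,
                         omega=0.05, laplace_scale=0.1, rng=rng)
```

Because both steps act only on the mechanism's output and never touch the raw data, the sketch is a pure post-processing step; the privacy analysis lies entirely in how `omega` and `laplace_scale` are chosen.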
