Differential Privacy with Multiple Selections
Ashish Goel, Zhihao Jiang, Aleksandra Korolova, Kamesh Munagala, Sahasrajit Sarmasarkar
TL;DR
This work introduces a multi-selection architecture for differential privacy, where a server returns a small set of candidate results and the user privately chooses the best match. It proves that, on a one-dimensional domain under $\epsilon$-geographic DP, the optimal user action is to add Laplace noise with scale $1/\epsilon$, and the utility loss decreases as $O(1/(\epsilon k))$ when the disutility is the identity. The authors develop a novel infinite-dimensional linear-programming framework, Differential-Integral Linear Programs (DILPs), along with a weak-duality theory and a dual-fitting technique to certify the optimality of Laplace noise and to derive server response strategies. They also extend the analysis to generalized GDP and discuss high-dimensional extensions with PCA, showing that Laplace noise retains optimality under broad conditions. Overall, the paper provides a tight privacy-utility trade-off for multi-selection DP and a rigorous optimization toolkit that could inform practical privacy-preserving retrieval and recommendation systems.
Abstract
We consider the setting where a user with sensitive features wishes to obtain a recommendation from a server in a differentially private fashion. We propose a ``multi-selection'' architecture where the server can send back multiple recommendations and the user chooses one from these that matches best with their private features. When the user feature is one-dimensional -- on an infinite line -- and the accuracy measure is defined w.r.t some increasing function $\mathfrak{h}(.)$ of the distance on the line, we precisely characterize the optimal mechanism that satisfies differential privacy. The specification of the optimal mechanism includes both the distribution of the noise that the user adds to its private value, and the algorithm used by the server to determine the set of results to send back as a response and further show that Laplace is an optimal noise distribution. We further show that this optimal mechanism results in an error that is inversely proportional to the number of results returned when the function $\mathfrak{h}(.)$ is the identity function.
