Naturally Private Recommendations with Determinantal Point Processes
Jack Fitzsimons, Agustín Freitas Pasqualini, Robert Pisarczyk, Dmitrii Usynin
TL;DR
The work investigates whether determinantal point processes (DPPs), which inherently balance popularity and diversity, can provide natural privacy protection without explicit noise. It casts DPP sampling as an instance of the exponential mechanism by relating eigenvalue-based scores to $u(\lambda)=\log(\lambda)$ and derives a bound $\Delta_u = n \log\left(1 + \dfrac{\Delta_L}{\sigma \sqrt{n}}\right)$ under a jitter $\sigma$, leading to $\epsilon$-privacy for the eigenvalue step via $\epsilon = 2\Delta_u$. It further analyzes eigenvector-based sampling, invoking the Davis–Kahan theorem to relate perturbations to eigengap, and notes that the total privacy budget also depends on the subset-construction step, which remains to be quantified. The study highlights practical limitations, such as dependence on eigenvalue separation and potentially large privacy budgets, and proposes integrating DPPs with standard DP techniques (e.g., SVT) to improve the privacy-utility balance in future work.
Abstract
Often we consider machine learning models or statistical analysis methods which we endeavour to alter, by introducing a randomized mechanism, to make the model conform to a differential privacy constraint. However, certain models can often be implicitly differentially private or require significantly fewer alterations. In this work, we discuss Determinantal Point Processes (DPPs) which are dispersion models that balance recommendations based on both the popularity and the diversity of the content. We introduce DPPs, derive and discuss the alternations required for them to satisfy epsilon-Differential Privacy and provide an analysis of their sensitivity. We conclude by proposing simple alternatives to DPPs which would make them more efficient with respect to their privacy-utility trade-off.
