Randomized algorithms for precise measurement of differentially-private, personalized recommendations
Allegra Laro, Yanqing Chen, Hao He, Babak Aghazadeh
TL;DR
This work tackles the tension between precise measurement and user privacy in personalized recommendations by moving differential privacy noise into the content-selection step rather than the measurement phase. The proposed algorithm combines private on-device scoring with DP-based selection mechanisms (RR or SNM) to allow exact payment accounting for content creators while protecting Type I user data, enabling accurate server-side records and billing. Through offline experiments on public and internal ad- auctions data, the study shows that differential privacy can yield a practical privacy-utility trade-off, with Randomized Response performing well at moderate to high privacy levels and SNM offering data-dependent advantages at low privacy. The findings suggest that privacy-preserving, precise measurement in personalized recommendations is feasible at realistic privacy parameters, and that incorporating privacy-amplification strategies or exploring adaptive DP mechanisms could further improve utility in production systems.
Abstract
Personalized recommendations form an important part of today's internet ecosystem, helping artists and creators to reach interested users, and helping users to discover new and engaging content. However, many users today are skeptical of platforms that personalize recommendations, in part due to historically careless treatment of personal data and data privacy. Now, businesses that rely on personalized recommendations are entering a new paradigm, where many of their systems must be overhauled to be privacy-first. In this article, we propose an algorithm for personalized recommendations that facilitates both precise and differentially-private measurement. We consider advertising as an example application, and conduct offline experiments to quantify how the proposed privacy-preserving algorithm affects key metrics related to user experience, advertiser value, and platform revenue compared to the extremes of both (private) non-personalized and non-private, personalized implementations.
