Harm Mitigation in Recommender Systems under User Preference Dynamics
Jerry Chee, Shankar Kalyanaraman, Sindhu Kiranmai Ernala, Udi Weinsberg, Sarah Dean, Stratis Ioannidis
TL;DR
This paper tackles harm mitigation in recommender systems where user preferences evolve under exposure to recommendations. It introduces a dynamic attraction-based model that couples recommendation decisions to steady-state user profiles, producing a non-convex optimization problem for policy design. A contraction-based fixed-point analysis guarantees a unique stationary profile, and gradient-based methods via the implicit-function theorem enable effective policy optimization at stationarity. Empirical results on semi-synthetic MovieLens data show gradient-based policies achieve substantial improvements in the CTR–harm tradeoff, outperforming baselines by up to 77% and illustrating the importance of incorporating user dynamics into harm mitigation strategies.
Abstract
We consider a recommender system that takes into account the interplay between recommendations, the evolution of user interests, and harmful content. We model the impact of recommendations on user behavior, particularly the tendency to consume harmful content. We seek recommendation policies that establish a tradeoff between maximizing click-through rate (CTR) and mitigating harm. We establish conditions under which the user profile dynamics have a stationary point, and propose algorithms for finding an optimal recommendation policy at stationarity. We experiment on a semi-synthetic movie recommendation setting initialized with real data and observe that our policies outperform baselines at simultaneously maximizing CTR and mitigating harm.
