Fair Incentives for Repeated Engagement
Daniel Freund, Chamsi Hssaine
TL;DR
The paper addresses designing fair, unconditional monetary incentives for repeated engagement when agent retention is stochastic and heterogeneous by type. It shows that under stringent fairness constraints, a static fluid-optimal policy is asymptotically optimal in a large-market limit, while dynamic policies may outperform static ones only in discriminatory ways; a key two-reward structure enables efficient computation of the fluid optimum and yields convergence rates of $O(1/\theta)$ under mild conditions. The authors develop a deterministic relaxation (fluid model) and prove that the fluid heuristic achieves near-optimal long-run profit, with numerical experiments confirming the theory and illustrating when dispersion (lotteries) is beneficial. The work contributes to fair retention design by characterizing when fairness constraints restrict dynamic gains and by providing a fast, tractable policy construction applicable to large populations, with practical implications for loyalty programs and service platforms.
Abstract
We study a decision-maker's problem of finding optimal monetary incentive schemes for retention when faced with agents whose participation decisions (stochastically) depend on the incentive they receive. Our focus is on policies constrained to fulfill two fairness properties that preclude outcomes wherein different groups of agents experience different treatment on average. We formulate the problem as a high-dimensional stochastic optimization problem, and study it through the use of a closely related deterministic variant. We show that the optimal static solution to this deterministic variant is asymptotically optimal for the dynamic problem under fairness constraints. Though solving for the optimal static solution gives rise to a non-convex optimization problem, we uncover a structural property that allows us to design a tractable, fast-converging heuristic policy. Traditional schemes for retention ignore fairness constraints; indeed, the goal in these is to use differentiation to incentivize repeated engagement with the system. Our work (i) shows that even in the absence of explicit discrimination, dynamic policies may unintentionally discriminate between agents of different types by varying the type composition of the system, and (ii) presents an asymptotically optimal policy to avoid such discriminatory outcomes.
