Optimal Policy Design for Repeated Decision-Making under Social Influence
Chiara Ravazzi, Valentina Breschi, Paolo Frasca, Fabrizio Dabbene, Mara Tanelli
TL;DR
This work extends the Friedkin–Johnsen opinion dynamics model by incorporating random uncontrollable perturbations and external budgeted nudges, while enforcing a time-scale separation between rapid individual decisions and slower social imitation. It establishes ergodicity and convergence to a steady-state mean $x^*=(I-\Lambda \bar P)^{-1}(I-\Lambda)\eta^0$, and shows that $E[y(t)]=x^*$ in the long run, enabling model-based policy design. Two policy classes are developed: (i) constant nudges (MFCCP and MBCCP) that optimize asymptotic acceptance via a quadratic program, and (ii) a Model Predictive Control framework that handles transients, with both oracle and estimated-mean variants. Numerical results on a 100-node modular network demonstrate that adaptive MPC approaches substantially improve acceptance of a target action under budget constraints, with performance depending on the time-scale separation parameter $\alpha$ and network topology.
Abstract
In this paper, we present a novel model to characterize individual tendencies in repeated decision-making scenarios, with the goal of designing model-based control strategies that promote virtuous choices amidst social and external influences. Our approach builds on the classical Friedkin and Johnsen model of social influence, extending it to include random factors (e.g., inherent variability in individual needs) and controllable external inputs. We explicitly account for the temporal separation between two processes that shape opinion dynamics: individual decision-making and social imitation. While individual decisions occur at regular, frequent intervals, the influence of social imitation unfolds over longer periods. The inclusion of random factors naturally leads to dynamics that do not converge in the classical sense. However, under specific conditions, we prove that opinions exhibit ergodic behavior. Building on this result, we propose a constrained asymptotic optimal control problem designed to foster, on average, social acceptance of a target action within a network. To address the transient dynamics of opinions, we reformulate this problem within a Model Predictive Control (MPC) framework. Simulations highlight the significance of accounting for these transient effects in steering individuals toward virtuous choices while managing policy costs.
