Steering the Herd: A Framework for LLM-based Control of Social Learning
Raghu Arghal, Kevin He, Shirin Saeedi Bidokhti, Saswati Sarkar
TL;DR
We study how an information-mediating planner, such as an LLM, can strategically control the precision of private signals in a sequential social-learning setting. The model embeds this control in a dynamic programming framework with Bayesian belief updates, analyzing altruistic versus biased planners and proving the convexity of the altruistic value function along with a structured, threshold-based policy characterization. The biased planner can, in some regimes, obfuscate signals to steer actions, with substantial welfare implications depending on alignment with agent goals. Empirical simulations using LLMs show that planners exhibit near-optimal strategic reasoning and emergent behavior consistent with the theory, while non-Bayesian agent biases can both distort learning and be mitigated by alignment-aware mediation. Overall, the work provides a tractable foundation for understanding and regulating LLM-based information mediators in social learning environments.
Abstract
Algorithms increasingly serve as information mediators--from social media feeds and targeted advertising to the increasing ubiquity of LLMs. This engenders a joint process where agents combine private, algorithmically-mediated signals with learning from peers to arrive at decisions. To study such settings, we introduce a model of controlled sequential social learning in which an information-mediating planner (e.g. an LLM) controls the information structure of agents while they also learn from the decisions of earlier agents. The planner may seek to improve social welfare (altruistic planner) or to induce a specific action the planner prefers (biased planner). Our framework presents a new optimization problem for social learning that combines dynamic programming with decentralized action choices and Bayesian belief updates. We prove the convexity of the value function and characterize the optimal policies of altruistic and biased planners, which attain desired tradeoffs between the costs they incur and the payoffs they earn from induced agent choices. Notably, in some regimes the biased planner intentionally obfuscates the agents' signals. Even under stringent transparency constraints--information parity with individuals, no lying or cherry-picking, and full observability--we show that information mediation can substantially shift social welfare in either direction. We complement our theory with simulations in which LLMs act as both planner and agents. Notably, the LLM planner in our simulations exhibits emergent strategic behavior in steering public opinion that broadly mirrors the trends predicted, though key deviations suggest the influence of non-Bayesian reasoning consistent with the cognitive patterns of both humans and LLMs trained on human-like data. Together, we establish our framework as a tractable basis for studying the impact and regulation of LLM information mediators.
