Language Agents as Digital Representatives in Collective Decision-Making
Daniel Jarrett, Miruna Pîslar, Michiel A. Bakker, Michael Henry Tessler, Raphael Köster, Jan Balaguer, Romuald Elie, Christopher Summerfield, Andrea Tacchetti
TL;DR
The paper addresses representing individual human preferences in collective decision-making through digital proxies. It formalizes the framework with a social choice function $h$, a mediation mechanism $\tau$, outcome mapping $f$, and participant utilities $g_i$, and introduces a value-based notion of representativity via the Bellman operator $\mathbb{B}_{\pi,\tau}$ and value function $Q_{\pi,\tau}^t$ to compare true and proxy policies. It proposes three levels of equivalence—conditional cloning, transition-level equivalence, and trajectory-level equivalence—and argues that trajectory-level, value-aware equivalence best captures representativity. Through a consensus-finding case study using a large UK-demographic dataset and fine-tuned 1B and 30B adaptations of the Chinchilla LLM, the authors show that larger, fine-tuned digital representatives can closely reproduce ground-truth critiques and preserve consensus payoffs. The work demonstrates the feasibility of scalable simulations for mechanism design and policy deliberation while emphasizing the need for human validation and careful interpretation of simulators for real-world deployment.
Abstract
Consider the process of collective decision-making, in which a group of individuals interactively select a preferred outcome from among a universe of alternatives. In this context, "representation" is the activity of making an individual's preferences present in the process via participation by a proxy agent -- i.e. their "representative". To this end, learned models of human behavior have the potential to fill this role, with practical implications for multi-agent scenario studies and mechanism design. In this work, we investigate the possibility of training \textit{language agents} to behave in the capacity of representatives of human agents, appropriately expressing the preferences of those individuals whom they stand for. First, we formalize the setting of \textit{collective decision-making} -- as the episodic process of interaction between a group of agents and a decision mechanism. On this basis, we then formalize the problem of \textit{digital representation} -- as the simulation of an agent's behavior to yield equivalent outcomes from the mechanism. Finally, we conduct an empirical case study in the setting of \textit{consensus-finding} among diverse humans, and demonstrate the feasibility of fine-tuning large language models to act as digital representatives.
