Intuitions of Compromise: Utilitarianism vs. Contractualism
Jared Moore, Yejin Choi, Sydney Levine
TL;DR
The paper questions which value-aggregation rule yields intuitively plausible compromises when groups differ in what they value, by contrasting the Utilitarian Sum with the Nash Product under a broad, systematically generated set of scenarios. It introduces precise visual aids (area and volume charts) to communicate the proposals and conducts parallel assessments with humans and GPT-family LLMs, revealing a robust human preference for the Nash Product in disagreement cases and varying alignment in agreement cases. The results challenge the default utilitarian approach in AI alignment and decision-support contexts and suggest contractualist intuitions may better capture lay intuitions about fair trade-offs, especially when accompanied by effective visual communication. The work also highlights limitations and variability across LLMs, underscoring the need for careful design of AI-assisted value aggregation tools and further exploration of contractualist mechanisms beyond the Nash Product.
Abstract
What is the best compromise in a situation where different people value different things? The most commonly accepted method for answering this question -- in fields across the behavioral and social sciences, decision theory, philosophy, and artificial intelligence development -- is simply to add up utilities associated with the different options and pick the solution with the largest sum. This ``utilitarian'' approach seems like the obvious, theory-neutral way of approaching the problem. But there is an important, though often-ignored, alternative: a ``contractualist'' approach, which advocates for an agreement-driven method of deciding. Remarkably, no research has presented empirical evidence directly comparing the intuitive plausibility of these two approaches. In this paper, we systematically explore the proposals suggested by each algorithm (the ``Utilitarian Sum'' and the contractualist ''Nash Product''), using a paradigm that applies those algorithms to aggregating preferences across groups in a social decision-making context. While the dominant approach to value aggregation up to now has been utilitarian, we find that people strongly prefer the aggregations recommended by the contractualist algorithm. Finally, we compare the judgments of large language models (LLMs) to that of our (human) participants, finding important misalignment between model and human preferences.
