Joint Scoring Rules: Zero-Sum Competition Avoids Performative Prediction
Rubi Hudson
TL;DR
The paper tackles performative prediction when eliciting conditional predictions from multiple agents and shows that single symmetric scoring rules cannot deterministically realize the principal's top choice. It proposes a joint zero-sum evaluation among predictors combined with optimistic-max decision making, proving a quasi-strictly proper mechanism that yields honest reporting and identification of the principal's preferred action $a^*$; a uniqueness result argues that zero-sum is essential. The authors demonstrate efficient action search using $O(\log|\mathcal{A}|)$ comparisons and extend the framework to stochastic decision rules, including mean-max and random-max variants, while preserving incentives. Empirical results in a toy environment show that zero-sum training increases predictive accuracy and principal utility and can erase learned performativity, supporting the practical viability of the approach for safer AI systems and Oracle AI applications. Overall, the work provides both theoretical foundations and initial empirical validation for using zero-sum joint scoring to safely harness conditional predictions in decision-making under potential performativity.
Abstract
In a decision-making scenario, a principal could use conditional predictions from an expert agent to inform their choice. However, this approach would introduce a fundamental conflict of interest. An agent optimizing for predictive accuracy is incentivized to manipulate their principal towards more predictable actions, which prevents that principal from being able to deterministically select their true preference. We demonstrate that this impossibility result can be overcome through the joint evaluation of multiple agents. When agents are made to engage in zero-sum competition, their incentive to influence the action taken is eliminated, and the principal can identify and take the action they most prefer. We further prove that this zero-sum setup is unique, efficiently implementable, and applicable under stochastic choice. Experiments in a toy environment demonstrate that training on a zero-sum objective significantly enhances both predictive accuracy and principal utility, and can eliminate previously learned manipulative behavior.
