Hedging and Approximate Truthfulness in Traditional Forecasting Competitions
Mary Monroe, Anish Thilagar, Melody Hsu, Rafael Frongillo
TL;DR
This paper analyzes the traditional Simple Max mechanism for forecasting contests across multiple events and formalizes incentive issues. Using a geometric view with the quadratic score $S(r,y)=1-(r-y)^2$, it shows that long-run truthfulness can fail: even a leading forecaster may benefit from hedging toward others’ reports. It then develops a positive result in a two-forecaster regime with sufficient uncertainty, proving approximate truthfulness via approximate affineness of the utility and Edgeworth expansions, yielding a rate $\gamma=O(m^{-1/4})$ for the distance to truth. The results have practical implications for leaderboard design and suggest avenues such as truncated/soft-max alternatives to mitigate extremizing incentives, along with directions for extending the theory to more general settings.
Abstract
In forecasting competitions, the traditional mechanism scores the predictions of each contestant against the outcome of each event, and the contestant with the highest total score wins. While it is well-known that this traditional mechanism can suffer from incentive issues, it is folklore that contestants will still be roughly truthful as the number of events grows. Yet thus far the literature lacks a formal analysis of this traditional mechanism. This paper gives the first such analysis. We first demonstrate that the ''long-run truthfulness'' folklore is false: even for arbitrary numbers of events, the best forecaster can have an incentive to hedge, reporting more moderate beliefs to increase their win probability. On the positive side, however, we show that two contestants will be approximately truthful when they have sufficient uncertainty over the relative quality of their opponent and the outcomes of the events, a case which may arise in practice.
