Robust Decision Aggregation with Adversarial Experts
Yongkang Guo, Yuqing Kong
TL;DR
This work develops a robust, minimax framework for aggregating reports from truthful and adversarial experts in binary and multi-state settings. It establishes that under L1 loss the $k$-truncated mean is optimal for soft forecasts, with regret scaling linearly in the adversary ratio $\gamma$; under L2 loss the optimal soft aggregators are piecewise linear, and for hard decisions a randomized truncation rule (random dictator) is optimal. The theory yields closed-form solutions in the adversarial L2 setting for small $\gamma$, including a hard sigmoid with separators at $\{k,n-k\}$, and extends to the multi-state case via a $k$-ignorance random dictator. Numerical experiments on CIFAR-10 ensemble classification confirm practical gains over majority voting and averaging, and the results reveal how adversaries qualitatively alter optimal aggregation rules. The paper also highlights limitations in the general information-structure setting, offering negative results and outlining directions for future work in forecast aggregation and broader information spaces.
Abstract
We consider a robust aggregation problem in the presence of both truthful and adversarial experts. The truthful experts will report their private signals truthfully, while the adversarial experts can report arbitrarily. We assume experts are marginally symmetric in the sense that they share the same common prior and marginal posteriors. The rule maker needs to design an aggregator to predict the true world state from these experts' reports, without knowledge of the underlying information structures or adversarial strategies. We aim to find the optimal aggregator that outputs a forecast minimizing regret under the worst information structure and adversarial strategies. The regret is defined by the difference in expected loss between the aggregator and a benchmark who aggregates optimally given the information structure and reports of truthful experts. We focus on binary states and reports. Under L1 loss, we show that the truncated mean aggregator is optimal. When there are at most k adversaries, this aggregator discards the k lowest and highest reported values and averages the remaining ones. For L2 loss, the optimal aggregators are piecewise linear functions. All the optimalities hold when the ratio of adversaries is bounded above by a value determined by the experts' priors and posteriors. The regret only depends on the ratio of adversaries, not on their total number. For hard aggregators that output a decision, we prove that a random version of the truncated mean is optimal for both L1 and L2. This aggregator randomly follows a remaining value after discarding the $k$ lowest and highest reported values. We extend the hard aggregator to multi-state setting. We evaluate our aggregators numerically in an ensemble learning task. We also obtain negative results for general adversarial aggregation problems under broader information structures and report spaces.
