How Global Calibration Strengthens Multiaccuracy
Sílvia Casacuberta, Parikshit Gopalan, Varun Kanade, Omer Reingold
TL;DR
This work analyzes the power of multiaccuracy as a learning primitive and shows that, while multiaccuracy alone can be weak, pairing it with global calibration yields strong learning guarantees and optimal hardcore distributions. The authors establish precise results: calibrated multiaccuracy enables strong agnostic learning; multiaccuracy alone yields restricted weak learning only when the target correlation exceeds $1/2$; and combining calibration with weighted multiaccuracy achieves hardcore densities of $2\delta$ with favorable oracle complexity $q=O(1/(\varepsilon^2\delta^2))$, matching the best-known density bounds while staying computationally efficient. The results illuminate the complementary roles of multiaccuracy and calibration and explain why their combination yields substantially stronger guarantees than either notion alone. Practically, this suggests that implementing calibration alongside standard multiaccuracy can unlock the strengths of multicalibration at lower cost, with implications for fair prediction, robustness, and hardness amplification in learning systems.
Abstract
Multiaccuracy and multicalibration are multigroup fairness notions for prediction that have found numerous applications in learning and computational complexity. They can be achieved from a single learning primitive: weak agnostic learning. Here we investigate the power of multiaccuracy as a learning primitive, both with and without the additional assumption of calibration. We find that multiaccuracy in itself is rather weak, but that the addition of global calibration (this notion is called calibrated multiaccuracy) boosts its power substantially, enough to recover implications that were previously known only assuming the stronger notion of multicalibration. We give evidence that multiaccuracy might not be as powerful as standard weak agnostic learning, by showing that there is no way to post-process a multiaccurate predictor to get a weak learner, even assuming the best hypothesis has correlation $1/2$. Rather, we show that it yields a restricted form of weak agnostic learning, which requires some concept in the class to have correlation greater than $1/2$ with the labels. However, by also requiring the predictor to be calibrated, we recover not just weak, but strong agnostic learning. A similar picture emerges when we consider the derivation of hardcore measures from predictors satisfying multigroup fairness notions. On the one hand, while multiaccuracy only yields hardcore measures of density half the optimal, we show that (a weighted version of) calibrated multiaccuracy achieves optimal density. Our results yield new insights into the complementary roles played by multiaccuracy and calibration in each setting. They shed light on why multiaccuracy and global calibration, although not particularly powerful by themselves, together yield considerably stronger notions.
