Multigroup Robustness
Lunjia Hu, Charlotte Peale, Judy Hanwen Shen
TL;DR
This work introduces multigroup robustness, a fine-grained notion that ensures a learning model’s degradation in predictions on any subpopulation is controlled by the amount of corruption within that subpopulation. It connects multigroup robustness to multiaccuracy and provides a practical post-processing approach that augments arbitrary learners with both multigroup robustness and multiaccuracy guarantees while preserving accuracy. The authors prove that multiaccuracy can yield multigroup robustness under distribution shift, and, with uniform convergence, extend robustness to stronger adversaries; they also show lower bounds establishing MA as a necessary property for non-trivial MR. Empirically, standard models exhibit vulnerability to simple subpopulation attacks, while the proposed post-processing method restores robustness without sacrificing performance, illustrating the practical impact for deployment in diverse subpopulations. Overall, the work bridges fairness and robustness by leveraging MA-based guarantees to achieve subgroup-protected robustness in realistic, non-i.i.d. data settings.
Abstract
To address the shortcomings of real-world datasets, robust learning algorithms have been designed to overcome arbitrary and indiscriminate data corruption. However, practical processes of gathering data may lead to patterns of data corruption that are localized to specific partitions of the training dataset. Motivated by critical applications where the learned model is deployed to make predictions about people from a rich collection of overlapping subpopulations, we initiate the study of multigroup robust algorithms whose robustness guarantees for each subpopulation only degrade with the amount of data corruption inside that subpopulation. When the data corruption is not distributed uniformly over subpopulations, our algorithms provide more meaningful robustness guarantees than standard guarantees that are oblivious to how the data corruption and the affected subpopulations are related. Our techniques establish a new connection between multigroup fairness and robustness.
