Efficient Bias Mitigation Without Privileged Information
Mateo Espinosa Zarlenga, Swami Sankaranarayanan, Jerone T. A. Andrews, Zohreh Shams, Mateja Jamnik, Alice Xiang
TL;DR
The paper tackles the problem of biased DNNs arising from spuriously correlated cues in the absence of group labels. It introduces Targeted Augmentations for Bias mitigation (TAB), a hyperparameter-free unsupervised pipeline that uses the full training history of a helper model to identify bias-conflicting samples via per-class loss-history clustering and to generate a group-balanced dataset for retraining. TAB achieves improved worst-group accuracy across synthetic and real-world vision tasks while maintaining competitive mean accuracy and without costly hyperparameter searches, enabling more practical deployment. The approach is simple to implement, does not require group annotations during training or validation, and reduces model-selection overhead, though it entails a double-training loop and clustering on loss histories, with opportunities for scalability and augmentation enhancements. Overall, TAB advances practical debiasing by delivering robust performance improvements in realistic resource-constrained settings.
Abstract
Deep neural networks trained via empirical risk minimisation often exhibit significant performance disparities across groups, particularly when group and task labels are spuriously correlated (e.g., "grassy background" and "cows"). Existing bias mitigation methods that aim to address this issue often either rely on group labels for training or validation, or require an extensive hyperparameter search. Such data and computational requirements hinder the practical deployment of these methods, especially when datasets are too large to be group-annotated, computational resources are limited, and models are trained through already complex pipelines. In this paper, we propose Targeted Augmentations for Bias Mitigation (TAB), a simple hyperparameter-free framework that leverages the entire training history of a helper model to identify spurious samples, and generate a group-balanced training set from which a robust model can be trained. We show that TAB improves worst-group performance without any group information or model selection, outperforming existing methods while maintaining overall accuracy.
