FairDropout: Using Example-Tied Dropout to Enhance Generalization of Minority Groups
Geraldin Nanfack, Eugene Belilovsky
TL;DR
The paper addresses the problem of minority-group generalization under spurious correlations in deep networks trained with empirical risk minimization. It introduces FairDropout, an example-tied dropout that allocates memorizing neurons per example during training and drops them at inference, with tunable probabilities $p_ ext{gen}$ and $p_ ext{mem}$ and flexible placement in large architectures. Across vision, language, and medical benchmarks, FairDropout reduces reliance on spurious features and improves worst-group accuracy without requiring training-time group labels, achieving strong gains on datasets like MultiNLI and MIMIC-CXR. The approach highlights a practical, scalable mechanism to mitigate minority-group overfitting by rechanneling memorization away from decision boundaries. Limitations include potential beneficial memorization in some contexts and the need for further study of memorization-generalization interactions.
Abstract
Deep learning models frequently exploit spurious features in training data to achieve low training error, often resulting in poor generalization when faced with shifted testing distributions. To address this issue, various methods from imbalanced learning, representation learning, and classifier recalibration have been proposed to enhance the robustness of deep neural networks against spurious correlations. In this paper, we observe that models trained with empirical risk minimization tend to generalize well for examples from the majority groups while memorizing instances from minority groups. Building on recent findings that show memorization can be localized to a limited number of neurons, we apply example-tied dropout as a method we term FairDropout, aimed at redirecting this memorization to specific neurons that we subsequently drop out during inference. We empirically evaluate FairDropout using the subpopulation benchmark suite encompassing vision, language, and healthcare tasks, demonstrating that it significantly reduces reliance on spurious correlations, and outperforms state-of-the-art methods.
