Spurious Correlation-Aware Embedding Regularization for Worst-Group Robustness
Subeen Park, Joowang Kim, Hakyung Lee, Sunjae Yoo, Kyungwoo Song
TL;DR
SCER addresses spurious correlations that degrade worst-group robustness under subpopulation shifts by directly regularizing the embedding space. It decomposes worst-group error into spurious and core components using group-wise mean embeddings and a Sigma-norm, then optimizes an embedding loss that penalizes spurious alignment while promoting core-aligned representations. Theoretical analysis links worst-group error to the product of alignment with spurious directions and their magnitudes, guiding the embedding regularization; empirically, SCER achieves state-of-the-art worst-group accuracy across vision and language benchmarks and remains effective when environment labels are inferred. The approach offers a practical, single-stage method for robust generalization under distribution shifts with strong cross-domain performance.
Abstract
Deep learning models achieve strong performance across various domains but often rely on spurious correlations, making them vulnerable to distribution shifts. This issue is particularly severe in subpopulation shift scenarios, where models struggle in underrepresented groups. While existing methods have made progress in mitigating this issue, their performance gains are still constrained. They lack a rigorous theoretical framework connecting the embedding space representations with worst-group error. To address this limitation, we propose Spurious Correlation-Aware Embedding Regularization for Worst-Group Robustness (SCER), a novel approach that directly regularizes feature representations to suppress spurious cues. We show theoretically that worst-group error is influenced by how strongly the classifier relies on spurious versus core directions, identified from differences in group-wise mean embeddings across domains and classes. By imposing theoretical constraints at the embedding level, SCER encourages models to focus on core features while reducing sensitivity to spurious patterns. Through systematic evaluation on multiple vision and language, we show that SCER outperforms prior state-of-the-art studies in worst-group accuracy. Our code is available at \href{https://github.com/MLAI-Yonsei/SCER}{https://github.com/MLAI-Yonsei/SCER}.
