Mitigating Spurious Correlation via Distributionally Robust Learning with Hierarchical Ambiguity Sets
Sung Ho Jo, Seonghwi Kim, Minwoo Chae
TL;DR
This work tackles spurious correlations under distribution shifts by introducing a hierarchical distributionally robust optimization framework. By formulating a two-level ambiguity set over inter-group and intra-group distributions via a latent-space $W_ty$ distance, the method generalizes Group DRO and standard DRO to account for minority-group shifts. An efficient iterative training algorithm updates latent perturbations, group proportions, and model parameters, with theoretical convergence guarantees. Empirical results on CMNIST, Waterbirds, and CelebA under both original and minority-group shifted distributions show improved worst-group robustness while preserving strong performance on standard benchmarks, highlighting practical robustness advantages in real-world settings.
Abstract
Conventional supervised learning methods are often vulnerable to spurious correlations, particularly under distribution shifts in test data. To address this issue, several approaches, most notably Group DRO, have been developed. While these methods are highly robust to subpopulation or group shifts, they remain vulnerable to intra-group distributional shifts, which frequently occur in minority groups with limited samples. We propose a hierarchical extension of Group DRO that addresses both inter-group and intra-group uncertainties, providing robustness to distribution shifts at multiple levels. We also introduce new benchmark settings that simulate realistic minority group distribution shifts-an important yet previously underexplored challenge in spurious correlation research. Our method demonstrates strong robustness under these conditions-where existing robust learning methods consistently fail-while also achieving superior performance on standard benchmarks. These results highlight the importance of broadening the ambiguity set to better capture both inter-group and intra-group distributional uncertainties.
