Enhancing Distributional Stability among Sub-populations
Jiashuo Liu, Jiayun Wu, Jie Peng, Xiaoyu Wu, Yang Zheng, Bo Li, Peng Cui
TL;DR
This work targets Out-of-Distribution generalization under latent heterogeneity by introducing distributional stability, a continuous measure of how prediction mechanisms $Y|X$ vary across sub-populations. It defines $\text{DS}_{\beta_0}$ via KL-divergence over sub-populations and derives an $(\beta_0,s)$-learnability framework with an OOD generalization bound that scales with the stability gap. Motivated by theory, the authors propose Stable Risk Minimization (SRM), a two-player optimization that alternates between finding the worst sub-population (variation explorer) and learning a stable predictor (stable learner) under a KL-based stability penalty. Empirical results on simulated selection-bias tasks and a large real-world multi-environment dataset show SRM improves both accuracy and stability under distribution shifts, supporting its applicability for robust, environment-agnostic learning in settings with latent sub-population structure.
Abstract
Enhancing the stability of machine learning algorithms under distributional shifts is at the heart of the Out-of-Distribution (OOD) Generalization problem. Derived from causal learning, recent work on invariant learning pursues strict invariance across multiple training environments. Although this goal is intuitively reasonable, learning the strict invariance property requires strong assumptions on the availability and quality of environments. In this work, we introduce the notion of "distributional stability" to mitigate these limitations. It quantifies the stability of prediction mechanisms among sub-populations down to a prescribed scale. Based on this notion, we propose a learnability assumption and derive a generalization error bound under distribution shifts. Inspired by the theoretical analyses, we propose the stable risk minimization (SRM) algorithm to enhance the model's stability w.r.t. shifts in prediction mechanisms ($Y|X$-shifts). Experimental results are consistent with our intuition and validate the effectiveness of our algorithm. The code can be found at https://github.com/LJSthu/SRM.
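
To make the alternating idea concrete, the following is a minimal sketch and not the SRM algorithm itself. It assumes a generic KL-penalized worst-case reweighting (exponential tilting of per-sample losses) as a stand-in for the adversarial step, and a weighted least-squares gradient step for the learner. The function names `worst_case_weights` and `alternating_fit`, the temperature `tau`, and the toy data with two latent sub-populations are all hypothetical choices made for illustration; the authors' actual method is in the linked repository.

```python
import numpy as np

def worst_case_weights(losses, tau):
    """Adversary step (assumed form): sample weights proportional to
    exp(loss / tau), i.e. the maximizer of the weighted loss minus
    tau * KL(weights || uniform)."""
    z = (losses - losses.max()) / tau  # subtract max for numerical stability
    w = np.exp(z)
    return w / w.sum()

def alternating_fit(X, y, tau=1.0, lr=0.1, steps=500):
    """Alternate between reweighting toward high-loss samples and taking
    one gradient step on the reweighted squared loss."""
    theta = np.zeros(X.shape[1])
    for _ in range(steps):
        residual = X @ theta - y
        w = worst_case_weights(residual ** 2, tau)  # adversary step
        grad = 2.0 * X.T @ (w * residual)           # learner step
        theta -= lr * grad
    return theta

# Toy data: two latent sub-populations with opposite Y|X mechanisms.
# The feature is binary (+1 / -1) to keep the example easy to reason about.
rng = np.random.default_rng(0)
n_major, n_minor = 900, 100
X = rng.choice([-1.0, 1.0], size=(n_major + n_minor, 1))
slope = np.concatenate([np.full(n_major, 1.0), np.full(n_minor, -1.0)])
y = slope * X[:, 0]

theta_erm = np.linalg.lstsq(X, y, rcond=None)[0]  # ordinary least squares
theta_rob = alternating_fit(X, y, tau=1.0)

def group_loss(theta, mask):
    """Mean squared error restricted to one sub-population."""
    return float(np.mean((X[mask] @ theta - y[mask]) ** 2))

minor = slope < 0
print("ERM    minority loss:", group_loss(theta_erm, minor))
print("robust minority loss:", group_loss(theta_rob, minor))
```

In this toy setting the reweighted fit trades some accuracy on the majority group for a lower loss on the minority group, which is the qualitative behaviour a worst-sub-population objective is meant to produce. The sub-population labels are used only for evaluation, not during training.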
