Table of Contents
Fetching ...

Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift

Jiayun Wu, Jiashuo Liu, Peng Cui, Zhiwei Steven Wu

TL;DR

This work links multicalibration to out-of-distribution generalization by extending the calibration notion to joint grouping functions over covariates and outcomes, enabling robustness beyond covariate shift. It establishes theoretical connections to invariance (IRM) under concept shift, characterizes the maximal grouping function space as a linear span of density ratios, and decomposes grouping functions to balance accuracy and invariance. A post-processing algorithm, MC-PseudoLabel, is proposed to achieve extended multicalibration and invariant predictions with lightweight optimization and convergence guarantees in Gaussian regimes. Empirical evaluation on real-world distribution-shift datasets demonstrates strong performance advantages and robust generalization under distribution shifts. The framework offers a practical pathway for robust regression under OOD conditions and suggests avenues for extending to classification tasks.

Abstract

We establish a new model-agnostic optimization framework for out-of-distribution generalization via multicalibration, a criterion that ensures a predictor is calibrated across a family of overlapping groups. Multicalibration is shown to be associated with robustness of statistical inference under covariate shift. We further establish a link between multicalibration and robustness for prediction tasks both under and beyond covariate shift. We accomplish this by extending multicalibration to incorporate grouping functions that consider covariates and labels jointly. This leads to an equivalence of the extended multicalibration and invariance, an objective for robust learning in existence of concept shift. We show a linear structure of the grouping function class spanned by density ratios, resulting in a unifying framework for robust learning by designing specific grouping functions. We propose MC-Pseudolabel, a post-processing algorithm to achieve both extended multicalibration and out-of-distribution generalization. The algorithm, with lightweight hyperparameters and optimization through a series of supervised regression steps, achieves superior performance on real-world datasets with distribution shift.

Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift

TL;DR

This work links multicalibration to out-of-distribution generalization by extending the calibration notion to joint grouping functions over covariates and outcomes, enabling robustness beyond covariate shift. It establishes theoretical connections to invariance (IRM) under concept shift, characterizes the maximal grouping function space as a linear span of density ratios, and decomposes grouping functions to balance accuracy and invariance. A post-processing algorithm, MC-PseudoLabel, is proposed to achieve extended multicalibration and invariant predictions with lightweight optimization and convergence guarantees in Gaussian regimes. Empirical evaluation on real-world distribution-shift datasets demonstrates strong performance advantages and robust generalization under distribution shifts. The framework offers a practical pathway for robust regression under OOD conditions and suggests avenues for extending to classification tasks.

Abstract

We establish a new model-agnostic optimization framework for out-of-distribution generalization via multicalibration, a criterion that ensures a predictor is calibrated across a family of overlapping groups. Multicalibration is shown to be associated with robustness of statistical inference under covariate shift. We further establish a link between multicalibration and robustness for prediction tasks both under and beyond covariate shift. We accomplish this by extending multicalibration to incorporate grouping functions that consider covariates and labels jointly. This leads to an equivalence of the extended multicalibration and invariance, an objective for robust learning in existence of concept shift. We show a linear structure of the grouping function class spanned by density ratios, resulting in a unifying framework for robust learning by designing specific grouping functions. We propose MC-Pseudolabel, a post-processing algorithm to achieve both extended multicalibration and out-of-distribution generalization. The algorithm, with lightweight hyperparameters and optimization through a series of supervised regression steps, achieves superior performance on real-world datasets with distribution shift.
Paper Structure (29 sections, 30 theorems, 116 equations, 4 figures, 4 tables, 1 algorithm)

This paper contains 29 sections, 30 theorems, 116 equations, 4 figures, 4 tables, 1 algorithm.

Key Result

Theorem 2.3

For a source measure $P_S(X,Y)$ and a set of probability measures $\mathcal{P}(X)$ containing $P_S(X)$, given a predictor $f:\mathcal{X} \rightarrow [0,1]$ with finite range $m:=|\text{Range}(f)|$, consider a grouping function class $\mathcal{H}$ closed under affine transformation and satisfying Ass

Figures (4)

  • Figure 1: Accuracy-on-the-line beyond covariate shift: correlation between models' in-distribution and out-of-distribution risks on VesselPower.
  • Figure 2: Results (RMSE) on the synthetic dataset. Curves show method performances across hyperparameters. Methods without extra hyperparameters are marked with dotted lines.
  • Figure 3: Evolution of pseudolabels during MC-Pseudolabel. The first row plots values of pseudolabels against model predictions. The second row plots values of pseudolabels against $V$. Columns represent different snapshots during optimization.
  • Figure 4: Correlation between models' in-distribution and out-of-distribution risks on VesselPower.

Theorems & Definitions (59)

  • Definition 2.1: Multicalibration with Joint Grouping Functions
  • Theorem 2.3: Risk Bound under Covariate Shift
  • Remark 2.4
  • Theorem 3.1: Risk Bound under Concept Shift
  • Definition 3.2: Invariant Predictor
  • Remark 3.3
  • Theorem 3.4: Equivalence of Multicalibration and Invariance
  • Remark 3.5
  • Proposition 4.1: Maximal Grouping Function Class
  • Theorem 4.2: Spanning Set
  • ...and 49 more