TRACE: Theoretical Risk Attribution under Covariate-shift Effects

Hosein Anjidani; S. Yahya S. R. Tehrani; Mohammad Mahdi Mojahedian; Mohammad Hossein Yassaee

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

Hosein Anjidani, S. Yahya S. R. Tehrani, Mohammad Mahdi Mojahedian, Mohammad Hossein Yassaee

TL;DR

TRACE introduces a principled, computable attribution framework for the risk change |ΔR| that occurs when replacing a source-trained model with one trained on covariate-shifted data. It decomposes |ΔR| into four interpretable components—source/generalization gap, target/generalization gap, model-change penalty, and covariate-shift penalty—and instantiates each with data-driven proxies based on Optimal Transport or Maximum Mean Discrepancy, feature-space transport, and high-quantile gradient norms. The bound is integrated into a practical diagnostic and deployment gate, validated on synthetic and DomainNet vision benchmarks with strong monotonic relationships to true risk degradation and near-perfect gating performance. The approach supports safe, label-efficient model replacement by offering actionable diagnostics that separate geometric data shift from algorithmic retraining effects. The framework is demonstrated theoretically in ridge regression and empirically across synthetic and real-world shifts, underscoring its relevance for anchor-aware deployment decisions and broader risk-sensitive ML workflows.

Abstract

When a source-trained model $Q$ is replaced by a model $\tilde{Q}$ trained on shifted data, its performance on the source domain can change unpredictably. To address this, we study the two-model risk change, $ΔR := R_P(\tilde{Q}) - R_P(Q)$, under covariate shift. We introduce TRACE (Theoretical Risk Attribution under Covariate-shift Effects), a framework that decomposes $|ΔR|$ into an interpretable upper bound. This decomposition disentangles the risk change into four actionable factors: two generalization gaps, a model change penalty, and a covariate shift penalty, transforming the bound into a powerful diagnostic tool for understanding why performance has changed. To make TRACE a fully computable diagnostic, we instantiate each term. The covariate shift penalty is estimated via a model sensitivity factor (from high-quantile input gradients) and a data-shift measure; we use feature-space Optimal Transport (OT) by default and provide a robust alternative using Maximum Mean Discrepancy (MMD). The model change penalty is controlled by the average output distance between the two models on the target sample. Generalization gaps are estimated on held-out data. We validate our framework in an idealized linear regression setting, showing the TRACE bound correctly captures the scaling of the true risk difference with the magnitude of the shift. Across synthetic and vision benchmarks, TRACE diagnostics are valid and maintain a strong monotonic relationship with the true performance degradation. Crucially, we derive a deployment gate score that correlates strongly with $|ΔR|$ and achieves high AUROC/AUPRC for gating decisions, enabling safe, label-efficient model replacement.

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

TL;DR

Abstract

When a source-trained model

is replaced by a model

trained on shifted data, its performance on the source domain can change unpredictably. To address this, we study the two-model risk change,

, under covariate shift. We introduce TRACE (Theoretical Risk Attribution under Covariate-shift Effects), a framework that decomposes

into an interpretable upper bound. This decomposition disentangles the risk change into four actionable factors: two generalization gaps, a model change penalty, and a covariate shift penalty, transforming the bound into a powerful diagnostic tool for understanding why performance has changed. To make TRACE a fully computable diagnostic, we instantiate each term. The covariate shift penalty is estimated via a model sensitivity factor (from high-quantile input gradients) and a data-shift measure; we use feature-space Optimal Transport (OT) by default and provide a robust alternative using Maximum Mean Discrepancy (MMD). The model change penalty is controlled by the average output distance between the two models on the target sample. Generalization gaps are estimated on held-out data. We validate our framework in an idealized linear regression setting, showing the TRACE bound correctly captures the scaling of the true risk difference with the magnitude of the shift. Across synthetic and vision benchmarks, TRACE diagnostics are valid and maintain a strong monotonic relationship with the true performance degradation. Crucially, we derive a deployment gate score that correlates strongly with

and achieves high AUROC/AUPRC for gating decisions, enabling safe, label-efficient model replacement.

Paper Structure (74 sections, 6 theorems, 53 equations, 2 figures, 5 tables)

This paper contains 74 sections, 6 theorems, 53 equations, 2 figures, 5 tables.

Introduction
Related Work
Domain Adaptation and Divergence Measures.
Model Disagreement and Algorithmic Stability.
Generalization Under Distribution Mismatch.
Shift Detection and Monitoring.
Problem Setup
Setting and Goal.
Loss and Risks.
Quantities for Attribution.
Assumptions.
General Applicability.
Wasserstein Distance.
From Decomposition to a Computable TRACE Diagnostic
A General Four-Term Decomposition
...and 59 more sections

Key Result

Lemma 1

The absolute risk change is bounded as follows: where $\mathsf{COSP} := |R_{\tilde{P}}(\tilde{Q}) - R_P(\tilde{Q})|$ represents the population-level risk change for a fixed model $\tilde{Q}$ due to the shift from $P_X$ to $\tilde{P}_X$.

Figures (2)

Figure 1: Conceptual overview of TRACE.
Figure 2: The TRACE diagnostic pipeline. The process takes models and data as input, computes the five core components of the bound in parallel, aggregates them, and produces the final diagnostic score $\widehat{\mathcal{B}}$ and its attribution report.

Theorems & Definitions (11)

Definition 1: 1-Wasserstein
Lemma 1: General TRACE Decomposition
Proposition 1: Model Change Bound
Lemma 2: Empirical Shift Bound
Corollary 1: Computable TRACE Diagnostic
proof
proof : Proof of Lemma \ref{['lem:trace-decomp']}
proof : Proof of Lemma \ref{['lem:emp-shift-label']}
Definition 2: MMD
Proposition 2: MMD-based Covariate Shift Control
...and 1 more

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

TL;DR

Abstract

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (11)