Evaluating and Correcting Performative Effects of Decision Support Systems via Causal Domain Shift

Philip Boeken; Onno Zoeter; Joris M. Mooij

TL;DR

This work treats the deployment of decision support predictions as a causal domain shift, introducing the domain indicator D to distinguish pre- and post-deployment in high-stakes settings. It defines deployment and retraining effects, and shows that evaluating these effects and correcting performative bias can be cast as domain adaptation problems solvable via domain pivots {X,Z} and a repeated regression estimator. The approach accommodates selection bias and missing labels, yielding identifiability results and a practical estimation strategy without requiring randomized deployments. By linking evaluation and bias correction under a unified causal framework, it offers a principled method to anticipate, monitor, and mitigate performative effects of DSSs in fields like healthcare and law. The framework thus supports responsible, data-driven deployment of predictive alarms by enabling pre- and post-deployment assessment and bias-aware retraining.
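One way to read the repeated regression idea is as two nested regressions: first regress $Y$ on the pivot $(X, Z)$ in the domain where labels are available, then regress those fitted values on $X$ in the domain of interest. The sketch below illustrates this under assumptions that are not taken from the paper: a discrete feature `x`, a binary treatment `z` standing in for the pivot variable $Z$, hypothetical risk values and treatment rates, and the invariance assumption that $E[Y \mid X, Z]$ is the same before and after deployment. It is an illustration of the general technique, not the authors' estimator.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical untreated risk per feature value x in {0, 1, 2}.
BASE_RISK = np.array([0.2, 0.5, 0.8])
ALARM_THRESHOLD = 0.4   # assumed alarm rule: alarm if baseline risk > 0.4
TREATMENT_FACTOR = 0.5  # assumed: treatment halves the risk


def simulate(n, deployed):
    """Draw (x, z, y) from the pre- (deployed=0) or post-deployment domain."""
    x = rng.integers(0, 3, size=n)
    alarm = deployed * (BASE_RISK[x] > ALARM_THRESHOLD)
    # The agent treats more often when the alarm fires (assumed rates).
    z = rng.binomial(1, np.where(alarm == 1, 0.9, 0.2))
    y = rng.binomial(1, BASE_RISK[x] * np.where(z == 1, TREATMENT_FACTOR, 1.0))
    return x, z, y


# Labelled data after deployment; only (x, z) is used from before deployment.
x1, z1, y1 = simulate(200_000, deployed=1)
x0, z0, _ = simulate(200_000, deployed=0)

# Step 1: inner regression E[Y | X, Z], fitted as cell means on the
# post-deployment data.
inner = np.zeros((3, 2))
for xv in range(3):
    for zv in range(2):
        inner[xv, zv] = y1[(x1 == xv) & (z1 == zv)].mean()

# Step 2: outer regression of the fitted values on X, using the
# pre-deployment distribution of Z given X.
pseudo_outcome = inner[x0, z0]
baseline_estimate = np.array([pseudo_outcome[x0 == xv].mean() for xv in range(3)])

# Naive alternative: regress Y on X directly in the post-deployment data.
naive_estimate = np.array([y1[x1 == xv].mean() for xv in range(3)])

print("repeated regression:", baseline_estimate.round(3))
print("naive regression:   ", naive_estimate.round(3))
```

In this toy setup the pre-deployment risk is `0.9 * BASE_RISK` (20% of cases are treated without an alarm), which the two-step estimate recovers up to sampling noise, while the naive regression is clearly lower for the feature values that trigger an alarm.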

Abstract

When predicting a target variable $Y$ from features $X$, the prediction $\hat{Y}$ can be performative: an agent might act on this prediction, affecting the value of $Y$ that we eventually observe. Performative predictions are deliberately prevalent in algorithmic decision support, where a Decision Support System (DSS) provides a prediction for an agent to affect the value of the target variable. When deploying a DSS in high-stakes settings (e.g. healthcare, law, predictive policing, or child welfare screening), it is imperative to carefully assess the performative effects of the DSS. When the DSS serves as an alarm for a predicted negative outcome, naive retraining of the prediction model is bound to yield a model that underestimates the risk, precisely because the previous model worked effectively. In this work, we propose to model the deployment of a DSS as a causal domain shift and provide novel cross-domain identification results for the conditional expectation $E[Y | X]$, allowing for pre- and post-hoc assessment of the deployment of the DSS, and for retraining of a model that assesses the risk under a baseline policy where the DSS is not deployed. Using a running example, we empirically show that a repeated regression procedure provides a practical framework for estimating these quantities, even when the data is affected by sample selection bias and selective labelling, offering a practical, unified solution for multiple forms of target variable bias.
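The claim that naive retraining underestimates risk can be made concrete with a small three-epoch simulation: no DSS in the first epoch, a DSS trained on the first epoch deployed in the second, and a naively retrained DSS deployed in the third. Everything numeric here is a hypothetical choice for illustration (three feature groups, the risk values, the alarm threshold, and a treatment that halves the risk whenever the alarm fires); none of it comes from the paper's running example.

```python
import numpy as np

rng = np.random.default_rng(1)

N_GROUPS = 3
BASE_RISK = np.array([0.2, 0.5, 0.8])  # hypothetical risk without intervention
ALARM_THRESHOLD = 0.3                  # assumed: alarm if predicted risk > 0.3
TREATMENT_FACTOR = 0.5                 # assumed: an alarm halves the risk


def run_epoch(predictor, n=100_000):
    """Simulate one epoch; predictor is None when no DSS is deployed."""
    x = rng.integers(0, N_GROUPS, size=n)
    if predictor is None:
        alarm = np.zeros(n, dtype=bool)
    else:
        alarm = predictor[x] > ALARM_THRESHOLD
    y = rng.binomial(1, BASE_RISK[x] * np.where(alarm, TREATMENT_FACTOR, 1.0))
    return x, y


def fit(x, y):
    """Regress Y on X by taking the outcome frequency per group."""
    return np.array([y[x == g].mean() for g in range(N_GROUPS)])


x1, y1 = run_epoch(None)      # epoch 1: no DSS
model_t1 = fit(x1, y1)        # trained on unaffected outcomes

x2, y2 = run_epoch(model_t1)  # epoch 2: DSS trained on epoch 1 is deployed
model_t2 = fit(x2, y2)        # naive retraining on performatively shifted data

x3, y3 = run_epoch(model_t2)  # epoch 3: naively retrained DSS is deployed

print("model trained on epoch 1:", model_t1.round(3))
print("model retrained on epoch 2:", model_t2.round(3))
print("mean outcome per epoch:", [round(float(v.mean()), 3) for v in (y1, y2, y3)])
```

In this setup the retrained model reports roughly half the baseline risk for the groups that received alarms, so the middle group drops below the threshold and loses its alarm in the third epoch, and the mean outcome rises again.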

Paper Structure (16 sections, 5 theorems, 18 equations, 5 figures, 2 tables)

Introduction
Contributions
Related work
Causal modelling of decision support systems
Application A: Evaluation
Application B: Bias correction
Equivalence of T1--3 and their non-identifiability
Domain pivots: mediators of the prediction and outcome
Estimation
Sample selection bias and selective labelling
Discussion
Relation to existing literature
Performative prediction
Off-policy evaluation
Surrogate indices
...and 1 more section

Key Result

Proposition 6

Given a Markov kernel $\mathbb{P}(Y=1 \mid X, A)$, consider the SCM $X \sim \mathbb{P}(X)$, $A = D \cdot \mathbb{1}\{\hat{Y} > \varepsilon(X)\}$, $\hat{Y} = \hat{y}(X)$, $Y \sim \mathbb{P}(Y=1 \mid X, A)$, with $\varepsilon(x) := \mathbb{P}(Y=1 \mid X=x, A=1)$ and for $\mathbb{P}(X)$-almost all $x\in \mathcal{X}$.
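The statement above is cut off in this summary, so the following sketch only instantiates the ingredients of the SCM and does not attempt to reproduce the proposition's conclusion. It assumes a hypothetical kernel in which the action $A=1$ halves a group-specific risk, a discrete feature with three values, and two candidate predictors $\hat{y}$; all of these are illustrative choices. Conditional risks are computed exactly from the kernel rather than by sampling $Y$.

```python
import numpy as np

X_VALUES = np.array([0, 1, 2])
BASE = np.array([0.2, 0.5, 0.8])  # hypothetical P(Y=1 | X=x, A=0)


def kernel(x, a):
    """Hypothetical Markov kernel P(Y=1 | X=x, A=a): A=1 halves the risk."""
    return BASE[x] * np.where(a == 1, 0.5, 1.0)


def epsilon(x):
    """Threshold from the SCM: epsilon(x) = P(Y=1 | X=x, A=1)."""
    return kernel(x, np.ones_like(x))


def action(x, y_hat, d):
    """Structural equation A = D * 1{y_hat(X) > epsilon(X)}."""
    return d * (y_hat(x) > epsilon(x)).astype(int)


def risk_given_x(x, y_hat, d):
    """P(Y=1 | X=x) in domain D=d when predictor y_hat is deployed."""
    return kernel(x, action(x, y_hat, d))


def baseline_predictor(x):
    """Predictor equal to the risk when no action is taken."""
    return kernel(x, np.zeros_like(x))


def retrained_predictor(x):
    """Predictor equal to the post-deployment risk under baseline_predictor."""
    return risk_given_x(x, baseline_predictor, d=1)


risk_d0 = risk_given_x(X_VALUES, baseline_predictor, d=0)
risk_d1 = risk_given_x(X_VALUES, baseline_predictor, d=1)
risk_after_retraining = risk_given_x(X_VALUES, retrained_predictor, d=1)

print("risk without deployment:      ", risk_d0)
print("risk with baseline predictor: ", risk_d1)
print("risk with retrained predictor:", risk_after_retraining)
```

With this kernel, deploying the baseline predictor triggers the action everywhere and lowers the risk to $\varepsilon(x)$. A predictor refitted to that lowered risk no longer exceeds the strict threshold, so the action is never taken and the risk returns to its pre-deployment level.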

Figures (5)

Figure 1: In epoch $t=1$ the DSS is not deployed. In $t=2$ a DSS $\hat{Y}$ is deployed that is trained on data from $t=1$, effectively reducing the mean of $Y$. In $t=3$, a DSS $\hat{Y}$ that is naively retrained on data from $t=2$ is deployed, increasing the mean of $Y$.
Figure 2: Modelling the deployment of the DSS with prediction $\hat{Y}$ as domain shift.
Figure 3: Performative prediction through a mediator $A$, with an observed common cause $C$.
Figure 4: $t=3, \hat{Y} = \hat{\mathbb{E}}[Y \mid X, \mathrm{do}(D=0)]$
Figure 5: A causal graph with selection variable $S$.

Theorems & Definitions (14)

Example 1
Definition 2: Deployment effect
Definition 3: Retraining effect
Definition 4: Baseline predictor
Definition 5: Performative bias
Proposition 6
Definition 7: Identifiability
Lemma 8
Proposition 9
Definition 10: Domain pivot
...and 4 more
