Table of Contents
Fetching ...

Microfoundation Inference for Strategic Prediction

Daniele Bracale, Subha Maity, Felipe Maia Polo, Seamus Somerstep, Moulinath Banerjee, Yuekai Sun

TL;DR

This work tackles performative prediction by learning the microfoundations of agent responses to predictive models. It models agent behavior as a cost-adjusted utility maximization with a known benefit $B_\theta$ and an unknown cost $c$ restricted to a Bregman divergence $c_\varphi(z,z') = \varphi(z') - \varphi(z) - \nabla\varphi(z)^\top(z'-z)$, and aims to identify $\varphi$ from pre- and post-model distributions. The authors propose an optimal-transport-based estimator that aligns ex-ante and ex-post distributions to recover the gradient $\nabla\varphi$, enabling accurate reconstruction of the response map $T_\theta$ and robust minimization of performative risk. They establish identifiability conditions and convergence rates (e.g., $\mathbb{E}[\|\widehat{\gamma}-\gamma^*\|_2^2] \le K n^{-2/d}$ under strong convexity), and validate the approach on a credit-scoring dataset, showing robustness to misspecification of the benefit function and competitive performance relative to baselines. Overall, the framework provides a principled, data-driven path to infer social impacts of predictions and to exploit fast, constraint-aware optimization in strategic settings.

Abstract

Often in prediction tasks, the predictive model itself can influence the distribution of the target variable, a phenomenon termed performative prediction. Generally, this influence stems from strategic actions taken by stakeholders with a vested interest in predictive models. A key challenge that hinders the widespread adaptation of performative prediction in machine learning is that practitioners are generally unaware of the social impacts of their predictions. To address this gap, we propose a methodology for learning the distribution map that encapsulates the long-term impacts of predictive models on the population. Specifically, we model agents' responses as a cost-adjusted utility maximization problem and propose estimates for said cost. Our approach leverages optimal transport to align pre-model exposure (ex ante) and post-model exposure (ex post) distributions. We provide a rate of convergence for this proposed estimate and assess its quality through empirical demonstrations on a credit-scoring dataset.

Microfoundation Inference for Strategic Prediction

TL;DR

This work tackles performative prediction by learning the microfoundations of agent responses to predictive models. It models agent behavior as a cost-adjusted utility maximization with a known benefit and an unknown cost restricted to a Bregman divergence , and aims to identify from pre- and post-model distributions. The authors propose an optimal-transport-based estimator that aligns ex-ante and ex-post distributions to recover the gradient , enabling accurate reconstruction of the response map and robust minimization of performative risk. They establish identifiability conditions and convergence rates (e.g., under strong convexity), and validate the approach on a credit-scoring dataset, showing robustness to misspecification of the benefit function and competitive performance relative to baselines. Overall, the framework provides a principled, data-driven path to infer social impacts of predictions and to exploit fast, constraint-aware optimization in strategic settings.

Abstract

Often in prediction tasks, the predictive model itself can influence the distribution of the target variable, a phenomenon termed performative prediction. Generally, this influence stems from strategic actions taken by stakeholders with a vested interest in predictive models. A key challenge that hinders the widespread adaptation of performative prediction in machine learning is that practitioners are generally unaware of the social impacts of their predictions. To address this gap, we propose a methodology for learning the distribution map that encapsulates the long-term impacts of predictive models on the population. Specifically, we model agents' responses as a cost-adjusted utility maximization problem and propose estimates for said cost. Our approach leverages optimal transport to align pre-model exposure (ex ante) and post-model exposure (ex post) distributions. We provide a rate of convergence for this proposed estimate and assess its quality through empirical demonstrations on a credit-scoring dataset.

Paper Structure

This paper contains 22 sections, 4 theorems, 50 equations, 8 figures, 1 algorithm.

Key Result

Theorem 4.1

In the ex-ante problem, denote the transportation maps from the barycenter to the measures $P, Q_{\theta}, \dots Q_{\theta_m}$ as $T_0, T_1, \dots, T_m$, and similarly, in (ex-post), denote them as $T_1, \dots, T_m$. Then an equivalent condition for eq:identifiability1 is that all conservative sol

Figures (8)

  • Figure 1: The function $\varphi'$ is well estimated when he benefit function $B_\theta$ is correctly specified. On the other hand, misspecification of $B_\theta$ leads to biased estimates of $\varphi'$.
  • Figure 2: The estimation of the transport map $T_{\tilde{\theta}}$ is robust to the misspecification of the benefit function for values of $\tilde{\theta}$ different from those used to induce the ex-post distribution and estimate $\varphi'$, i.e., $\theta$.
  • Figure 3: Performative performance for different $\#$ published classifiers: the plots depict the performative test accuracy/cross-entropy loss as the $\#$ classifiers increases. Compared to the baselines, our method converges much faster to the optimal classifier.
  • Figure 4: The estimation error of our method, when estimating $T_\theta$, decreases with growing sample size.
  • Figure 5: Estimation error for the map $T_\theta$ in the multivariate case with a convex neural network.
  • ...and 3 more figures

Theorems & Definitions (9)

  • Example 2.1: Causal Strategic OLS shavit2020Causal
  • Theorem 4.1
  • Corollary 4.2
  • Theorem 4.3
  • Lemma A.1
  • proof : Proof of Lemma \ref{['lemma:missp-map-minimization']}
  • proof : Proof of Theorem \ref{['lemma:identifiable']}
  • proof : Proof of Corollary \ref{['lemma:identifiable2']}
  • proof : Proof of Theorem \ref{['thm:rate-of-convergence']}