Dissecting Performative Prediction: A Comprehensive Survey
Thomas Kehrenberg, Javier Sanguino, Jose A. Lozano, Novi Quadrianto
TL;DR
This survey formalizes performative prediction through the distribution map $\mathcal{D}(\theta)$, outlining how deploying a predictor shifts the environment and creates a feedback loop. It defines two core objectives—performative stability $\theta_{PS}$ and performative optimality $\theta_{PO}$—and introduces a classification by access level to the distribution map, guiding a comprehensive review of models, fitting methods, and optimization algorithms. The paper surveys mathematical models of distribution maps (including strategic classification, base-distribution families, and transition maps), techniques to fit such maps, and algorithms (RRM, RGDescent, stochastic methods, bilevel, etc.) to reach stable or optimal points, with extensions to stateful and multi-deployer settings. It further discusses cross-pollination with adversarial robustness, algorithmic recourse, delayed impact, and fairness, and highlights practical challenges in data collection, benchmarks, and the need for standardized datasets for PP research.
Abstract
The field of performative prediction had its beginnings in 2020 with the seminal paper "Performative Prediction" by Perdomo et al., which established a novel machine learning setup where the deployment of a predictive model causes a distribution shift in the environment, which in turn causes a mismatch between the distribution expected by the predictive model and the real distribution. This shift is defined by a so-called distribution map. In the half-decade since, a literature has emerged which has, among other things, introduced new solution concepts to the original setup, extended the setup, offered new theoretical analyses, and examined the intersection of performative prediction and other established fields. In this survey, we first lay out the performative prediction setting and explain the different optimization targets: performative stability and performative optimality. We introduce a new way of classifying different performative prediction settings, based on how much information is available about the distribution map. We survey existing implementations of distribution maps and existing methods to address the problem of performative prediction, while examining different ways to categorize them. Finally, we point out known and previously unknown connections that can be drawn to other fields, in the hopes of stimulating future research.
