PAC Learnability in the Presence of Performativity
Ivan Kirev, Lyuben Baltadzhiev, Nikola Konstantinov
TL;DR
The paper addresses PAC learnability under model-induced distribution shifts caused by performativity, formalizing a performative PAC framework relative to a distribution map $\tilde{\mathcal{D}}(\cdot, \mathcal{D})$ and introducing a linear posterior drift family. It derives the performative empirical risk $PR_n(h)$ as an unbiased estimator of the true performative risk $PR(h)$ and shows that minimizing $PR_n(h)$ yields a PAC learner whenever the standard binary-classification class is PAC-learnable, even under linear drift. Generalization bounds are obtained via Rademacher complexity, with extensions to imperfect information about the distribution map and to broader performative maps using density-ratio reweighting. Empirically, PERM improves performance over ERM across synthetic data and real-world credit/income datasets, and analysis of RERM illustrates both its potential and its limitations under certain shift dynamics. The work offers a principled statistical approach to learning under self-induced distribution shifts and points to future work on richer performative maps and real-world experimental validation.
Abstract
Following the wide-spread adoption of machine learning models in real-world applications, the phenomenon of performativity, i.e. model-dependent shifts in the test distribution, becomes increasingly prevalent. Unfortunately, since models are usually trained solely based on samples from the original (unshifted) distribution, this performative shift may lead to decreased test-time performance. In this paper, we study the question of whether and when performative binary classification problems are learnable, via the lens of the classic PAC (Probably Approximately Correct) learning framework. We motivate several performative scenarios, accounting in particular for linear shifts in the label distribution, as well as for more general changes in both the labels and the features. We construct a performative empirical risk function, which depends only on data from the original distribution and on the type performative effect, and is yet an unbiased estimate of the true risk of a classifier on the shifted distribution. Minimizing this notion of performative risk allows us to show that any PAC-learnable hypothesis space in the standard binary classification setting remains PAC-learnable for the considered performative scenarios. We also conduct an extensive experimental evaluation of our performative risk minimization method and showcase benefits on synthetic and real data.
