Statistical Inference under Performativity
Xiang Li, Yunai Li, Huiying Zhong, Lihua Lei, Zhun Deng
TL;DR
This work tackles statistical inference when predictions influence the data they predict (performativity) by developing an end-to-end framework for inference under performativity. It establishes a central limit theorem for estimators produced by repeated risk minimization and extends prediction-powered inference to dynamic performative settings, including data-driven covariance estimation via score matching and a policy-perturbation scheme. The framework enables bias-aware confidence regions for the performative stable point and tighter inference through PPI, with a greedy method to select weighting parameters that improve efficiency. Through simulations and a semi-synthetic credit-scoring case, the approach yields robust coverage and narrower confidence intervals, demonstrating practical value for policy design and economic decision-making under shifting data-generating processes.
Abstract
Performativity of predictions refers to the phenomenon where prediction-informed decisions influence the very targets they aim to predict -- a dynamic commonly observed in policy-making, social sciences, and economics. In this paper, we initiate an end-to-end framework of statistical inference under performativity. Our contributions are twofold. First, we establish a central limit theorem for estimation and inference in the performative setting, enabling standard inferential tasks such as constructing confidence intervals and conducting hypothesis tests in policy-making contexts. Second, we leverage this central limit theorem to study prediction-powered inference (PPI) under performativity. This approach yields more precise estimates and tighter confidence regions for the model parameters (i.e., policies) of interest in performative prediction. We validate the effectiveness of our framework through numerical experiments. To the best of our knowledge, this is the first work to establish a complete statistical inference under performativity, introducing new challenges and inference settings that we believe will provide substantial value to policy-making, statistics, and machine learning.
