All AI Models are Wrong, but Some are Optimal
Akhil S Anand, Shambhuraj Sawant, Dirk Reinhardt, Sebastien Gros
TL;DR
This work tackles the gap between predictive accuracy and decision quality in sequential decision-making by introducing decision-oriented predictive models and proving necessary and sufficient conditions for when such models yield optimal policies. It shows that models best fitting data do not necessarily maximize decision performance and that deterministic predictions can be optimal for stochastic systems under the proposed framework. Through battery energy storage and smart-home heat-pump examples, the authors demonstrate how RL-based fine-tuning of data-fitted predictors can materially improve closed-loop performance, even when using simple, differentiable MPC schemes. The results offer practical guidance for constructing decision-oriented predictive models and highlight future directions for integrating prediction, optimization, and learning in complex, real-world systems.
Abstract
AI models that predict the future behavior of a system (a.k.a. predictive AI models) are central to intelligent decision-making. However, decision-making using predictive AI models often results in suboptimal performance. This is primarily because AI models are typically constructed to best fit the data, and hence to predict the most likely future rather than to enable high-performance decision-making. The hope that such prediction enables high-performance decisions is neither guaranteed in theory nor established in practice. In fact, there is increasing empirical evidence that predictive models must be tailored to decision-making objectives for performance. In this paper, we establish formal (necessary and sufficient) conditions that a predictive model (AI-based or not) must satisfy for a decision-making policy established using that model to be optimal. We then discuss their implications for building predictive AI models for sequential decision-making.
