Prospective Learning in Retrospect
Yuxin Bai, Cecelia Shuai, Ashwin De Silva, Siyu Yu, Pratik Chaudhari, Joshua T. Vogelstein
TL;DR
This work addresses the misalignment of PAC-based learning with nonstationary data and evolving objectives by adopting prospective learning, which treats data as drawn from a stochastic process and aims to minimize future risk via a time-indexed sequence of predictors. The authors extend the framework with practical instantiations: Prospective-ERM and Prospective-MLP using a time embedding, and nonparametric Prospective CART/GBTs, plus a foraging application that demonstrates sequential decision-making under a one-life constraint. Key findings show that (i) Prospective-MLPs remain robust under heterogeneous sampling and can handle infinite-task regimes with appropriate time embeddings, (ii) streaming-online training is feasible but slows convergence compared with batched training, (iii) prospective forests perform competitively with Prospective-MLPs and offer interpretability, and (iv) prospective foraging can surpass standard actor–critic RL in a one-life task when time embeddings are used. Collectively, these results validate prospective learning as a versatile framework for dynamic environments, with potential impact in supervised and reinforcement-learning contexts where task distributions drift over time.
Abstract
In most real-world applications of artificial intelligence, the distributions of the data and the goals of the learners tend to change over time. The Probably Approximately Correct (PAC) learning framework, which underpins most machine learning algorithms, fails to account for dynamic data distributions and evolving objectives, often resulting in suboptimal performance. Prospective learning is a recently introduced mathematical framework that overcomes some of these limitations. We build on this framework to present preliminary results that improve the algorithm and numerical results, and extend prospective learning to sequential decision-making scenarios, specifically foraging. Code is available at: https://github.com/neurodata/prolearn2.
