The Need for a Big World Simulator: A Scientific Challenge for Continual Learning
Saurabh Kumar, Hong Jun Jeon, Alex Lewandowski, Benjamin Van Roy
TL;DR
This work argues that existing continual learning benchmarks fail to capture the true complexity of learning in a big world with bounded agents. It formalizes an information-theoretic framework for environments and agents, introduces two design desiderata for a big world simulator—no diminishing returns to capacity and ongoing learning for finite-capacity agents—and connects these ideas to forgetting and implasticity through a decomposition of prediction error. The authors propose a Turing-complete, Rule 110-based prediction environment as a concrete illustrative example that satisfies the desiderata and demonstrates capacity-driven improvements and persistent non-stationarity. The practical impact is a principled blueprint for building simulators that enable rapid prototyping at small scale while preserving relevance to real-world continual learning challenges. The work motivates future research toward robust evaluation metrics and algorithms that can better sustain continual engagement with a complex, evolving world.
Abstract
The "small agent, big world" frame offers a conceptual view that motivates the need for continual learning. The idea is that a small agent operating in a much bigger world cannot store all information that the world has to offer. To perform well, the agent must be carefully designed to ingest, retain, and eject the right information. To enable the development of performant continual learning agents, a number of synthetic environments have been proposed. However, these benchmarks suffer from limitations, including unnatural distribution shifts and a lack of fidelity to the "small agent, big world" framing. This paper aims to formalize two desiderata for the design of future simulated environments. These two criteria aim to reflect the objectives and complexity of continual learning in practical settings while enabling rapid prototyping of algorithms on a smaller scale.
