General agents contain world models

Jonathan Richens; David Abel; Alexis Bellot; Tom Everitt

General agents contain world models

Jonathan Richens, David Abel, Alexis Bellot, Tom Everitt

TL;DR

It is shown that any agent capable of generalizing to multi-step goal-directed tasks must have learned a predictive model of its environment, and that increasing the agents performance or the complexity of the goals it can achieve requires learning increasingly accurate world models.

Abstract

Are world models a necessary ingredient for flexible, goal-directed behaviour, or is model-free learning sufficient? We provide a formal answer to this question, showing that any agent capable of generalizing to multi-step goal-directed tasks must have learned a predictive model of its environment. We show that this model can be extracted from the agent's policy, and that increasing the agents performance or the complexity of the goals it can achieve requires learning increasingly accurate world models. This has a number of consequences: from developing safe and general agents, to bounding agent capabilities in complex environments, and providing new algorithms for eliciting world models from agents.

General agents contain world models

TL;DR

Abstract

General agents contain world models

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (29)