A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

Olivier Sigaud; Gianluca Baldassarre; Cedric Colas; Stephane Doncieux; Richard Duro; Pierre-Yves Oudeyer; Nicolas Perrin-Gilbert; Vieri Giuliano Santucci

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

Olivier Sigaud, Gianluca Baldassarre, Cedric Colas, Stephane Doncieux, Richard Duro, Pierre-Yves Oudeyer, Nicolas Perrin-Gilbert, Vieri Giuliano Santucci

TL;DR

The paper addresses the lack of a formal, consensus definition for open-ended learning by isolating an elementary property: the periodic emergence of novelty from an observer's perspective over an infinite horizon. It formalizes open-ended learning problems and, in particular, open-ended goal-conditioned reinforcement learning (GCRL), introducing first-order and second-order variants of open-ended GCRL. It then shows how to combine OEL with lifelong and autotelic or teachable goal-generation ideas, and discusses evaluation strategies and practical limitations. The work provides a principled framework to study evolving goal spaces and curricula, with implications for developmental AI and continual-learning research, while highlighting the need for explicit progress metrics and richer goal-discovery mechanisms in future work.

Abstract

A lot of recent machine learning research papers have ``open-ended learning'' in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what distinguishes open-ended learning from related concepts such as continual learning, lifelong learning or autotelic learning. In this paper, we contribute to fixing this situation. After illustrating the genealogy of the concept and more recent perspectives about what it truly means, we outline that open-ended learning is generally conceived as a composite notion encompassing a set of diverse properties. In contrast with previous approaches, we propose to isolate a key elementary property of open-ended processes, which is to produce elements from time to time (e.g., observations, options, reward functions, and goals), over an infinite horizon, that are considered novel from an observer's perspective. From there, we build the notion of open-ended learning problems and focus in particular on the subset of open-ended goal-conditioned reinforcement learning problems in which agents can learn a growing repertoire of goal-driven skills. Finally, we highlight the work that remains to be performed to fill the gap between our elementary definition and the more involved notions of open-ended learning that developmental AI researchers may have in mind.

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

TL;DR

Abstract

Paper Structure (17 sections, 2 figures)

This paper contains 17 sections, 2 figures.

Introduction
Defining OEL: Elements from the literature
Genealogy of the notion
Current perspectives
Open-ended learning: a definition
Open-ended process
Open-ended RL
Open-ended GCRL
First-order and second-order open-ended GCRL
Combining open-ended learning with other properties
Goal-conditioned continual reinforcement learning
Lifelong open-ended learning
The origin of goals: extrinsic, autotelic, and teachable open-ended learning agents
Evaluating open-ended learning agents
Assessing the open-ended learning property
...and 2 more sections

Figures (2)

Figure 1: Schematic overview of the genealogy of the notion of open-endedness in the Developmental AI and Artificial Life literature.
Figure 2: The set of definitions used in this paper. In blue, notions related to defining the open-ended goal-conditioned RL problems (key definitions are darker). In red, notions related to defining open-ended GCRL agents, that is solutions. An arrow expresses that a definition depends on another.

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

TL;DR

Abstract

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

Authors

TL;DR

Abstract

Table of Contents

Figures (2)