Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity

Emiliyan Gospodinov; Vaisakh Shaj; Philipp Becker; Stefan Geyer; Gerhard Neumann

Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity

Emiliyan Gospodinov, Vaisakh Shaj, Philipp Becker, Stefan Geyer, Gerhard Neumann

TL;DR

A new formalism, Hidden Parameter-POMDP, designed for control with adaptive world models is introduced, which enables learning robust behaviors across a variety of non-stationary RL benchmarks and effectively learns task abstractions in an unsupervised manner.

Abstract

Developing foundational world models is a key research direction for embodied intelligence, with the ability to adapt to non-stationary environments being a crucial criterion. In this work, we introduce a new formalism, Hidden Parameter-POMDP, designed for control with adaptive world models. We demonstrate that this approach enables learning robust behaviors across a variety of non-stationary RL benchmarks. Additionally, this formalism effectively learns task abstractions in an unsupervised manner, resulting in structured, task-aware latent spaces.

Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity

TL;DR

Abstract

Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)

Theorems & Definitions (1)