Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

Linfeng Zhao; Lawson L. S. Wong

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

Linfeng Zhao, Lawson L. S. Wong

TL;DR

The paper tackles zero-shot navigation in unseen mazes by supplying abstract 2-D maps $m \\in \\mathbb{R}^{N imes N}$ and goal grids $g \\in \\mathbb{R}^{2 imes N imes N}$. It introduces MMN, a model-based framework where a task-conditioned hypermodel $h_\psi$ outputs transition weights $\phi$ for a latent dynamics model $f_\phi$ conditioned on context $c=(m,g)$, enabling planning via MuZero-style MCTS without explicit localization on the map. MMN is trained with an auxiliary model loss and $n$-step hindsight experience replay to cope with sparse rewards, and is evaluated against a model-free baseline (MAH) and a DQN variant, showing superior long-horizon navigation and robustness to map perturbations in DeepMind Lab. The results demonstrate that end-to-end planning with map-conditioned dynamics generalizes to novel layouts and can leverage a hierarchical subgoal strategy to achieve global objectives, highlighting the practical potential of abstract-map-guided navigation without environment-specific training. The work suggests promising directions for subgoal generation and visual-domain extensions in robust, transferable navigation systems.

Abstract

Learning navigation capabilities in different environments has long been one of the major challenges in decision-making. In this work, we focus on zero-shot navigation ability using given abstract $2$-D top-down maps. Like human navigation by reading a paper map, the agent reads the map as an image when navigating in a novel layout, after learning to navigate on a set of training maps. We propose a model-based reinforcement learning approach for this multi-task learning problem, where it jointly learns a hypermodel that takes top-down maps as input and predicts the weights of the transition network. We use the DeepMind Lab environment and customize layouts using generated maps. Our method can adapt better to novel environments in zero-shot and is more robust to noise.

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

TL;DR

The paper tackles zero-shot navigation in unseen mazes by supplying abstract 2-D maps

and goal grids

. It introduces MMN, a model-based framework where a task-conditioned hypermodel

outputs transition weights

for a latent dynamics model

conditioned on context

, enabling planning via MuZero-style MCTS without explicit localization on the map. MMN is trained with an auxiliary model loss and

-step hindsight experience replay to cope with sparse rewards, and is evaluated against a model-free baseline (MAH) and a DQN variant, showing superior long-horizon navigation and robustness to map perturbations in DeepMind Lab. The results demonstrate that end-to-end planning with map-conditioned dynamics generalizes to novel layouts and can leverage a hierarchical subgoal strategy to achieve global objectives, highlighting the practical potential of abstract-map-guided navigation without environment-specific training. The work suggests promising directions for subgoal generation and visual-domain extensions in robust, transferable navigation systems.

Abstract

Learning navigation capabilities in different environments has long been one of the major challenges in decision-making. In this work, we focus on zero-shot navigation ability using given abstract

-D top-down maps. Like human navigation by reading a paper map, the agent reads the map as an image when navigating in a novel layout, after learning to navigate on a set of training maps. We propose a model-based reinforcement learning approach for this multi-task learning problem, where it jointly learns a hypermodel that takes top-down maps as input and predicts the weights of the transition network. We use the DeepMind Lab environment and customize layouts using generated maps. Our method can adapt better to novel environments in zero-shot and is more robust to noise.

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

TL;DR

Abstract

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)