Neural Map: Structured Memory for Deep Reinforcement Learning

Emilio Parisotto; Ruslan Salakhutdinov

Neural Map: Structured Memory for Deep Reinforcement Learning

Emilio Parisotto, Ruslan Salakhutdinov

TL;DR

The paper tackles memory in partially observable deep reinforcement learning by introducing Neural Map, a spatially structured 2D external memory with location-aligned, sparse writes. It defines differentiable read/write/update operations, including global and context-based reads and a local write, with variants such as key-value addressing and GRU-based writes. Empirical results show Neural Map outperforming LSTM and MemNN baselines in 2D mazes and achieving strong generalization in a 3D Doom environment, especially when using GRU-based updates; an ego-centric extension further removes reliance on absolute pose. The work provides a practical, end-to-end trainable memory architecture that scales to complex 3D environments and offers useful inductive biases for spatial navigation in DRL.

Abstract

A critical component to enabling intelligent reasoning in partially observable environments is memory. Despite this importance, Deep Reinforcement Learning (DRL) agents have so far used relatively simple memory architectures, with the main methods to overcome partial observability being either a temporal convolution over the past k frames or an LSTM layer. More recent work (Oh et al., 2016) has went beyond these architectures by using memory networks which can allow more sophisticated addressing schemes over the past k frames. But even these architectures are unsatisfactory due to the reason that they are limited to only remembering information from the last k frames. In this paper, we develop a memory system with an adaptable write operator that is customized to the sorts of 3D environments that DRL agents typically interact with. This architecture, called the Neural Map, uses a spatially structured 2D memory image to learn to store arbitrary information about the environment over long time lags. We demonstrate empirically that the Neural Map surpasses previous DRL memories on a set of challenging 2D and 3D maze environments and show that it is capable of generalizing to environments that were not seen during training.

Neural Map: Structured Memory for Deep Reinforcement Learning

TL;DR

Abstract

Neural Map: Structured Memory for Deep Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)