The NetHack Learning Environment

Heinrich Küttler; Nantas Nardelli; Alexander H. Miller; Roberta Raileanu; Marco Selvatici; Edward Grefenstette; Tim Rocktäschel

TL;DR

The NetHack Learning Environment (NLE) is a fast, complex, procedurally generated RL testbed built on NetHack, intended to advance research on exploration, planning, memory, and generalization. It combines a rich symbolic observation space, a large action set, and long-horizon dynamics with a scalable Gym interface, and it comes with a task suite and baselines based on IMPALA and Random Network Distillation (RND). Empirical results show that exploration strategies improve performance on several tasks in the early stages of the game. Generalization and qualitative analyses reveal core failure modes, along with dynamics such as complex entity interactions and rare events like locating the Oracle. The work argues that NetHack’s depth, randomness, and abundance of embedded knowledge can support long-term progress toward robust, transferable RL algorithms, while the environment remains accessible to resource-constrained research groups. Future directions include upgrading to NetHack 3.7, scripting for user-defined tasks, and using language-based signals for auxiliary learning.

Abstract

Progress in Reinforcement Learning (RL) algorithms goes hand-in-hand with the development of challenging environments that test the limits of current methods. While existing RL environments are either sufficiently complex or based on fast simulation, they are rarely both. Here, we present the NetHack Learning Environment (NLE), a scalable, procedurally generated, stochastic, rich, and challenging environment for RL research based on the popular single-player terminal-based roguelike game, NetHack. We argue that NetHack is sufficiently complex to drive long-term research on problems such as exploration, planning, skill acquisition, and language-conditioned RL, while dramatically reducing the computational resources required to gather a large amount of experience. We compare NLE and its task suite to existing alternatives, and discuss why it is an ideal medium for testing the robustness and systematic generalization of RL agents. We demonstrate empirical success for early stages of the game using a distributed Deep RL baseline and Random Network Distillation exploration, alongside qualitative analysis of various agents trained in the environment. NLE is open source at https://github.com/facebookresearch/nle.
