MRSD: Multi-Resolution Skill Discovery for HRL Agents

Shashank Sharma; Janina Hoffmann; Vinay Namboodiri

MRSD: Multi-Resolution Skill Discovery for HRL Agents

Shashank Sharma, Janina Hoffmann, Vinay Namboodiri

TL;DR

MRSD tackles long-horizon control by learning multiple skill encoders at temporal scales $l_i$ and a dynamic interleaving policy, enabling both fine- and coarse-grained control. It uses CVAE-based Skill encodings to model abstract state transitions and an exploratory objective based on reconstruction error to drive diverse skill discovery without external rewards. Empirical results on the DeepMind Control Suite show faster convergence and competitive final performance relative to state-of-the-art skill discovery and HRL baselines, with performance approaching non-HRL methods like DreamerV2 on several tasks. The approach offers a scalable path to versatile agents that combine multi-resolution skills for more efficient and flexible control, with well-characterized limitations and clear directions for future work.

Abstract

Hierarchical reinforcement learning (HRL) relies on abstract skills to solve long-horizon tasks efficiently. While existing skill discovery methods learns these skills automatically, they are limited to a single skill per task. In contrast, humans learn and use both fine-grained and coarse motor skills simultaneously. Inspired by human motor control, we propose Multi-Resolution Skill Discovery (MRSD), an HRL framework that learns multiple skill encoders at different temporal resolutions in parallel. A high-level manager dynamically selects among these skills, enabling adaptive control strategies over time. We evaluate MRSD on tasks from the DeepMind Control Suite and show that it outperforms prior state-of-the-art skill discovery and HRL methods, achieving faster convergence and higher final performance. Our findings highlight the benefits of integrating multi-resolution skills in HRL, paving the way for more versatile and efficient agents.

MRSD: Multi-Resolution Skill Discovery for HRL Agents

TL;DR

MRSD tackles long-horizon control by learning multiple skill encoders at temporal scales

and a dynamic interleaving policy, enabling both fine- and coarse-grained control. It uses CVAE-based Skill encodings to model abstract state transitions and an exploratory objective based on reconstruction error to drive diverse skill discovery without external rewards. Empirical results on the DeepMind Control Suite show faster convergence and competitive final performance relative to state-of-the-art skill discovery and HRL baselines, with performance approaching non-HRL methods like DreamerV2 on several tasks. The approach offers a scalable path to versatile agents that combine multi-resolution skills for more efficient and flexible control, with well-characterized limitations and clear directions for future work.

MRSD: Multi-Resolution Skill Discovery for HRL Agents

TL;DR

Abstract

MRSD: Multi-Resolution Skill Discovery for HRL Agents

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)