MRSD: Multi-Resolution Skill Discovery for HRL Agents
Shashank Sharma, Janina Hoffmann, Vinay Namboodiri
TL;DR
MRSD tackles long-horizon control by learning multiple skill encoders at temporal scales $l_i$ and a dynamic interleaving policy, enabling both fine- and coarse-grained control. It uses CVAE-based Skill encodings to model abstract state transitions and an exploratory objective based on reconstruction error to drive diverse skill discovery without external rewards. Empirical results on the DeepMind Control Suite show faster convergence and competitive final performance relative to state-of-the-art skill discovery and HRL baselines, with performance approaching non-HRL methods like DreamerV2 on several tasks. The approach offers a scalable path to versatile agents that combine multi-resolution skills for more efficient and flexible control, with well-characterized limitations and clear directions for future work.
Abstract
Hierarchical reinforcement learning (HRL) relies on abstract skills to solve long-horizon tasks efficiently. While existing skill discovery methods learns these skills automatically, they are limited to a single skill per task. In contrast, humans learn and use both fine-grained and coarse motor skills simultaneously. Inspired by human motor control, we propose Multi-Resolution Skill Discovery (MRSD), an HRL framework that learns multiple skill encoders at different temporal resolutions in parallel. A high-level manager dynamically selects among these skills, enabling adaptive control strategies over time. We evaluate MRSD on tasks from the DeepMind Control Suite and show that it outperforms prior state-of-the-art skill discovery and HRL methods, achieving faster convergence and higher final performance. Our findings highlight the benefits of integrating multi-resolution skills in HRL, paving the way for more versatile and efficient agents.
