Graceful task adaptation with a bi-hemispheric RL agent
Grant Nicholas, Levin Kuhlmann, Gideon Kowadlo
TL;DR
This work introduces a bi-hemispheric reinforcement learning agent inspired by the Novelty-Routine Hypothesis, assigning a generalist right-hemisphere and a specialist left-hemisphere to tackle novel tasks. A gating network dynamically allocates responsibility between hemispheres, with a loss term that encourages shifting control toward the left as tasks are learned and right-hemisphere competency grows, all under a GRU-based architecture. The right hemisphere is meta-trained with RL$^2$ to promote generalisation, while the left hemisphere learns task-specific policies; experiments on Meta-world show improved initial performance on some tasks when the right hemisphere is competent, but results are variable across tasks and novelty levels. The study discusses improvements to meta-training, potential decoupling of gating from left-hemisphere training, and extensions toward continual learning, highlighting the approach’s potential to enable rapid adaptation with a structured, neuroscience-inspired inductive bias.
Abstract
In humans, responsibility for performing a task gradually shifts from the right hemisphere to the left. The Novelty-Routine Hypothesis (NRH) states that the right and left hemispheres are used to perform novel and routine tasks respectively, enabling us to learn a diverse range of novel tasks while performing the task capably. Drawing on the NRH, we develop a reinforcement learning agent with specialised hemispheres that can exploit generalist knowledge from the right-hemisphere to avoid poor initial performance on novel tasks. In addition, we find that this design has minimal impact on its ability to learn novel tasks. We conclude by identifying improvements to our agent and exploring potential expansion to the continual learning setting.
