Task and Domain Adaptive Reinforcement Learning for Robot Control
Yu Tang Liu, Nilaksh Singh, Aamir Ahmad
TL;DR
The paper addresses robustness of deep RL for robot control under changing tasks and environments by introducing an adaptive agent that combines task transfer (via an arbiter over specialized primitives) with domain transfer (via a learned environment state extractor trained through Rapid Motor Adaptation). The core method, Arbiter-SF, leverages successor features to enable zero-shot transfer to unseen tasks and a two-stage training scheme to estimate environment state from interactions, all validated on a parallelized blimp control platform built in IsaacGym and demonstrated in real-world flight. Compared with strong baselines, the approach achieves superior transfer performance and sample efficiency, supported by ablation studies and extensive real-world experiments. The work advances multitask, domain-robust RL in robotics and showcases practical viability for real-time control on embedded hardware with a scalable simulation pipeline.
Abstract
Deep reinforcement learning (DRL) has shown remarkable success in simulation domains, yet its application in designing robot controllers remains limited, due to its single-task orientation and insufficient adaptability to environmental changes. To overcome these limitations, we present a novel adaptive agent that leverages transfer learning techniques to dynamically adapt policy in response to different tasks and environmental conditions. The approach is validated through the blimp control challenge, where multitasking capabilities and environmental adaptability are essential. The agent is trained using a custom, highly parallelized simulator built on IsaacGym. We perform zero-shot transfer to fly the blimp in the real world to solve various tasks. We share our code at https://github.com/robot-perception-group/adaptive_agent.
