Multi-Task Reinforcement Learning for Quadrotors

Jiaxu Xing; Ismail Geles; Yunlong Song; Elie Aljalbout; Davide Scaramuzza

Multi-Task Reinforcement Learning for Quadrotors

Jiaxu Xing, Ismail Geles, Yunlong Song, Elie Aljalbout, Davide Scaramuzza

TL;DR

The paper tackles the challenge of creating a generalist quadrotor controller capable of multiple tasks without retraining. It introduces a multi-task reinforcement learning framework that shares information through a dynamics-aware encoder and uses a multi-critic setup to handle task-specific rewards. The approach enables a single policy to perform high-speed stabilization, autonomous racing, and velocity tracking, validated in both simulation (Flightmare) and real-world flights. Results show improved sample efficiency and robust cross-task performance compared with single-task baselines, marking a step toward versatile, real-world quadrotor systems.

Abstract

Reinforcement learning (RL) has shown great effectiveness in quadrotor control, enabling specialized policies to develop even human-champion-level performance in single-task scenarios. However, these specialized policies often struggle with novel tasks, requiring a complete retraining of the policy from scratch. To address this limitation, this paper presents a novel multi-task reinforcement learning (MTRL) framework tailored for quadrotor control, leveraging the shared physical dynamics of the platform to enhance sample efficiency and task performance. By employing a multi-critic architecture and shared task encoders, our framework facilitates knowledge transfer across tasks, enabling a single policy to execute diverse maneuvers, including high-speed stabilization, velocity tracking, and autonomous racing. Our experimental results, validated both in simulation and real-world scenarios, demonstrate that our framework outperforms baseline approaches in terms of sample efficiency and overall task performance.

Multi-Task Reinforcement Learning for Quadrotors

TL;DR

Abstract

Multi-Task Reinforcement Learning for Quadrotors

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)