Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems

Boyi Liu; Lujia Wang; Ming Liu

Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems

Boyi Liu, Lujia Wang, Ming Liu

TL;DR

This work addresses enabling robots to accumulate and reuse experience across environments in cloud robotics. It introduces Lifelong Federated Reinforcement Learning (LFRL), combining a cloud-based knowledge fusion algorithm with two transfer-learning strategies to fuse private models into a powerful shared model and transfer it back to robots. Through simulations and real-world Turtlebot3 experiments, the approach reduces training time while improving navigation performance and generalization to new environments. A cloud navigation-learning website is released to demonstrate practical deployment and service provision for cloud robotic systems.

Abstract

This paper was motivated by the problem of how to make robots fuse and transfer their experience so that they can effectively use prior knowledge and quickly adapt to new environments. To address the problem, we present a learning architecture for navigation in cloud robotic systems: Lifelong Federated Reinforcement Learning (LFRL). In the work, We propose a knowledge fusion algorithm for upgrading a shared model deployed on the cloud. Then, effective transfer learning methods in LFRL are introduced. LFRL is consistent with human cognitive science and fits well in cloud robotic systems. Experiments show that LFRL greatly improves the efficiency of reinforcement learning for robot navigation. The cloud robotic system deployment also shows that LFRL is capable of fusing prior knowledge. In addition, we release a cloud robotic navigation-learning website based on LFRL.

Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems

TL;DR

Abstract

Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)