Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

Haochen Wang; Zhiwei Shi; Chengxi Zhu; Yafei Qiao; Cheng Zhang; Fan Yang; Pengjie Ren; Lan Lu; Dong Xuan

Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

Haochen Wang, Zhiwei Shi, Chengxi Zhu, Yafei Qiao, Cheng Zhang, Fan Yang, Pengjie Ren, Lan Lu, Dong Xuan

TL;DR

Hamlet presents a novel hybrid control framework for agile badminton robots that combines model-based chassis locomotion with learning-based arm manipulation. A physics-informed IL+RL training pipeline uses a privileged model to guide imitation learning and subsequent reinforcement learning, with a critic warmed up during IL to prevent performance drops. Real-world experiments show high success against a serving machine (94.5%) and humans (90.7%), plus zero-shot transfer across different chassis without arm retraining. This approach addresses sim-to-real gaps, enables safe exploration, and generalizes to other agile mobile manipulation tasks such as high-speed catching.

Abstract

Learning-based methods, such as imitation learning (IL) and reinforcement learning (RL), can produce excel control policies over challenging agile robot tasks, such as sports robot. However, no existing work has harmonized learning-based policy with model-based methods to reduce training complexity and ensure the safety and stability for agile badminton robot control. In this paper, we introduce Hamlet, a novel hybrid control system for agile badminton robots. Specifically, we propose a model-based strategy for chassis locomotion which provides a base for arm policy. We introduce a physics-informed "IL+RL" training framework for learning-based arm policy. In this train framework, a model-based strategy with privileged information is used to guide arm policy training during both IL and RL phases. In addition, we train the critic model during IL phase to alleviate the performance drop issue when transitioning from IL to RL. We present results on our self-engineered badminton robot, achieving 94.5% success rate against the serving machine and 90.7% success rate against human players. Our system can be easily generalized to other agile mobile manipulation tasks such as agile catching and table tennis. Our project website: https://dreamstarring.github.io/HAMLET/.

Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

TL;DR

Abstract

Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)