Learning Dexterous Bimanual Catch Skills through Adversarial-Cooperative Heterogeneous-Agent Reinforcement Learning

Taewoo Kim; Youngwoo Yoon; Jaehong Kim

Learning Dexterous Bimanual Catch Skills through Adversarial-Cooperative Heterogeneous-Agent Reinforcement Learning

Taewoo Kim, Youngwoo Yoon, Jaehong Kim

TL;DR

The paper tackles dexterous bimanual catching by introducing a heterogeneous-agent reinforcement learning framework with an adversarial-cooperative reward between a virtual thrower and a two‑handed catcher. The method combines centralized training with decentralized execution (CTDE) using a HAPPO objective and a compound reward $r_{\text{total}} = [t] \alpha r_{\text{catch}} + (1-\alpha) r_{\text{throw}}$, enabling adaptive throwing difficulty and robust catching across 15 object types. Key contributions include the first integration of bimanual catching within HARL, a novel adversarial-cooperative learning dynamic, and extensive simulation validation showing approximately a twofold improvement over single-agent baselines. The approach yields an implicit curriculum that enhances learning efficiency and generalization, though real-world deployment and broader object diversity remain avenues for future work.

Abstract

Robotic catching has traditionally focused on single-handed systems, which are limited in their ability to handle larger or more complex objects. In contrast, bimanual catching offers significant potential for improved dexterity and object handling but introduces new challenges in coordination and control. In this paper, we propose a novel framework for learning dexterous bimanual catching skills using Heterogeneous-Agent Reinforcement Learning (HARL). Our approach introduces an adversarial reward scheme, where a throw agent increases the difficulty of throws-adjusting speed-while a catch agent learns to coordinate both hands to catch objects under these evolving conditions. We evaluate the framework in simulated environments using 15 different objects, demonstrating robustness and versatility in handling diverse objects. Our method achieved approximately a 2x increase in catching reward compared to single-agent baselines across 15 diverse objects.

Learning Dexterous Bimanual Catch Skills through Adversarial-Cooperative Heterogeneous-Agent Reinforcement Learning

TL;DR

Abstract

Learning Dexterous Bimanual Catch Skills through Adversarial-Cooperative Heterogeneous-Agent Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)