Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

Zhi Cen; Huaijin Pi; Sida Peng; Qing Shuai; Yujun Shen; Hujun Bao; Xiaowei Zhou; Ruizhen Hu

Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

Zhi Cen, Huaijin Pi, Sida Peng, Qing Shuai, Yujun Shen, Hujun Bao, Xiaowei Zhou, Ruizhen Hu

TL;DR

This paper introduces Ready-to-React, an online reaction policy that enables two independently acting characters to interact in real time. It combines a VQ-VAE-based latent space with a transformer-conditioned diffusion predictor and an online decoder to generate next poses streaming from observed histories, mitigating error accumulation. Evaluated on a boxing dataset (DuoBox), the method outperforms baselines in reactive and two-character generation, including long sequences, and supports sparse, controllable inputs for VR applications. The approach advances online, interactive motion generation with practical implications for robotics, gaming, and immersive environments.

Abstract

This paper addresses the task of generating two-character online interactions. Previously, two main settings existed for two-character interaction generation: (1) generating one's motions based on the counterpart's complete motion sequence, and (2) jointly generating two-character motions based on specific conditions. We argue that these settings fail to model the process of real-life two-character interactions, where humans will react to their counterparts in real time and act as independent individuals. In contrast, we propose an online reaction policy, called Ready-to-React, to generate the next character pose based on past observed motions. Each character has its own reaction policy as its "brain", enabling them to interact like real humans in a streaming manner. Our policy is implemented by incorporating a diffusion head into an auto-regressive model, which can dynamically respond to the counterpart's motions while effectively mitigating the error accumulation throughout the generation process. We conduct comprehensive experiments using the challenging boxing task. Experimental results demonstrate that our method outperforms existing baselines and can generate extended motion sequences. Additionally, we show that our approach can be controlled by sparse signals, making it well-suited for VR and other online interactive environments.

Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

TL;DR

Abstract

Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)