RMG: Real-Time Expressive Motion Generation with Self-collision Avoidance for 6-DOF Companion Robotic Arms
Jiansheng Li, Haotian Song, Jinni Zhou, Qiang Nie, Yi Cai
TL;DR
This work presents RMG, a real-time expressive motion generation framework for 6-DOF robotic arms that leverages a dance-derived expressive motion dataset, diffusion-based trajectory generation in both joint and Cartesian spaces, and a PSO-based collision-avoidance optimization. By transforming human dance data into robot motions, training joint-space and Cartesian-space diffusion models, and post-processing with a collision-free optimizer, the approach achieves expressive, smooth trajectories under real-time constraints (under roughly 0.5 seconds). The joint-space model outperforms the Cartesian-space model in planning accuracy, expressivity, and robustness, and the system is demonstrated on a low-cost Mycobot 280 with a gesture-based interaction scenario. Limitations include overshoot and speed-control challenges, with future work aiming to endow the robot with autonomous emotional reasoning to select endpoints and durations contextually.
Abstract
The six-degree-of-freedom (6-DOF) robotic arm has gained widespread application in human-coexisting environments. While previous research has predominantly focused on functional motion generation, the critical aspect of expressive motion in human-robot interaction remains largely unexplored. This paper presents a novel real-time motion generation planner that enhances interactivity by creating expressive robotic motions between arbitrary start and end states within predefined time constraints. Our approach involves three key contributions: first, we develop a mapping algorithm to construct an expressive motion dataset derived from human dance movements; second, we train motion generation models in both Cartesian and joint spaces using this dataset; third, we introduce an optimization algorithm that guarantees smooth, collision-free motion while maintaining the intended expressive style. Experimental results demonstrate the effectiveness of our method, which can generate expressive and generalized motions in under 0.5 seconds while satisfying all specified constraints.
