Imitation Learning with Additional Constraints on Motion Style using Parametric Bias
Kento Kawaharazuka, Yoichiro Kawamura, Kei Okada, Masayuki Inaba
TL;DR
This work addresses the tendency of imitation learning to converge to a single average motion style by introducing parametric bias (p) within an RNNPB framework to encode and control motion style through soft constraints. By formulating a loss that combines state-prediction with constraints on muscle tension, muscle length velocity, and joint velocity, and by updating $\bm{p}$ either offline or online, the method enables deliberate shaping of motion style while preserving task reproduction. Experimental validation across a simulated tendon arm, PR2, and MusashiLarm demonstrates the ability to modulate velocity and force and to adapt to changing robot configurations, though challenges remain in balancing multiple constraints and extending to non-quantifiable styles. The approach offers a practical pathway to versatile, style-aware imitation in complex robotic systems, with potential for richer multimodal constraints and longer-horizon tasks in future work.
Abstract
Imitation learning is one of the methods for reproducing human demonstration adaptively in robots. So far, it has been found that generalization ability of the imitation learning enables the robots to perform tasks adaptably in untrained environments. However, motion styles such as motion trajectory and the amount of force applied depend largely on the dataset of human demonstration, and settle down to an average motion style. In this study, we propose a method that adds parametric bias to the conventional imitation learning network and can add constraints to the motion style. By experiments using PR2 and the musculoskeletal humanoid MusashiLarm, we show that it is possible to perform tasks by changing its motion style as intended with constraints on joint velocity, muscle length velocity, and muscle tension.
