Table of Contents
Fetching ...

Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

Kin Man Lee, Sean Ye, Qingyu Xiao, Zixuan Wu, Zulfiqar Zaidi, David B. D'Ambrosio, Pannag R. Sanketi, Matthew Gombolay

TL;DR

This work tackles sample-inefficient, constraint-averse learning for agile robotic striking by introducing Kinematic Constraint Gradient Guidance (KCGG), an offline diffusion-based imitation-learning framework. KCGG computes gradients through both the forward kinematics $F(q)$ and the diffusion model to steer samples toward task constraints while preserving the training data distribution, enabling multimodal, high-speed motions from limited demonstrations. Empirical results in simulated AirHockey and real table-tennis setups show that KCGG outperforms baselines, achieving substantial gains in block/success rates and demonstrating robust constraint satisfaction even under tight timing constraints. The approach offers a practical route to deploy constrained, diverse robotic skills without requiring high-fidelity simulators, with broad applicability to dynamic manipulation tasks.

Abstract

Advances in robot learning have enabled robots to generate skills for a variety of tasks. Yet, robot learning is typically sample inefficient, struggles to learn from data sources exhibiting varied behaviors, and does not naturally incorporate constraints. These properties are critical for fast, agile tasks such as playing table tennis. Modern techniques for learning from demonstration improve sample efficiency and scale to diverse data, but are rarely evaluated on agile tasks. In the case of reinforcement learning, achieving good performance requires training on high-fidelity simulators. To overcome these limitations, we develop a novel diffusion modeling approach that is offline, constraint-guided, and expressive of diverse agile behaviors. The key to our approach is a kinematic constraint gradient guidance (KCGG) technique that computes gradients through both the forward kinematics of the robot arm and the diffusion model to direct the sampling process. KCGG minimizes the cost of violating constraints while simultaneously keeping the sampled trajectory in-distribution of the training data. We demonstrate the effectiveness of our approach for time-critical robotic tasks by evaluating KCGG in two challenging domains: simulated air hockey and real table tennis. In simulated air hockey, we achieved a 25.4% increase in block rate, while in table tennis, we saw a 17.3% increase in success rate compared to imitation learning baselines.

Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

TL;DR

This work tackles sample-inefficient, constraint-averse learning for agile robotic striking by introducing Kinematic Constraint Gradient Guidance (KCGG), an offline diffusion-based imitation-learning framework. KCGG computes gradients through both the forward kinematics and the diffusion model to steer samples toward task constraints while preserving the training data distribution, enabling multimodal, high-speed motions from limited demonstrations. Empirical results in simulated AirHockey and real table-tennis setups show that KCGG outperforms baselines, achieving substantial gains in block/success rates and demonstrating robust constraint satisfaction even under tight timing constraints. The approach offers a practical route to deploy constrained, diverse robotic skills without requiring high-fidelity simulators, with broad applicability to dynamic manipulation tasks.

Abstract

Advances in robot learning have enabled robots to generate skills for a variety of tasks. Yet, robot learning is typically sample inefficient, struggles to learn from data sources exhibiting varied behaviors, and does not naturally incorporate constraints. These properties are critical for fast, agile tasks such as playing table tennis. Modern techniques for learning from demonstration improve sample efficiency and scale to diverse data, but are rarely evaluated on agile tasks. In the case of reinforcement learning, achieving good performance requires training on high-fidelity simulators. To overcome these limitations, we develop a novel diffusion modeling approach that is offline, constraint-guided, and expressive of diverse agile behaviors. The key to our approach is a kinematic constraint gradient guidance (KCGG) technique that computes gradients through both the forward kinematics of the robot arm and the diffusion model to direct the sampling process. KCGG minimizes the cost of violating constraints while simultaneously keeping the sampled trajectory in-distribution of the training data. We demonstrate the effectiveness of our approach for time-critical robotic tasks by evaluating KCGG in two challenging domains: simulated air hockey and real table tennis. In simulated air hockey, we achieved a 25.4% increase in block rate, while in table tennis, we saw a 17.3% increase in success rate compared to imitation learning baselines.
Paper Structure (22 sections, 4 equations, 5 figures, 5 tables, 1 algorithm)

This paper contains 22 sections, 4 equations, 5 figures, 5 tables, 1 algorithm.

Figures (5)

  • Figure 1: Overview of KCGG: At inference, KCGG computes the cost given the constraint and intermediate trajectory at each denoising step. The gradient with respect to the prior trajectory through the diffusion model is used to update the sample.
  • Figure 2: Visualization of Diffusion Sampling Process in Air-Hockey Defend: KCGG gradually shifts the conditioned points in the trajectory (shown in green) towards a predicted future puck position (shown as the orange point).
  • Figure 3: Comparison of Planned Trajectory: KCGG maintains a smooth joint angle profile in the constrained region (dotted lines) while constraint guidance introduces discontinuities.
  • Figure 4: Sampling Speed vs Performance: We compare the Block Rate in AirHockey Defend across sampling times. KCGG scales to better performance with sampling time.
  • Figure : Batched Sampling with KCGG