Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

Kin Man Lee; Sean Ye; Qingyu Xiao; Zixuan Wu; Zulfiqar Zaidi; David B. D'Ambrosio; Pannag R. Sanketi; Matthew Gombolay

Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

Kin Man Lee, Sean Ye, Qingyu Xiao, Zixuan Wu, Zulfiqar Zaidi, David B. D'Ambrosio, Pannag R. Sanketi, Matthew Gombolay

TL;DR

This work tackles sample-inefficient, constraint-averse learning for agile robotic striking by introducing Kinematic Constraint Gradient Guidance (KCGG), an offline diffusion-based imitation-learning framework. KCGG computes gradients through both the forward kinematics $F(q)$ and the diffusion model to steer samples toward task constraints while preserving the training data distribution, enabling multimodal, high-speed motions from limited demonstrations. Empirical results in simulated AirHockey and real table-tennis setups show that KCGG outperforms baselines, achieving substantial gains in block/success rates and demonstrating robust constraint satisfaction even under tight timing constraints. The approach offers a practical route to deploy constrained, diverse robotic skills without requiring high-fidelity simulators, with broad applicability to dynamic manipulation tasks.

Abstract

Advances in robot learning have enabled robots to generate skills for a variety of tasks. Yet, robot learning is typically sample inefficient, struggles to learn from data sources exhibiting varied behaviors, and does not naturally incorporate constraints. These properties are critical for fast, agile tasks such as playing table tennis. Modern techniques for learning from demonstration improve sample efficiency and scale to diverse data, but are rarely evaluated on agile tasks. In the case of reinforcement learning, achieving good performance requires training on high-fidelity simulators. To overcome these limitations, we develop a novel diffusion modeling approach that is offline, constraint-guided, and expressive of diverse agile behaviors. The key to our approach is a kinematic constraint gradient guidance (KCGG) technique that computes gradients through both the forward kinematics of the robot arm and the diffusion model to direct the sampling process. KCGG minimizes the cost of violating constraints while simultaneously keeping the sampled trajectory in-distribution of the training data. We demonstrate the effectiveness of our approach for time-critical robotic tasks by evaluating KCGG in two challenging domains: simulated air hockey and real table tennis. In simulated air hockey, we achieved a 25.4% increase in block rate, while in table tennis, we saw a 17.3% increase in success rate compared to imitation learning baselines.

Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

TL;DR

and the diffusion model to steer samples toward task constraints while preserving the training data distribution, enabling multimodal, high-speed motions from limited demonstrations. Empirical results in simulated AirHockey and real table-tennis setups show that KCGG outperforms baselines, achieving substantial gains in block/success rates and demonstrating robust constraint satisfaction even under tight timing constraints. The approach offers a practical route to deploy constrained, diverse robotic skills without requiring high-fidelity simulators, with broad applicability to dynamic manipulation tasks.

Abstract

Paper Structure (22 sections, 4 equations, 5 figures, 5 tables, 1 algorithm)

This paper contains 22 sections, 4 equations, 5 figures, 5 tables, 1 algorithm.

Introduction
Related Work
Learning Striking Motions
Diffusion Models
Problem Formulation
Preliminaries: Score-Based Diffusion Models
Methodology
Kinematic Constraint Gradient Guidance
Standard Diffusion Model Sampling Procedure
Constraint Guided Sampling
Kinematic Constraint Gradient Guidance
Sampling Procedure
Results and Discussion
Agile Robotic Domains
Simulated Domain (AirHockey)
...and 7 more sections

Figures (5)

Figure 1: Overview of KCGG: At inference, KCGG computes the cost given the constraint and intermediate trajectory at each denoising step. The gradient with respect to the prior trajectory through the diffusion model is used to update the sample.
Figure 2: Visualization of Diffusion Sampling Process in Air-Hockey Defend: KCGG gradually shifts the conditioned points in the trajectory (shown in green) towards a predicted future puck position (shown as the orange point).
Figure 3: Comparison of Planned Trajectory: KCGG maintains a smooth joint angle profile in the constrained region (dotted lines) while constraint guidance introduces discontinuities.
Figure 4: Sampling Speed vs Performance: We compare the Block Rate in AirHockey Defend across sampling times. KCGG scales to better performance with sampling time.
Figure : Batched Sampling with KCGG

Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

TL;DR

Abstract

Learning Diverse Robot Striking Motions with Diffusion Models and Kinematically Constrained Gradient Guidance

Authors

TL;DR

Abstract

Table of Contents

Figures (5)