Diff-DAgger: Uncertainty Estimation with Diffusion Policy for Robotic Manipulation

Sung-Wook Lee; Xuhui Kang; Yen-Ling Kuo

Diff-DAgger: Uncertainty Estimation with Diffusion Policy for Robotic Manipulation

Sung-Wook Lee, Xuhui Kang, Yen-Ling Kuo

TL;DR

This paper addresses the out-of-distribution failures and multimodal challenges of diffusion-policy-based imitation learning in robotics. It proposes Diff-DAgger, a robot-gated DAgger algorithm that uses diffusion-loss as the uncertainty signal to trigger expert interventions, avoiding reliance on ensemble disagreements. Across stacking, pushing, plugging, and real-world tasks, Diff-DAgger achieves a 39.0% improvement in task-failure prediction, a 20.6% gain in task completion, and up to 7.8x reductions in wall-clock time compared to baselines. The work demonstrates that an expressive diffusion policy can be efficiently integrated into interactive learning, enabling scalable, data-hungry policies to be effectively employed in robotic manipulation settings.

Abstract

Recently, diffusion policy has shown impressive results in handling multi-modal tasks in robotic manipulation. However, it has fundamental limitations in out-of-distribution failures that persist due to compounding errors and its limited capability to extrapolate. One way to address these limitations is robot-gated DAgger, an interactive imitation learning with a robot query system to actively seek expert help during policy rollout. While robot-gated DAgger has high potential for learning at scale, existing methods like Ensemble-DAgger struggle with highly expressive policies: They often misinterpret policy disagreements as uncertainty at multi-modal decision points. To address this problem, we introduce Diff-DAgger, an efficient robot-gated DAgger algorithm that leverages the training objective of diffusion policy. We evaluate Diff-DAgger across different robot tasks including stacking, pushing, and plugging, and show that Diff-DAgger improves the task failure prediction by 39.0%, the task completion rate by 20.6%, and reduces the wall-clock time by a factor of 7.8. We hope that this work opens up a path for efficiently incorporating expressive yet data-hungry policies into interactive robot learning settings. The project website is available at: https://diffdagger.github.io.

Diff-DAgger: Uncertainty Estimation with Diffusion Policy for Robotic Manipulation

TL;DR

Abstract

Diff-DAgger: Uncertainty Estimation with Diffusion Policy for Robotic Manipulation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)