From Demonstrations to Safe Deployment: Path-Consistent Safety Filtering for Diffusion Policies

Ralf Römer; Julian Balletshofer; Jakob Thumm; Marco Pavone; Angela P. Schoellig; Matthias Althoff

From Demonstrations to Safe Deployment: Path-Consistent Safety Filtering for Diffusion Policies

Ralf Römer, Julian Balletshofer, Jakob Thumm, Marco Pavone, Angela P. Schoellig, Matthias Althoff

TL;DR

This work introduces Path-Consistent Safety Filtering (PACS) to safely deploy diffusion policies in dynamic environments by translating action chunks into intended trajectories and verifying safety with set-based reachability while staying on the policy's path. PACS provides formal safety guarantees at real-time rates (≈1 kHz) and maintains task performance, outperforming reactive safety mechanisms such as control barrier functions in both simulation (up to 68% higher task success) and real-world human-robot interaction tasks (up to 37% higher in Sorting). The method relies on an intermediate trajectory planning module that aligns safety checks with the policy’s action chunks, enabling fine-grained, path-consistent braking and failsafe behavior without leaving the learned data distribution. Experimental results on Robomimic benchmarks and real hardware demonstrate PACS’s effectiveness in preventing unsafe states while preserving high task success across three HRI scenarios (Sorting, Handover, Feeding), illustrating practical impact for safety-critical robotics deployment.

Abstract

Diffusion policies (DPs) achieve state-of-the-art performance on complex manipulation tasks by learning from large-scale demonstration datasets, often spanning multiple embodiments and environments. However, they cannot guarantee safe behavior, so external safety mechanisms are needed. These, however, alter actions in ways unseen during training, causing unpredictable behavior and performance degradation. To address these problems, we propose path-consistent safety filtering (PACS) for DPs. Our approach performs path-consistent braking on a trajectory computed from the sequence of generated actions. In this way, we keep execution consistent with the policy's training distribution, maintaining the learned, task-completing behavior. To enable a real-time deployment and handle uncertainties, we verify safety using set-based reachability analysis. Our experimental evaluation in simulation and on three challenging real-world human-robot interaction tasks shows that PACS (a) provides formal safety guarantees in dynamic environments, (b) preserves task success rates, and (c) outperforms reactive safety approaches, such as control barrier functions, by up to 68% in terms of task success. Videos are available at our project website: https://tum-lsy.github.io/pacs/.

From Demonstrations to Safe Deployment: Path-Consistent Safety Filtering for Diffusion Policies

TL;DR

Abstract

From Demonstrations to Safe Deployment: Path-Consistent Safety Filtering for Diffusion Policies

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)