ADPro: a Test-time Adaptive Diffusion Policy via Manifold-constrained Denoising and Task-aware Initialization for Robotic Manipulation
Zezeng Li, Rui Yang, Ruochen Chen, ZhongXuan Luo, Liming Chen
TL;DR
Diffusion policies for robotic manipulation operate on the $SE(3)$ manifold, but offline training and unconstrained denoising limit generalization and efficiency. We present ADP, a training-free test-time adaptive diffusion policy that adds (i) task-manifold guided denoising along geodesics, (ii) a Gaussian spherical prior to constrain step sizes, and (iii) a task-aware initialization via Fast Global Registration between test-time gripper and scene point clouds. Implemented as ADPro by plugging into a pretrained 3D Diffuser Actor (and optionally CLIP-based prompts), ADP yields up to ~25% faster inference and ~9 percentage-point improvements on RLBench, CALVIN, and RealWP, with strong zero-shot generalization. These results show that geometry-aware, on-the-fly guidance can substantially enhance manipulation performance without retraining, widening the practical applicability of diffusion policies in unstructured environments.
Abstract
Diffusion policies have recently emerged as a powerful class of visuomotor controllers for robot manipulation, offering stable training and expressive multi-modal action modeling. However, existing approaches typically treat action generation as an unconstrained denoising process, ignoring valuable a priori knowledge about geometry and control structure. In this work, we propose the Adaptive Diffusion Policy (ADP), a test-time adaptation method that introduces two key inductive biases into the diffusion. First, we embed a geometric manifold constraint that aligns denoising updates with task-relevant subspaces, leveraging the fact that the relative pose between the end-effector and target scene provides a natural gradient direction, and guiding denoising along the geodesic path of the manipulation manifold. Then, to reduce unnecessary exploration and accelerate convergence, we propose an analytically guided initialization: rather than sampling from an uninformative prior, we compute a rough registration between the gripper and target scenes to propose a structured initial noisy action. ADP is compatible with pre-trained diffusion policies and requires no retraining, enabling test-time adaptation that tailors the policy to specific tasks, thereby enhancing generalization across novel tasks and environments. Experiments on RLBench, CALVIN, and real-world datasets show that ADPro, an implementation of ADP, improves success rates, generalization, and sampling efficiency, achieving up to 25% faster execution and 9% points over strong diffusion baselines.
