FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation

Qinglun Zhang; Zhen Liu; Haoqiang Fan; Guanghui Liu; Bing Zeng; Shuaicheng Liu

FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation

Qinglun Zhang, Zhen Liu, Haoqiang Fan, Guanghui Liu, Bing Zeng, Shuaicheng Liu

TL;DR

FlowPolicy tackles the efficiency bottleneck of diffusion-based imitation learning for 3D robot manipulation by casting policy generation as a conditional consistency flow matching problem. It conditions on 3D point-cloud observations and learns velocity-consistent straight-line flows to enable one-step action decoding in real time. Across 37 tasks on Adroit and Metaworld, FlowPolicy achieves substantial runtime reductions while maintaining competitive success rates, highlighting the practical potential of conditional flow-based policies for real-time robotics. This work broadens the applicability of 3D-vision-based imitation learning to real-world, real-time manipulation scenarios.

Abstract

Robots can acquire complex manipulation skills by learning policies from expert demonstrations, which is often known as vision-based imitation learning. Generating policies based on diffusion and flow matching models has been shown to be effective, particularly in robotic manipulation tasks. However, recursion-based approaches are inference inefficient in working from noise distributions to policy distributions, posing a challenging trade-off between efficiency and quality. This motivates us to propose FlowPolicy, a novel framework for fast policy generation based on consistency flow matching and 3D vision. Our approach refines the flow dynamics by normalizing the self-consistency of the velocity field, enabling the model to derive task execution policies in a single inference step. Specifically, FlowPolicy conditions on the observed 3D point cloud, where consistency flow matching directly defines straight-line flows from different time states to the same action space, while simultaneously constraining their velocity values, that is, we approximate the trajectories from noise to robot actions by normalizing the self-consistency of the velocity field within the action space, thus improving the inference efficiency. We validate the effectiveness of FlowPolicy in Adroit and Metaworld, demonstrating a 7$\times$ increase in inference speed while maintaining competitive average success rates compared to state-of-the-art methods. Code is available at https://github.com/zql-kk/FlowPolicy.

FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation

TL;DR

Abstract

FlowPolicy: Enabling Fast and Robust 3D Flow-based Policy via Consistency Flow Matching for Robot Manipulation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)