Generative Predictive Control: Flow Matching Policies for Dynamic and Difficult-to-Demonstrate Tasks
Vince Kurtz, Joel W. Burdick
TL;DR
This work tackles the challenge of controlling fast, nonlinear robotic dynamics without relying on extensive expert demonstrations. It introduces Generative Predictive Control (GPC), which learns a flow-matching policy to emulate the sampling-based predictive control (SPC) target distribution, using cycles of SPC data collection to train the policy and improve subsequent SPC samples. The approach yields high-frequency, temporally-consistent control and demonstrates robustness through risk-aware domain randomization, while exposing scalability limits on very large systems like humanoid standup. Collectively, GPC provides a principled, supervised-learning-based path toward generalist, fast-reacting policies that leverage both generative modeling and predictive control concepts.
Abstract
Generative control policies have recently unlocked major progress in robotics. These methods produce action sequences via diffusion or flow matching, with training data provided by demonstrations. But existing methods come with two key limitations: they require expert demonstrations, which can be difficult to obtain, and they are limited to relatively slow, quasi-static tasks. In this paper, we leverage a tight connection between sampling-based predictive control and generative modeling to address each of these issues. In particular, we introduce generative predictive control, a supervised learning framework for tasks with fast dynamics that are easy to simulate but difficult to demonstrate. We then show how trained flow-matching policies can be warm-started at inference time, maintaining temporal consistency and enabling high-frequency feedback. We believe that generative predictive control offers a complementary approach to existing behavior cloning methods, and hope that it paves the way toward generalist policies that extend beyond quasi-static demonstration-oriented tasks.
