DynaFlow: Dynamics-embedded Flow Matching for Physically Consistent Motion Generation from State-only Demonstrations
Sowoo Lee, Dongyun Kang, Jaehyun Park, Hae-Won Park
TL;DR
DynaFlow tackles the challenge of generating physically feasible motion from state-only demonstrations by embedding a differentiable simulator directly into a flow-matching model. By mapping action trajectories through a differentiable rollout, it guarantees dynamic feasibility by construction while learning to infer the underlying actions. The approach is validated with two datasets, including real hardware deployment on a Go1 quadruped, showing that DynaFlow achieves strict physical consistency without sacrificing distributional fidelity. It enables long-horizon, open-loop locomotion and can transform infeasible retargeted demonstrations into dynamically executable, stylistic behaviors, bridging the gap between kinematic data and real-world execution.
Abstract
This paper introduces DynaFlow, a novel framework that embeds a differentiable simulator directly into a flow matching model. By generating trajectories in the action space and mapping them to dynamically feasible state trajectories via the simulator, DynaFlow ensures all outputs are physically consistent by construction. This end-to-end differentiable architecture enables training on state-only demonstrations, allowing the model to simultaneously generate physically consistent state trajectories while inferring the underlying action sequences required to produce them. We demonstrate the effectiveness of our approach through quantitative evaluations and showcase its real-world applicability by deploying the generated actions onto a physical Go1 quadruped robot. The robot successfully reproduces diverse gait present in the dataset, executes long-horizon motions in open-loop control and translates infeasible kinematic demonstrations into dynamically executable, stylistic behaviors. These hardware experiments validate that DynaFlow produces deployable, highly effective motions on real-world hardware from state-only demonstrations, effectively bridging the gap between kinematic data and real-world execution.
