Generative Factor Chaining: Coordinated Manipulation with Diffusion-based Factor Graph
Utkarsh A. Mishra, Yongxin Chen, Danfei Xu
TL;DR
Generative Factor Chaining (GFC) tackles long-horizon manipulation planning for multi-arm systems by representing states and task constraints as a spatial-temporal factor graph and learning short-horizon skill factors as diffusion models. The joint plan is obtained by composing spatial constraint factors with temporal skill factors into a plan-level distribution and sampling via reverse diffusion, enabling parallel and dependent chaining. A modular plug-and-play skill library plus external spatial constraints supports zero-shot generalization to unseen task-object combinations. The method is demonstrated in simulation and on real bimanual Franka Panda hardware, showing robust long-horizon planning and coordination with improved performance on complex multi-arm tasks. Formally, the plan distribution is $p(\tau) \propto \big(\prod_{\\pi_k} p_{\\pi_k}(S_{\\pi_k},a_{\\pi_k},S'_{\\pi_k})\big)$, illustrating how factor distributions combine to yield feasible plans for sampling via diffusion-based inference.
Abstract
Learning to plan for multi-step, multi-manipulator tasks is notoriously difficult because of the large search space and the complex constraint satisfaction problems. We present Generative Factor Chaining~(GFC), a composable generative model for planning. GFC represents a planning problem as a spatial-temporal factor graph, where nodes represent objects and robots in the scene, spatial factors capture the distributions of valid relationships among nodes, and temporal factors represent the distributions of skill transitions. Each factor is implemented as a modular diffusion model, which are composed during inference to generate feasible long-horizon plans through bi-directional message passing. We show that GFC can solve complex bimanual manipulation tasks and exhibits strong generalization to unseen planning tasks with novel combinations of objects and constraints. More details can be found at: https://generative-fc.github.io/
