BaB-ND: Long-Horizon Motion Planning with Branch-and-Bound and Neural Dynamics
Keyi Shen, Jiangwei Yu, Jose Barreiros, Huan Zhang, Yunzhu Li
TL;DR
BaB-ND tackles the challenge of planning long trajectories with non-linear neural dynamics by introducing a GPU-accelerated branch-and-bound framework that decomposes the action space into subdomains, uses a modified bound propagation mechanism inspired by alpha-beta-CROWN to bound subproblems, and incorporates a search component to find high-quality feasible action sequences. Distinguishing itself from verification-only approaches, BaB-ND focuses on producing near-optimal trajectories for complex, contact-rich manipulation tasks and scales to large neural dynamics models and architectures, including MLPs and GNNs. The method demonstrates superior open-loop planning performance and improved real-world control across non-prehensile pushing with obstacles, object merging, rope routing, and object sorting, while outperforming standard baselines and MIP-based planners in scalability. Key contributions include (1) a general BaB-based framework for long-horizon planning over neural dynamics, (2) novel branching, bounding, and searching components adapted from neural network verification, and (3) demonstrations of applicability and scalability to deformable objects and graph-based dynamics. The results suggest BaB-ND offers a principled, scalable alternative for planning with learned dynamics in real-world manipulation.
Abstract
Neural-network-based dynamics models learned from observational data have shown strong predictive capabilities for scene dynamics in robotic manipulation tasks. However, their inherent non-linearity presents significant challenges for effective planning. Current planning methods, often dependent on extensive sampling or local gradient descent, struggle with long-horizon motion planning tasks involving complex contact events. In this paper, we present a GPU-accelerated branch-and-bound (BaB) framework for motion planning in manipulation tasks that require trajectory optimization over neural dynamics models. Our approach employs a specialized branching heuristics to divide the search space into subdomains, and applies a modified bound propagation method, inspired by the state-of-the-art neural network verifier alpha-beta-CROWN, to efficiently estimate objective bounds within these subdomains. The branching process guides planning effectively, while the bounding process strategically reduces the search space. Our framework achieves superior planning performance, generating high-quality state-action trajectories and surpassing existing methods in challenging, contact-rich manipulation tasks such as non-prehensile planar pushing with obstacles, object sorting, and rope routing in both simulated and real-world settings. Furthermore, our framework supports various neural network architectures, ranging from simple multilayer perceptrons to advanced graph neural dynamics models, and scales efficiently with different model sizes.
