FoldPath: End-to-End Object-Centric Motion Generation via Modulated Implicit Paths
Paolo Rabino, Gabriele Tiboni, Tatiana Tommasi
TL;DR
FoldPath tackles Object-Centric Motion Generation by predicting long-horizon, smooth robot paths directly from 3D point clouds using a neural-field representation. The method abandons disjoint end-effector waypoint sequences and brittle post-processing in favor of end-to-end path prototypes decoded by modulated MLP heads, with path execution sampled via $s_t\in[-1,1]$. A DTW-based Average Precision metric suite is proposed to evaluate curve-aware path quality, and FoldPath achieves state-of-the-art performance on the PaintNet benchmark, including scenarios with limited training samples ($ ext{as few as }$ $70$) and real-world containers, demonstrating practical robustness and readiness for deployment. The results indicate FoldPath can generalize across free-form geometries and complex surface layouts, representing a meaningful advance toward scalable, industrially viable OCMG for spray painting and related robotic tasks.
Abstract
Object-Centric Motion Generation (OCMG) is instrumental in advancing automated manufacturing processes, particularly in domains requiring high-precision expert robotic motions, such as spray painting and welding. To realize effective automation, robust algorithms are essential for generating extended, object-aware trajectories across intricate 3D geometries. However, contemporary OCMG techniques are either based on ad-hoc heuristics or employ learning-based pipelines that are still reliant on sensitive post-processing steps to generate executable paths. We introduce FoldPath, a novel, end-to-end, neural field based method for OCMG. Unlike prior deep learning approaches that predict discrete sequences of end-effector waypoints, FoldPath learns the robot motion as a continuous function, thus implicitly encoding smooth output paths. This paradigm shift eliminates the need for brittle post-processing steps that concatenate and order the predicted discrete waypoints. Particularly, our approach demonstrates superior predictive performance compared to recently proposed learning-based methods, and attains generalization capabilities even in real industrial settings, where only a limited amount of 70 expert samples are provided. We validate FoldPath through comprehensive experiments in a realistic simulation environment and introduce new, rigorous metrics designed to comprehensively evaluate long-horizon robotic paths, thus advancing the OCMG task towards practical maturity.
