Subassembly to Full Assembly: Effective Assembly Sequence Planning through Graph-based Reinforcement Learning
Chang Shu, Anton Kim, Shinkyu Park
TL;DR
The paper tackles the combinatorial complexity of robotic assembly sequence planning by formulating ASP as a graph-based Markov decision process and introducing Subassembly to Assembly (S2A), a graph neural network–driven reinforcement learning framework. It uses a volumetric voxel encoding of subassemblies and a GATv2-based encoder to predict Q-values over admissible edges, with a delayed reward strategy that rewards only actions leading toward a complete assembly, guided by a Double Deep Q-Network. The approach integrates physics-based feasibility checks via $\mathrm{RRT}^*$ and PyBullet during training and inference, enabling reliable planning under kinematic and gravity constraints. Empirical results show that S2A outperforms baselines across problem sizes, generalizes to unseen instances, and demonstrates a real-world robotic demonstration, indicating potential for scalable, automated ASP in complex assemblies.
Abstract
This paper proposes an assembly sequence planning framework, named Subassembly to Assembly (S2A). The framework is designed to enable a robotic manipulator to assemble multiple parts in a prespecified structure by leveraging object manipulation actions. The primary technical challenge lies in the exponentially increasing complexity of identifying a feasible assembly sequence as the number of parts grows. To address this, we introduce a graph-based reinforcement learning approach, where a graph attention network is trained using a delayed reward assignment strategy. In this strategy, rewards are assigned only when an assembly action contributes to the successful completion of the assembly task. We validate the framework's performance through physics-based simulations, comparing it against various baselines to emphasize the significance of the proposed reward assignment approach. Additionally, we demonstrate the feasibility of deploying our framework in a real-world robotic assembly scenario.
