Supervised Representation Learning towards Generalizable Assembly State Recognition
Tim J. Schoonbeek, Goutham Balachandran, Hans Onvlee, Tim Houben, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen
TL;DR
This work reframes assembly state recognition as a representation-learning problem to address scalability and robustness to execution errors, introducing Intermediate-State Informed Loss (ISIL) that uses unlabeled intermediate configurations as negative samples. The authors present a supervised contrastive framework with real, synthetic, and unlabeled data, and demonstrate that ISIL improves clustering and classification across backbones (e.g., ResNet-34, ViT-S) and losses, while enabling generalization to unseen part configurations and unseen error states. Evaluations on the IndustReal dataset show substantial gains in $F_1@1$ and $MAP@R(+)$ over classification baselines, and provide extensive analysis of error-state generalization with new annotations. The approach promises real-time, scalable assembly-state monitoring for industrial settings and sets the stage for further work with real-world data, edge deployment, and weak supervision signals.
Abstract
Assembly state recognition facilitates the execution of assembly procedures, offering feedback to enhance efficiency and minimize errors. However, recognizing assembly states poses challenges in scalability, since parts are frequently updated, and the robustness to execution errors remains underexplored. To address these challenges, this paper proposes an approach based on representation learning and the novel intermediate-state informed loss function modification (ISIL). ISIL leverages unlabeled transitions between states and demonstrates significant improvements in clustering and classification performance for all tested architectures and losses. Despite being trained exclusively on images without execution errors, thorough analysis on error states demonstrates that our approach accurately distinguishes between correct states and states with various types of execution errors. The integration of the proposed algorithm can offer meaningful assistance to workers and mitigate unexpected losses due to procedural mishaps in industrial settings. The code is available at: https://timschoonbeek.github.io/state_rec
