Grasping Partially Occluded Objects Using Autoencoder-Based Point Cloud Inpainting
Alexander Koebler, Ralf Gross, Florian Buettner, Ingo Thon
TL;DR
This work tackles the problem of partial occlusion in robotic grasping by introducing an autoencoder-based point-cloud inpainting pipeline that reconstructs occluded object geometry from single-view scans. It converts unordered 3D data into equidistant depth images to leverage CNN-based segmentation and inpainting, with a loss function that blends pixel, perceptual, and style cues (PSBL). Training relies solely on synthetic data from a digital twin, enabling deployment without disrupting real production, and a real-world deployment achieves 76% successful grasping in occluded scenarios, substantially reducing discarded parts. The approach preserves key geometric features critical for subsequent surface-based 3D matching, enabling existing grasp planners to operate robustly under occlusion and offering practical benefits for industrial manufacturing workflows.
Abstract
Flexible industrial production systems will play a central role in the future of manufacturing due to higher product individualization and customization. A key component in such systems is the robotic grasping of known or unknown objects in random positions. Real-world applications often come with challenges that might not be considered in grasping solutions tested in simulation or lab settings. Partial occlusion of the target object is the most prominent. Examples of occlusion can be supporting structures in the camera's field of view, sensor imprecision, or parts occluding each other due to the production process. In all these cases, the resulting lack of information leads to shortcomings in calculating grasping points. In this paper, we present an algorithm to reconstruct the missing information. Our inpainting solution facilitates the real-world utilization of robust object matching approaches for grasping point calculation. We demonstrate the benefit of our solution by enabling an existing grasping system embedded in a real-world industrial application to handle occlusions in the input. With our solution, we drastically decrease the number of objects discarded by the process.
