SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes
Zhili Ng, Haozhe Wang, Zhengshen Zhang, Francis Tay Eng Hock, Marcelo H. Ang
TL;DR
SynTable introduces a photorealistic, end-to-end synthetic data generation pipeline built on NVIDIA Isaac Sim to address the lack of labeled UOAIS data and the Sim-to-Real gap in cluttered tabletop scenes. By automatically producing rich ground-truth including modal and amodal masks, occlusion data, depth, and occlusion order graphs, and by synthesizing a large-scale dataset (SynTable-Sim) with 1075 novel objects, the approach enables effective training of state-of-the-art UOAIS models. The work also defines the Occlusion Order Accuracy ($ACC_{OO}$) and associated OOAM/OODG representations to quantify occlusion reasoning, and demonstrates substantial improvements in real-world transfer on OSD-Amodal across multiple architectures. The authors provide open-source tooling and datasets to facilitate replication and broader adoption in robotics and AR applications where occlusion-aware perception is essential.
Abstract
In this work, we present SynTable, a unified and flexible Python-based dataset generator built using NVIDIA's Isaac Sim Replicator Composer for generating high-quality synthetic datasets for unseen object amodal instance segmentation of cluttered tabletop scenes. Our dataset generation tool can render complex 3D scenes containing object meshes, materials, textures, lighting, and backgrounds. Metadata, such as modal and amodal instance segmentation masks, object amodal RGBA instances, occlusion masks, depth maps, bounding boxes, and material properties can be automatically generated to annotate the scene according to the users' requirements. Our tool eliminates the need for manual labeling in the dataset generation process while ensuring the quality and accuracy of the dataset. In this work, we discuss our design goals, framework architecture, and the performance of our tool. We demonstrate the use of a sample dataset generated using SynTable for training a state-of-the-art model, UOAIS-Net. Our state-of-the-art results show significantly improved performance in Sim-to-Real transfer when evaluated on the OSD-Amodal dataset. We offer this tool as an open-source, easy-to-use, photorealistic dataset generator for advancing research in deep learning and synthetic data generation. The links to our source code, demonstration video, and sample dataset can be found in the supplementary materials.
