PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness
Anh-Quan Cao, Angela Dai, Raoul de Charette
TL;DR
PaSCo addresses Panoptic Scene Completion (PSC), extending Semantic Scene Completion (SSC) by predicting geometry, semantics, and instance IDs from sparse 3D inputs while providing calibrated uncertainty. It introduces a MIMO-inspired, single-pass ensemble built on a sparse CNN–Transformer backbone with a multiscale generator and a mask-based decoder to output a set of masks $(m_k,c_k)$ for $k=1\dots K$, using non-empty voxels for efficiency; final PSC output is obtained via a permutation-invariant mask ensembling using Hungarian matching, enabling both voxel- and instance-level uncertainty estimation. The method demonstrates state-of-the-art PSC and uncertainty performance on three urban LiDAR datasets and shows robustness to distribution shifts, with code and data released publicly. PaSCo’s combination of multiscale geometric guidance, mask-based predictions, and uncertainty-aware MIMO ensembling represents a practical step toward reliable, complete 3D scene understanding for robotics and autonomous driving.
Abstract
We propose the task of Panoptic Scene Completion (PSC) which extends the recently popular Semantic Scene Completion (SSC) task with instance-level information to produce a richer understanding of the 3D scene. Our PSC proposal utilizes a hybrid mask-based technique on the non-empty voxels from sparse multi-scale completions. Whereas the SSC literature overlooks uncertainty which is critical for robotics applications, we instead propose an efficient ensembling to estimate both voxel-wise and instance-wise uncertainties along PSC. This is achieved by building on a multi-input multi-output (MIMO) strategy, while improving performance and yielding better uncertainty for little additional compute. Additionally, we introduce a technique to aggregate permutation-invariant mask predictions. Our experiments demonstrate that our method surpasses all baselines in both Panoptic Scene Completion and uncertainty estimation on three large-scale autonomous driving datasets. Our code and data are available at https://astra-vision.github.io/PaSCo .
