SuperDec: 3D Scene Decomposition with Superquadric Primitives
Elisabetta Fedele, Boyang Sun, Leonidas Guibas, Marc Pollefeys, Francis Engelmann
TL;DR
SuperDec introduces a locality-driven 3D scene decomposition that represents arbitrary scenes as a compact set of superquadrics. A two-stage pipeline—a Transformer-based feed-forward network predicting per-object superquadrics and a Levenberg–Marquardt refinement—enables accurate, parsimonious decomposition, which is scalable to full scenes via Mask3D. The approach achieves state-of-the-art object-level decomposition on ShapeNet and generalizes to real datasets (ScanNet++ and Replica) without fine-tuning, while supporting robotics tasks and controllable image generation. This compact, interpretable representation enables efficient planning, grasping, and geometry-guided editing, signaling a practical path toward geometry-aware 3D scene understanding.
Abstract
We present SuperDec, an approach for creating compact 3D scene representations via decomposition into superquadric primitives. While most recent works leverage geometric primitives to obtain photorealistic 3D scene representations, we propose to leverage them to obtain a compact yet expressive representation. We propose to solve the problem locally on individual objects and leverage the capabilities of instance segmentation methods to scale our solution to full 3D scenes. In doing that, we design a new architecture which efficiently decompose point clouds of arbitrary objects in a compact set of superquadrics. We train our architecture on ShapeNet and we prove its generalization capabilities on object instances extracted from the ScanNet++ dataset as well as on full Replica scenes. Finally, we show how a compact representation based on superquadrics can be useful for a diverse range of downstream applications, including robotic tasks and controllable visual content generation and editing.
