Residual Primitive Fitting of 3D Shapes with SuperFrusta
Aditya Ganeshan, Matheus Gadelha, Thibault Groueix, Zhiqin Chen, Siddhartha Chaudhuri, Vladimir Kim, Wang Yifan, Daniel Ritchie
TL;DR
This work tackles the reconstruction-editability gap by introducing SuperFrustum, an expressive, 8-parameter analytic primitive, and Residual Primitive Fitting (ResFit), an iterative procedure that couples global shape analysis with local optimization. By alternating between MSD-based initialization and gradient-based refinement, the method assembles compact, high-fidelity primitive programs that can be edited and reused as assets or CSG models. Across diverse benchmarks, it achieves state-of-the-art IoU while using roughly half the primitives, and enables downstream tasks such as image-to-assembly and semantic segmentation enrichment. Overall, the approach bridges dense 3D data and human-controllable design, enabling editable, interpretable shape programs with strong fidelity.
Abstract
We introduce a framework for converting 3D shapes into compact and editable assemblies of analytic primitives, directly addressing the persistent trade-off between reconstruction fidelity and parsimony. Our approach combines two key contributions: a novel primitive, termed SuperFrustum, and an iterative fiting algorithm, Residual Primitive Fitting (ResFit). SuperFrustum is an analytical primitive that is simultaneously (1) expressive, being able to model various common solids such as cylinders, spheres, cones & their tapered and bent forms, (2) editable, being compactly parameterized with 8 parameters, and (3) optimizable, with a sign distance field differentiable w.r.t. its parameters almost everywhere. ResFit is an unsupervised procedure that interleaves global shape analysis with local optimization, iteratively fitting primitives to the unexplained residual of a shape to discover a parsimonious yet accurate decompositions for each input shape. On diverse 3D benchmarks, our method achieves state-of-the-art results, improving IoU by over 9 points while using nearly half as many primitives as prior work. The resulting assemblies bridge the gap between dense 3D data and human-controllable design, producing high-fidelity and editable shape programs.
