DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation
Jeong Joon Park, Peter Florence, Julian Straub, Richard Newcombe, Steven Lovegrove
TL;DR
DeepSDF introduces a continuous, latent-conditioned SDF representation for 3D shapes, enabling high-fidelity surfaces, interpolation, and completion from partial data with a compact memory footprint. It uses an encoder-less auto-decoder framework to learn a latent space of shapes, allowing a single network to represent thousands of shapes via per-shape latent codes and a shared decoder. The approach achieves state-of-the-art results on shape reconstruction and completion tasks and supports smooth latent-space interpolation, demonstrating strong generalization and efficient implicit geometry. Practical limitations include inference time due to optimization and the need for a canonical pose, guiding future work toward faster optimization and broader scene modeling.
Abstract
Computer graphics, 3D computer vision and robotics communities have produced multiple approaches to representing 3D geometry for rendering and reconstruction. These provide trade-offs across fidelity, efficiency and compression capabilities. In this work, we introduce DeepSDF, a learned continuous Signed Distance Function (SDF) representation of a class of shapes that enables high quality shape representation, interpolation and completion from partial and noisy 3D input data. DeepSDF, like its classical counterpart, represents a shape's surface by a continuous volumetric field: the magnitude of a point in the field represents the distance to the surface boundary and the sign indicates whether the region is inside (-) or outside (+) of the shape, hence our representation implicitly encodes a shape's boundary as the zero-level-set of the learned function while explicitly representing the classification of space as being part of the shapes interior or not. While classical SDF's both in analytical or discretized voxel form typically represent the surface of a single shape, DeepSDF can represent an entire class of shapes. Furthermore, we show state-of-the-art performance for learned 3D shape representation and completion while reducing the model size by an order of magnitude compared with previous work.
