Learning to Infer Implicit Surfaces without 3D Supervision

Shichen Liu; Shunsuke Saito; Weikai Chen; Hao Li

Learning to Infer Implicit Surfaces without 3D Supervision

Shichen Liu, Shunsuke Saito, Weikai Chen, Hao Li

TL;DR

The paper tackles unsupervised 3D shape learning by leveraging implicit surface representations trained solely from 2D silhouettes, addressing the lack of 3D supervision. It introduces a field probing framework with sparse 3D anchor points and rays to bridge the implicit occupancy field with image-space supervision, paired with a finite-difference-based geometric regularizer and importance weighting to enforce local smoothness near the surface. The approach achieves high-resolution, topology-flexible reconstructions from single-view cues, outperforming state-of-the-art explicit-representation baselines in 3D IoU and visual quality, and it is validated through extensive ablations. This work broadens the practical applicability of implicit surfaces for single-view 3D digitization and sets a foundation for further unsupervised, texture-aware shape modeling.

Abstract

Recent advances in 3D deep learning have shown that it is possible to train highly effective deep models for 3D shape generation, directly from 2D images. This is particularly interesting since the availability of 3D models is still limited compared to the massive amount of accessible 2D images, which is invaluable for training. The representation of 3D surfaces itself is a key factor for the quality and resolution of the 3D output. While explicit representations, such as point clouds and voxels, can span a wide range of shape variations, their resolutions are often limited. Mesh-based representations are more efficient but are limited by their ability to handle varying topologies. Implicit surfaces, however, can robustly handle complex shapes, topologies, and also provide flexible resolution control. We address the fundamental problem of learning implicit surfaces for shape inference without the need of 3D supervision. Despite their advantages, it remains nontrivial to (1) formulate a differentiable connection between implicit surfaces and their 2D renderings, which is needed for image-based supervision; and (2) ensure precise geometric properties and control, such as local smoothness. In particular, sampling implicit surfaces densely is also known to be a computationally demanding and very slow operation. To this end, we propose a novel ray-based field probing technique for efficient image-to-field supervision, as well as a general geometric regularizer for implicit surfaces, which provides natural shape priors in unconstrained regions. We demonstrate the effectiveness of our framework on the task of single-view image-based 3D shape digitization and show how we outperform state-of-the-art techniques both quantitatively and qualitatively.

Learning to Infer Implicit Surfaces without 3D Supervision

TL;DR

Abstract

Learning to Infer Implicit Surfaces without 3D Supervision

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)