Local Deep Implicit Functions for 3D Shape
Kyle Genova, Forrester Cole, Avneesh Sud, Aaron Sarna, Thomas Funkhouser
TL;DR
LDIF presents a structured local implicit representation that decomposes 3D shapes into overlapping Gaussian-based regions, each with a small latent code, enabling accurate surface reconstruction with far fewer parameters than global DIFs. By coupling a SIF-inspired space decomposition with local deep implicit functions decoded per element, LDIF achieves superior reconstruction and generalization, including unseen classes, while remaining computationally efficient. The approach supports end-to-end training from depth or mesh inputs, delivers effective depth completion, and extends to partial human-body scans without domain-specific templates. Overall, LDIF offers a scalable, high-detail 3D representation that balances structure and local detail to improve both accuracy and generalization across diverse shape collections.
Abstract
The goal of this project is to learn a 3D shape representation that enables accurate surface reconstruction, compact storage, efficient computation, consistency for similar shapes, generalization across diverse shape categories, and inference from depth camera observations. Towards this end, we introduce Local Deep Implicit Functions (LDIF), a 3D shape representation that decomposes space into a structured set of learned implicit functions. We provide networks that infer the space decomposition and local deep implicit functions from a 3D mesh or posed depth image. During experiments, we find that it provides 10.3 points higher surface reconstruction accuracy (F-Score) than the state-of-the-art (OccNet), while requiring fewer than 1 percent of the network parameters. Experiments on posed depth image completion and generalization to unseen classes show 15.8 and 17.8 point improvements over the state-of-the-art, while producing a structured 3D representation for each input with consistency across diverse shape collections.
