Bayesian Diffusion Models for 3D Shape Reconstruction
Haiyang Xu, Yu Lei, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu
TL;DR
BDM introduces a diffusion-based Bayesian framework that tightly couples bottom-up data-driven inference with top-down priors for single-view 3D shape reconstruction. It formalizes a stochastic Langevin-like update and uses denoising diffusion models to fuse prior and data information, offering two fusion variants: BDM-B (blending) and BDM-M (merging). Empirically, BDM achieves state-of-the-art results on ShapeNet-R2N2 and Pix3D across data regimes, with comprehensive ablations demonstrating robust performance gains, efficient inference, and favorable human evaluations. The approach enables effective use of standalone 3D priors to regularize reconstruction, addressing data scarcity and improving geometric fidelity in challenging real-world scenarios. This diffusion-based Bayesian fusion has broad implications for principled, interpretable integration of priors in computer vision tasks beyond 3D reconstruction.
Abstract
We present Bayesian Diffusion Models (BDM), a prediction algorithm that performs effective Bayesian inference by tightly coupling the top-down (prior) information with the bottom-up (data-driven) procedure via joint diffusion processes. We show the effectiveness of BDM on the 3D shape reconstruction task. Compared to prototypical deep learning data-driven approaches trained on paired (supervised) data-labels (e.g. image-point clouds) datasets, our BDM brings in rich prior information from standalone labels (e.g. point clouds) to improve the bottom-up 3D reconstruction. As opposed to the standard Bayesian frameworks where explicit prior and likelihood are required for the inference, BDM performs seamless information fusion via coupled diffusion processes with learned gradient computation networks. The specialty of our BDM lies in its capability to engage the active and effective information exchange and fusion of the top-down and bottom-up processes where each itself is a diffusion process. We demonstrate state-of-the-art results on both synthetic and real-world benchmarks for 3D shape reconstruction.
