Joint Deblurring and 3D Reconstruction for Macrophotography
Yifan Zhao, Liangchen Li, Yuqi Zhou, Kai Wang, Yan Liang, Juyong Zhang
TL;DR
The paper tackles the defocus blur challenge in macrophotography by proposing a self-supervised, joint deblurring and 3D reconstruction framework that operates on multi-view blurred inputs. It models per-pixel defocus as depth-dependent Gaussian blur and optimizes both a sharp 3D scene (via 3D Gaussian Splats) and a BlurNet that predicts spatial blur variances, guided by a differentiable renderer. Key contributions include the first end-to-end joint approach tailored to macro imaging, a depth-aware blur model with a clarity mask, and a multi-stage training strategy that yields high-fidelity 3D appearance and sharp images from few views. The method demonstrates superior deblurring and 3D reconstruction quality over baselines on synthetic and real datasets, highlighting its potential for accurate macro-scale 3D capture without extensive supervised data.
Abstract
Macro lens has the advantages of high resolution and large magnification, and 3D modeling of small and detailed objects can provide richer information. However, defocus blur in macrophotography is a long-standing problem that heavily hinders the clear imaging of the captured objects and high-quality 3D reconstruction of them. Traditional image deblurring methods require a large number of images and annotations, and there is currently no multi-view 3D reconstruction method for macrophotography. In this work, we propose a joint deblurring and 3D reconstruction method for macrophotography. Starting from multi-view blurry images captured, we jointly optimize the clear 3D model of the object and the defocus blur kernel of each pixel. The entire framework adopts a differentiable rendering method to self-supervise the optimization of the 3D model and the defocus blur kernel. Extensive experiments show that from a small number of multi-view images, our proposed method can not only achieve high-quality image deblurring but also recover high-fidelity 3D appearance.
