Null-Space Diffusion Distillation for Efficient Photorealistic Lensless Imaging
Jose Reinaldo Cunha Santos A V Silva Neto, Hodaka Kawachi, Yasushi Yagi, Tomoya Nakamura
TL;DR
This paper tackles the challenge of achieving photorealistic lensless imaging without ground-truth paired supervision. It analyzes diffusion-prior guidance and shows that measurement-space DPS can struggle with consistency, while reconstruction-space guidance via range–null decomposition (DDNM$+$) yields more plausible results. The authors introduce NSDD, a training-free, offline distillation of a $DDNM^+$ teacher into a single-pass student conditioned on $(\boldsymbol{y}, \boldsymbol{A}^{\dagger}\boldsymbol{y})$, achieving near-teacher perceptual quality with substantially reduced latency. Experiments on Lensless FFHQ and PhlatCam demonstrate competitive perceptual quality and faster inference than multi-step diffusion methods, indicating a practical path toward fast, ground-truth-free, photorealistic lensless imaging.
Abstract
State-of-the-art photorealistic reconstructions for lensless cameras often rely on paired lensless-lensed supervision, which can bias models due to lens-lensless domain mismatch. To avoid this, ground-truth-free diffusion priors are attractive; however, generic formulations tuned for conventional inverse problems often break under the noisy, highly multiplexed, and ill-posed lensless deconvolution setting. We observe that methods which separate range-space enforcement from null-space diffusion-prior updates yield stable, realistic reconstructions. Building on this, we introduce Null-Space Diffusion Distillation (NSDD): a single-pass student that distills the null-space component of an iterative DDNM+ solver, conditioned on the lensless measurement and on a range-space anchor. NSDD preserves measurement consistency and achieves photorealistic results without paired supervision at a fraction of the runtime and memory. On Lensless-FFHQ and PhlatCam, NSDD is the second fastest, behind Wiener, and achieves near-teacher perceptual quality (second-best LPIPS, below DDNM+), outperforming DPS and classical convex baselines. These results suggest a practical path toward fast, ground-truth-free, photorealistic lensless imaging.
