Table of Contents
Fetching ...

PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging

Xin Cai, Zhiyuan You, Hailong Zhang, Wentao Liu, Jinwei Gu, Tianfan Xue

TL;DR

PhoCoLens tackles the challenge of reconstructing photorealistic and measurement-consistent images from lensless cameras by framing the forward model as $\mathbf{y} = \mathbf{A} \mathbf{x} + \mathbf{n}$ and applying a range-null space decomposition $\mathbf{x} = \mathbf{A}^{\dagger} \mathbf{A} \mathbf{x} + (\mathbf{I} - \mathbf{A}^{\dagger} \mathbf{A}) \mathbf{x}$. It introduces a two-stage pipeline: first, a spatially varying deconvolution (SVDeconv) recovers the range-space content to enforce data fidelity across a spatially varying PSF; second, a conditional diffusion model (null-space diffusion) uses the recovered low-frequency content as a condition to generate high-frequency details while preserving consistency with the measurements. The approach, validated on PhlatCam and DiffuserCam, achieves a favorable balance between fidelity and photorealism, outperforming traditional and diffusion-augmented baselines in both full-reference and perceptual metrics. This work advances practical lensless imaging by enabling photorealistic, measurement-consistent reconstructions, with potential impact on ultra-compact cameras and real-world imaging systems, albeit with current limitations in real-time applicability due to two-stage processing and diffusion sampling time.

Abstract

Lensless cameras offer significant advantages in size, weight, and cost compared to traditional lens-based systems. Without a focusing lens, lensless cameras rely on computational algorithms to recover the scenes from multiplexed measurements. However, current algorithms struggle with inaccurate forward imaging models and insufficient priors to reconstruct high-quality images. To overcome these limitations, we introduce a novel two-stage approach for consistent and photorealistic lensless image reconstruction. The first stage of our approach ensures data consistency by focusing on accurately reconstructing the low-frequency content with a spatially varying deconvolution method that adjusts to changes in the Point Spread Function (PSF) across the camera's field of view. The second stage enhances photorealism by incorporating a generative prior from pre-trained diffusion models. By conditioning on the low-frequency content retrieved in the first stage, the diffusion model effectively reconstructs the high-frequency details that are typically lost in the lensless imaging process, while also maintaining image fidelity. Our method achieves a superior balance between data fidelity and visual quality compared to existing methods, as demonstrated with two popular lensless systems, PhlatCam and DiffuserCam. Project website: https://phocolens.github.io/.

PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging

TL;DR

PhoCoLens tackles the challenge of reconstructing photorealistic and measurement-consistent images from lensless cameras by framing the forward model as and applying a range-null space decomposition . It introduces a two-stage pipeline: first, a spatially varying deconvolution (SVDeconv) recovers the range-space content to enforce data fidelity across a spatially varying PSF; second, a conditional diffusion model (null-space diffusion) uses the recovered low-frequency content as a condition to generate high-frequency details while preserving consistency with the measurements. The approach, validated on PhlatCam and DiffuserCam, achieves a favorable balance between fidelity and photorealism, outperforming traditional and diffusion-augmented baselines in both full-reference and perceptual metrics. This work advances practical lensless imaging by enabling photorealistic, measurement-consistent reconstructions, with potential impact on ultra-compact cameras and real-world imaging systems, albeit with current limitations in real-time applicability due to two-stage processing and diffusion sampling time.

Abstract

Lensless cameras offer significant advantages in size, weight, and cost compared to traditional lens-based systems. Without a focusing lens, lensless cameras rely on computational algorithms to recover the scenes from multiplexed measurements. However, current algorithms struggle with inaccurate forward imaging models and insufficient priors to reconstruct high-quality images. To overcome these limitations, we introduce a novel two-stage approach for consistent and photorealistic lensless image reconstruction. The first stage of our approach ensures data consistency by focusing on accurately reconstructing the low-frequency content with a spatially varying deconvolution method that adjusts to changes in the Point Spread Function (PSF) across the camera's field of view. The second stage enhances photorealism by incorporating a generative prior from pre-trained diffusion models. By conditioning on the low-frequency content retrieved in the first stage, the diffusion model effectively reconstructs the high-frequency details that are typically lost in the lensless imaging process, while also maintaining image fidelity. Our method achieves a superior balance between data fidelity and visual quality compared to existing methods, as demonstrated with two popular lensless systems, PhlatCam and DiffuserCam. Project website: https://phocolens.github.io/.
Paper Structure (18 sections, 17 equations, 14 figures, 4 tables)

This paper contains 18 sections, 17 equations, 14 figures, 4 tables.

Figures (14)

  • Figure 1: We introduce PhoCoLens, a lensless reconstruction algorithm that achieves both better visual quality and consistency to the ground truth than existing methods. Our method recovers more details compared to traditional reconstruction algorithms (b) and (c), and also maintains better fidelity to the ground truth compared to the generative approach (d).
  • Figure 2: Comparison between results without (left) and with (right) our spatially varying deconvolution.
  • Figure 2: Comparison of deconvolution methods for range space reconstruction and original content reconstruction.
  • Figure 3: Evaluation of fidelity and authenticity on PhlatCam khan2020flatnet dataset.
  • Figure 4: Characterization of PSFs in Lensless Camera. (a) Illustration of light propagation in the lensless camera: two point sources A and B at infinity emitting parallel light beams. Source A emits at angle $\theta$ relative to the optical axis, causing a PSF shift on the sensor plane This PSF shift depends on both the incident angle $\theta$ and the distance $d$ between the sensor and the mask. (b) Simulated PSFs for light sources at angles of 0°, 15°, and 30°. (c) Inner product similarity between the on-axis PSF and off-axis PSFs at different field positions. (d) Reconstruction using PSF at 0°, degradation is more significant at the periphery (red box) than the center (green box).
  • ...and 9 more figures