Table of Contents
Fetching ...

QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction

Ishak Ayad, Nicolas Larue, Maï K. Nguyen

TL;DR

This work tackles sparse-view CT reconstruction by formulating it as a regularized inverse problem and introducing QN-Mixer, a second-order unrolling network that integrates a latent BFGS update with a non-local regularizer. The key innovations are a latent space BFGS update to approximate the inverse Hessian, and Incept-Mixer, a non-local regularization block that captures long-range dependencies via a fusion of Inception and MLP-Mixer concepts. Empirical results on the AAPM and DeepLesion datasets show state-of-the-art PSNR and SSIM, improved data efficiency, and faster convergence with significantly fewer unrolled iterations, along with robust performance in OOD and limited-angle scenarios. The approach offers a principled path to faster, more accurate iterative reconstructions in CT, with potential applicability to a broader class of inverse problems, while also highlighting memory considerations and avenues for future expansion to larger Hessians and unsupervised training.

Abstract

Inverse problems span across diverse fields. In medical contexts, computed tomography (CT) plays a crucial role in reconstructing a patient's internal structure, presenting challenges due to artifacts caused by inherently ill-posed inverse problems. Previous research advanced image quality via post-processing and deep unrolling algorithms but faces challenges, such as extended convergence times with ultra-sparse data. Despite enhancements, resulting images often show significant artifacts, limiting their effectiveness for real-world diagnostic applications. We aim to explore deep second-order unrolling algorithms for solving imaging inverse problems, emphasizing their faster convergence and lower time complexity compared to common first-order methods like gradient descent. In this paper, we introduce QN-Mixer, an algorithm based on the quasi-Newton approach. We use learned parameters through the BFGS algorithm and introduce Incept-Mixer, an efficient neural architecture that serves as a non-local regularization term, capturing long-range dependencies within images. To address the computational demands typically associated with quasi-Newton algorithms that require full Hessian matrix computations, we present a memory-efficient alternative. Our approach intelligently downsamples gradient information, significantly reducing computational requirements while maintaining performance. The approach is validated through experiments on the sparse-view CT problem, involving various datasets and scanning protocols, and is compared with post-processing and deep unrolling state-of-the-art approaches. Our method outperforms existing approaches and achieves state-of-the-art performance in terms of SSIM and PSNR, all while reducing the number of unrolling iterations required.

QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction

TL;DR

This work tackles sparse-view CT reconstruction by formulating it as a regularized inverse problem and introducing QN-Mixer, a second-order unrolling network that integrates a latent BFGS update with a non-local regularizer. The key innovations are a latent space BFGS update to approximate the inverse Hessian, and Incept-Mixer, a non-local regularization block that captures long-range dependencies via a fusion of Inception and MLP-Mixer concepts. Empirical results on the AAPM and DeepLesion datasets show state-of-the-art PSNR and SSIM, improved data efficiency, and faster convergence with significantly fewer unrolled iterations, along with robust performance in OOD and limited-angle scenarios. The approach offers a principled path to faster, more accurate iterative reconstructions in CT, with potential applicability to a broader class of inverse problems, while also highlighting memory considerations and avenues for future expansion to larger Hessians and unsupervised training.

Abstract

Inverse problems span across diverse fields. In medical contexts, computed tomography (CT) plays a crucial role in reconstructing a patient's internal structure, presenting challenges due to artifacts caused by inherently ill-posed inverse problems. Previous research advanced image quality via post-processing and deep unrolling algorithms but faces challenges, such as extended convergence times with ultra-sparse data. Despite enhancements, resulting images often show significant artifacts, limiting their effectiveness for real-world diagnostic applications. We aim to explore deep second-order unrolling algorithms for solving imaging inverse problems, emphasizing their faster convergence and lower time complexity compared to common first-order methods like gradient descent. In this paper, we introduce QN-Mixer, an algorithm based on the quasi-Newton approach. We use learned parameters through the BFGS algorithm and introduce Incept-Mixer, an efficient neural architecture that serves as a non-local regularization term, capturing long-range dependencies within images. To address the computational demands typically associated with quasi-Newton algorithms that require full Hessian matrix computations, we present a memory-efficient alternative. Our approach intelligently downsamples gradient information, significantly reducing computational requirements while maintaining performance. The approach is validated through experiments on the sparse-view CT problem, involving various datasets and scanning protocols, and is compared with post-processing and deep unrolling state-of-the-art approaches. Our method outperforms existing approaches and achieves state-of-the-art performance in terms of SSIM and PSNR, all while reducing the number of unrolling iterations required.
Paper Structure (43 sections, 17 equations, 17 figures, 12 tables, 4 algorithms)

This paper contains 43 sections, 17 equations, 17 figures, 12 tables, 4 algorithms.

Figures (17)

  • Figure 1: CT Reconstruction with 32 views of State-of-the-Art Methods. Comparative analysis with post-processing and first-order unrolling networks highlights QN-Mixer's superiority in artifact removal, training time, and data efficiency.
  • Figure 2: Overall structure of the proposed QN-Mixer for sparse-view CT reconstruction, unrolled from \ref{['algo:qnmixer']}. The method leverages the advantages of the quasi-Newton method for faster convergence while incorporating a latent BFGS update.
  • Figure 3: Architecture of our regularization block. It is referred to as "Incept-Mixer" and denoted as ${\mathcal{G}}$ in \ref{['eq:unrolled_solution']}
  • Figure 4: Visual comparison on AAPM. From top to bottom: the results under the following conditions: first $(n_v{=}32, N_1)$, second $(n_v{=}64, N_1)$, third $(n_v{=}32, N_0)$. The last row presents out-of-distribution (OOD) results with a randomly overlaid circle on a test image. The display window is set to $\left[-1000, 800\right]$ HU.
  • Figure 5: Visual comparison on DeepLesion of state-of-the-art methods. Rows display results under different conditions: $(n_v{=}64, N_1)$ and $(n_v{=}128, N_1)$. Display windows are set to $\left[-1000, 800\right]$ HU for the first row and $\left[-200, 300\right]$ HU for the second row.
  • ...and 12 more figures