TV-based Deep 3D Self Super-Resolution for fMRI
Fernando Pérez-Bueno, Hongwei Bran Li, Matthew S. Rosen, Shahin Nasr, Cesar Caballero-Gaudes, Juan Eugenio Iglesias
TL;DR
The paper tackles the challenge of limited fMRI spatial resolution by introducing a self-supervised 3D DL SR method that integrates a CNN with an analytical degradation model and a Total Variation prior, enabling high-resolution reconstruction without HR ground-truth data. The approach optimizes a MAP-like loss $\mathcal L = \lVert \mathbf x - \mathbf B f^{\boldsymbol\theta}(\mathbf x) \rVert_2^2 + \alpha \mathrm{TV}(f^{\boldsymbol\theta}(\mathbf x))$, leveraging the degradation operator $\mathbf B = \mathbf D\mathbf H$ and training on LR observations only. Empirical results on 7T rs-fMRI data show that the method outperforms interpolation baselines and is competitive with supervised DL SR, while preserving functional maps (seed-based connectivity) with high accuracy and low FDR. This yields a practical pathway to enhance fMRI spatial detail without demanding GT HR data, potentially improving mesoscale brain analyses and clinical/experimental workflows.
Abstract
While functional Magnetic Resonance Imaging (fMRI) offers valuable insights into cognitive processes, its inherent spatial limitations pose challenges for detailed analysis of the fine-grained functional architecture of the brain. More specifically, MRI scanner and sequence specifications impose a trade-off between temporal resolution, spatial resolution, signal-to-noise ratio, and scan time. Deep Learning (DL) Super-Resolution (SR) methods have emerged as a promising solution to enhance fMRI resolution, generating high-resolution (HR) images from low-resolution (LR) images typically acquired with lower scanning times. However, most existing SR approaches depend on supervised DL techniques, which require training ground truth (GT) HR data, which is often difficult to acquire and simultaneously sets a bound for how far SR can go. In this paper, we introduce a novel self-supervised DL SR model that combines a DL network with an analytical approach and Total Variation (TV) regularization. Our method eliminates the need for external GT images, achieving competitive performance compared to supervised DL techniques and preserving the functional maps.
