Schrödinger Diffusion Driven Signal Recovery in 3T BOLD fMRI Using Unmatched 7T Observations
Yujian Xiong, Xuanzhao Dong, Sebastian Waz, Wenhui Zhu, Negar Mallak, Zhong-lin Lu, Yalin Wang
TL;DR
The paper addresses the gap between 3T and 7T fMRI quality for retinotopic mapping by aligning 3T and 7T signals in a shared 2D brain-disk domain and applying an unsupervised Schrödinger Bridge to translate unmatched 7T data into high-SNR 3T reconstructions. The approach combines conformal brain-disk parameterization with a discrete, unpaired Schrödinger Bridge diffusion model learned through neural guidance, enabling high-quality fMRI time series from typical 3T acquisitions. Evaluations on real and synthetic data show improved signal fidelity (lower FID, higher SSIM/PSNR) and enhanced pRF fits (higher $R^2$), suggesting that 7T-level quality can be computationally approximated from 3T scans. This method has practical implications for more accessible high-resolution retinotopic analysis and broader downstream analyses like segmentation and classification in neuroimaging.
Abstract
Ultra-high-field (7 Tesla) BOLD fMRI offers exceptional detail in both spatial and temporal domains, along with robust signal-to-noise characteristics, making it a powerful modality for studying visual information processing in the brain. However, due to the limited accessibility of 7T scanners, the majority of neuroimaging studies are still conducted using 3T systems, which inherently suffer from reduced fidelity in both resolution and SNR. To mitigate this limitation, we introduce a new computational approach designed to enhance the quality of 3T BOLD fMRI acquisitions. Specifically, we project both 3T and 7T datasets, sourced from different individuals and experimental setups, into a shared low-dimensional representation space. Within this space, we employ a lightweight, unsupervised Schrödinger Bridge framework to infer a high-SNR, high-resolution counterpart of the 3T data, without relying on paired supervision. This methodology is evaluated across multiple fMRI retinotopy datasets, including synthetically generated samples, and demonstrates a marked improvement in the reliability and fit of population receptive field (pRF) models applied to the enhanced 3T outputs. Our findings suggest that it is feasible to computationally approximate 7T-level quality from standard 3T acquisitions.
