Brain Effective Connectivity Estimation via Fourier Spatiotemporal Attention
Wen Xiong, Jinduo Liu, Junzhong Ji, Fenglong Ma
TL;DR
This work tackles brain EC estimation from noisy fMRI by introducing Fourier spatiotemporal attention (FSTA-EC), which combines frequency-domain denoising via a learnable Fourier filter with a unified spatiotemporal attention framework to capture inter-series (spatial) and intra-series (temporal) dependencies. The approach uses FA as an encoder to denoise and extract global frequency features, and STA as a decoder to derive EC $\\mathbf{A}$ and fused spatiotemporal representations, with a loss that promotes sparsity in EC. Empirical results on simulated and real resting-state fMRI show superior performance over state-of-the-art methods in EC recovery and downstream disease classification, while maintaining favorable efficiency. The method offers robust EC estimation under high noise and demonstrates potential for biomarker discovery in neurological and psychiatric contexts.
Abstract
Estimating brain effective connectivity (EC) from functional magnetic resonance imaging (fMRI) data can aid in comprehending the neural mechanisms underlying human behavior and cognition, providing a foundation for disease diagnosis. However, current spatiotemporal attention modules handle temporal and spatial attention separately, extracting temporal and spatial features either sequentially or in parallel. These approaches overlook the inherent spatiotemporal correlations present in real world fMRI data. Additionally, the presence of noise in fMRI data further limits the performance of existing methods. In this paper, we propose a novel brain effective connectivity estimation method based on Fourier spatiotemporal attention (FSTA-EC), which combines Fourier attention and spatiotemporal attention to simultaneously capture inter-series (spatial) dynamics and intra-series (temporal) dependencies from high-noise fMRI data. Specifically, Fourier attention is designed to convert the high-noise fMRI data to frequency domain, and map the denoised fMRI data back to physical domain, and spatiotemporal attention is crafted to simultaneously learn spatiotemporal dynamics. Furthermore, through a series of proofs, we demonstrate that incorporating learnable filter into fast Fourier transform and inverse fast Fourier transform processes is mathematically equivalent to performing cyclic convolution. The experimental results on simulated and real-resting-state fMRI datasets demonstrate that the proposed method exhibits superior performance when compared to state-of-the-art methods.
