Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI
Marlène Careil, Yohann Benchetrit, Jean-Rémi King
TL;DR
Dynadiff introduces a single-stage diffusion-based decoder that directly uses continuously evolving fMRI time series to reconstruct natural images, addressing the limitations of time-collapsed preprocessing and multi-stage pipelines. A brain module maps fMRI sequences to conditioning embeddings, which condition a pretrained diffusion model trained jointly in one stage; inference uses a DDIM scheduler for efficient denoising. On the Natural Scenes Dataset, Dynadiff delivers state-of-the-art time-resolved reconstructions, with strong gains on high-level semantics and clear evidence that time-aware decoding reveals dynamic evolution of image representations in brain activity. The work offers a practical, time-resolved brain-to-image decoding approach with implications for neuroprosthetics and neuroscience, while outlining ethical safeguards such as face-blurring and open research practices.
Abstract
Brain-to-image decoding has been recently propelled by the progress in generative AI models and the availability of large ultra-high field functional Magnetic Resonance Imaging (fMRI). However, current approaches depend on complicated multi-stage pipelines and preprocessing steps that typically collapse the temporal dimension of brain recordings, thereby limiting time-resolved brain decoders. Here, we introduce Dynadiff (Dynamic Neural Activity Diffusion for Image Reconstruction), a new single-stage diffusion model designed for reconstructing images from dynamically evolving fMRI recordings. Our approach offers three main contributions. First, Dynadiff simplifies training as compared to existing approaches. Second, our model outperforms state-of-the-art models on time-resolved fMRI signals, especially on high-level semantic image reconstruction metrics, while remaining competitive on preprocessed fMRI data that collapse time. Third, this approach allows a precise characterization of the evolution of image representations in brain activity. Overall, this work lays the foundation for time-resolved brain-to-image decoding.
