Towards Interpretable Visual Decoding with Attention to Brain Representations

Pinyuan Feng; Hossein Adeli; Wenxuan Guo; Fan Cheng; Ethan Hwang; Nikolaus Kriegeskorte

Towards Interpretable Visual Decoding with Attention to Brain Representations

Pinyuan Feng, Hossein Adeli, Wenxuan Guo, Fan Cheng, Ethan Hwang, Nikolaus Kriegeskorte

TL;DR

This work introduces an Image-Brain BI-directional interpretability framework (IBBI) that analyzes cross-attention patterns across diffusion denoising steps to reveal how different cortical areas influence the unfolding generative trajectory and highlights the potential of end-to-end brain-to-image reconstruction.

Abstract

Recent work has demonstrated that complex visual stimuli can be decoded from human brain activity using deep generative models, offering new ways to probe how the brain represents real-world scenes. However, many existing approaches first map brain signals into intermediate image or text feature spaces before guiding the generative process, which obscures the contributions of different brain areas to the final reconstruction output. In this work, we propose NeuroAdapter, a visual decoding framework that directly conditions a latent diffusion model on brain representations, bypassing the need for intermediate feature spaces. Our method demonstrates competitive visual reconstruction quality on public fMRI datasets compared to prior work, while providing greater transparency into how brain signals drive visual reconstruction. To this end, we introduce an Image-Brain BI-directional interpretability framework (IBBI) that analyzes cross-attention patterns across diffusion denoising steps to reveal how different cortical areas influence the unfolding generative trajectory. Our work highlights the potential of end-to-end brain-to-image reconstruction and establishes a path for interpretable neural decoding.

Towards Interpretable Visual Decoding with Attention to Brain Representations

TL;DR

Abstract

Towards Interpretable Visual Decoding with Attention to Brain Representations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (20)