ZEBRA: Towards Zero-Shot Cross-Subject Generalization for Universal Brain Visual Decoding
Haonan Wang, Jingyu Lu, Hongrui Li, Xiaomeng Li
TL;DR
Zebra tackles zero-shot cross-subject brain decoding by disentangling fMRI representations into subject-invariant and semantic-specific components using residual decomposition and adversarial training. The method couples a ViT-based brain encoder with a diffusion prior and introduces Subject-Invariant Feature Extraction (SIFE) and Semantic-Specific Feature Extraction (SSFE) to learn universal semantics while suppressing subject-specific noise. Through adversarial objectives and preservation anchors, Zebra achieves zero-shot generalization to unseen subjects and attains performance close to finetuned models on multiple metrics, demonstrating scalable, real-world potential for brain decoding. The approach advances practical neural decoding by reducing the need for subject-specific data, enabling faster, more accessible brain-computer interface applications, while outlining future work on semantic fidelity and broader applicability.
Abstract
Recent advances in neural decoding have enabled the reconstruction of visual experiences from brain activity, positioning fMRI-to-image reconstruction as a promising bridge between neuroscience and computer vision. However, current methods predominantly rely on subject-specific models or require subject-specific fine-tuning, limiting their scalability and real-world applicability. In this work, we introduce ZEBRA, the first zero-shot brain visual decoding framework that eliminates the need for subject-specific adaptation. ZEBRA is built on the key insight that fMRI representations can be decomposed into subject-related and semantic-related components. By leveraging adversarial training, our method explicitly disentangles these components to isolate subject-invariant, semantic-specific representations. This disentanglement allows ZEBRA to generalize to unseen subjects without any additional fMRI data or retraining. Extensive experiments show that ZEBRA significantly outperforms zero-shot baselines and achieves performance comparable to fully finetuned models on several metrics. Our work represents a scalable and practical step toward universal neural decoding. Code and model weights are available at: https://github.com/xmed-lab/ZEBRA.
