Making Your Dreams A Reality: Decoding the Dreams into a Coherent Video Story from fMRI Signals
Yanwei Fu, Jianxiong Gao, Baofeng Yang, Jianfeng Feng
TL;DR
This work tackles decoding visual content from sleep-time fMRI to produce coherent dream video narratives. It introduces a three-stage pipeline: Visual Stimulus Perception Reconstruction (wakeful image reconstruction from fMRI using cross-subject NSD pretraining and diffusion-based generation), Dream Visual Imagery Decoding (zero-shot decoding of dream content via shared real-dream brain patterns), and Dream Narrative Integration (LLM-driven synthesis of decoded frames into a cohesive dream story). The framework leverages the Natural Scenes Dataset for cross-subject generalization and evaluates both visual reconstructions and narrative quality using CLIP-based similarity and qualitative LLMS-generated scripts. The results demonstrate the feasibility of translating subjective dream experiences into interpretable visual narratives, with implications for dream research, neuroscience, and creative visualization.
Abstract
This paper studies the brave new idea for Multimedia community, and proposes a novel framework to convert dreams into coherent video narratives using fMRI data. Essentially, dreams have intrigued humanity for centuries, offering glimpses into our subconscious minds. Recent advancements in brain imaging, particularly functional magnetic resonance imaging (fMRI), have provided new ways to explore the neural basis of dreaming. By combining subjective dream experiences with objective neurophysiological data, we aim to understand the visual aspects of dreams and create complete video narratives. Our process involves three main steps: reconstructing visual perception, decoding dream imagery, and integrating dream stories. Using innovative techniques in fMRI analysis and language modeling, we seek to push the boundaries of dream research and gain deeper insights into visual experiences during sleep. This technical report introduces a novel approach to visually decoding dreams using fMRI signals and weaving dream visuals into narratives using language models. We gather a dataset of dreams along with descriptions to assess the effectiveness of our framework.
