Generative AI-based closed-loop fMRI system
Mikihiro Kasahara, Taiki Oka, Vincent Taschereau-Dumouchel, Mitsuo Kawato, Hiroki Takakura, Aurelio Cortese
TL;DR
The paper addresses the risk that malicious generative AI could steer cognition or decision-making and the lack of neural or computational countermeasures. It introduces DecNefGAN, a closed-loop architecture that couples decoded neurofeedback with GAN-like generative AI, using MVPA decoding and CLIP-based personalization to map brain states to AI-generated stimuli. The work outlines the architecture, decoding/generation pipeline, potential issues, and ethical considerations, and discusses applications in security and psychiatry. By enabling the study of human resilience to AI manipulation, DecNefGAN aims to illuminate neural mechanisms of counteracting AI influence and to inform safeguards for cognitive safety and robust human-AI interaction.
Abstract
While generative AI is now widespread and useful in society, there are potential risks of misuse, e.g., unconsciously influencing cognitive processes or decision-making. Although this causes a security problem in the cognitive domain, there has been no research about neural and computational mechanisms counteracting the impact of malicious generative AI in humans. We propose DecNefGAN, a novel framework that combines a generative adversarial system and a neural reinforcement model. More specifically, DecNefGAN bridges human and generative AI in a closed-loop system, with the AI creating stimuli that induce specific mental states, thus exerting external control over neural activity. The objective of the human is the opposite, to compete and reach an orthogonal mental state. This framework can contribute to elucidating how the human brain responds to and counteracts the potential influence of generative AI.
