Brain-Model Evaluations Need the NeuroAI Turing Test
Jenelle Feather, Meenakshi Khosla, N. Apurva Ratan Murty, Aran Nayebi
TL;DR
The paper argues that evaluating brain-inspired AI requires more than behavioral indistinguishability and proposes the NeuroAI Turing Test, a benchmark that enforces both behavioral alignment and representational convergence to brain data within inter-subject variability. It formalizes the test by defining a dataset $\mathcal{D}$, constructing inter-organism and model-organism distance sets, and requiring convergence in distribution between these sets under a chosen similarity metric $\mathcal{M}$ with significance $\alpha$, while correcting for noise. The authors review the current state, outline alternate notions of brain-likeness, discuss trade-offs with interpretability and safety, address data and sampling limitations, and argue for the test's achievability through progressive Milestones and frontier datasets. They conclude that a rigorous, flexible benchmark centered on both behavior and internal representations can unify AI and neuroscience, driving the development of truly brain-like AI with wide-ranging scientific and practical impact.
Abstract
What makes an artificial system a good model of intelligence? The classical test proposed by Alan Turing focuses on behavior, requiring that an artificial agent's behavior be indistinguishable from that of a human. While behavioral similarity provides a strong starting point, two systems with very different internal representations can produce the same outputs. Thus, in modeling biological intelligence, the field of NeuroAI often aims to go beyond behavioral similarity and achieve representational convergence between a model's activations and the measured activity of a biological system. This position paper argues that the standard definition of the Turing Test is incomplete for NeuroAI, and proposes a stronger framework called the ``NeuroAI Turing Test'', a benchmark that extends beyond behavior alone and \emph{additionally} requires models to produce internal neural representations that are empirically indistinguishable from those of a brain up to measured individual variability, i.e. the differences between a computational model and the brain is no more than the difference between one brain and another brain. While the brain is not necessarily the ceiling of intelligence, it remains the only universally agreed-upon example, making it a natural reference point for evaluating computational models. By proposing this framework, we aim to shift the discourse from loosely defined notions of brain inspiration to a systematic and testable standard centered on both behavior and internal representations, providing a clear benchmark for neuroscientific modeling and AI development.
