Setting up for failure: automatic discovery of the neural mechanisms of cognitive errors
Puria Radmard, Paul M. Bays, Máté Lengyel
TL;DR
The study tackles the challenge of uncovering neural mechanisms by automatically training biologically plausible RNNs to reproduce full, multimodal behavioural distributions, including suboptimalities like swap errors. It combines synthetic data generation via a Bayesian non-parametric model with diffusion-model-inspired training to produce realistic response distributions, and applies this to a visual working memory task. The results show the trained networks exhibit neural dynamics and representational geometry that resemble macaque prefrontal cortex activity and provide testable predictions about swap-error mechanisms. This approach demonstrates a principled route to automatic mechanism discovery by aligning neural dynamics with rich behavioural statistics rather than optimizing for task performance alone.
Abstract
Discovering the neural mechanisms underpinning cognition is one of the grand challenges of neuroscience. However, previous approaches for building models of RNN dynamics that explain behaviour required iterative refinement of architectures and/or optimisation objectives, resulting in a piecemeal, and mostly heuristic, human-in-the-loop process. Here, we offer an alternative approach that automates the discovery of viable RNN mechanisms by explicitly training RNNs to reproduce behaviour, including the same characteristic errors and suboptimalities, that humans and animals produce in a cognitive task. Achieving this required two main innovations. First, as the amount of behavioural data that can be collected in experiments is often too limited to train RNNs, we use a non-parametric generative model of behavioural responses to produce surrogate data for training RNNs. Second, to capture all relevant statistical aspects of the data, we developed a novel diffusion model-based approach for training RNNs. To showcase the potential of our approach, we chose a visual working memory task as our test-bed, as behaviour in this task is well known to produce response distributions that are patently multimodal (due to swap errors). The resulting network dynamics correctly qualitative features of macaque neural data. Importantly, these results were not possible to obtain with more traditional approaches, i.e., when only a limited set of behavioural signatures (rather than the full richness of behavioural response distributions) were fitted, or when RNNs were trained for task optimality (instead of reproducing behaviour). Our approach also yields novel predictions about the mechanism of swap errors, which can be readily tested in experiments. These results suggest that fitting RNNs to rich patterns of behaviour provides a powerful way to automatically discover mechanisms of important cognitive functions.
