Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Ann Huang, Satpreet H. Singh, Flavio Martinelli, Kanaka Rajan
TL;DR
This work tackles the problem of solution degeneracy in independently trained task-trained RNNs by introducing a unified, multi-level framework that quantifies degeneracy across behavior, neural dynamics, and weights. Through a large-scale study across four neuroscience-relevant tasks and multiple control factors, the authors demonstrate contravariant and covariant relationships: higher task complexity and stronger feature learning tend to make neural dynamics more consistent while expanding weight diversity, whereas larger networks and structural regularization promote convergence across all levels. They validate the Contravariance Principle and provide practical guidance for tuning degeneracy to either reveal shared neural mechanisms or model individual variability observed in biology. The framework and findings offer a principled path toward more interpretable and biologically grounded RNN models, with implications for ensemble modeling and hypothesis testing in neuroscience.
Abstract
Task-trained recurrent neural networks (RNNs) are widely used in neuroscience and machine learning to model dynamical computations. To gain mechanistic insight into how neural systems solve tasks, prior work often reverse-engineers individual trained networks. However, different RNNs trained on the same task and achieving similar performance can exhibit strikingly different internal solutions, a phenomenon known as solution degeneracy. Here, we develop a unified framework to systematically quantify and control solution degeneracy across three levels: behavior, neural dynamics, and weight space. We apply this framework to 3,400 RNNs trained on four neuroscience-relevant tasks: flip-flop memory, sine wave generation, delayed discrimination, and path integration, while systematically varying task complexity, learning regime, network size, and regularization. We find that higher task complexity and stronger feature learning reduce degeneracy in neural dynamics but increase it in weight space, with mixed effects on behavior. In contrast, larger networks and structural regularization reduce degeneracy at all three levels. These findings empirically validate the Contravariance Principle and provide practical guidance for researchers seeking to tune the variability of RNN solutions, either to uncover shared neural mechanisms or to model the individual variability observed in biological systems. This work provides a principled framework for quantifying and controlling solution degeneracy in task-trained RNNs, offering new tools for building more interpretable and biologically grounded models of neural computation.
