Dynamical similarity analysis can identify compositional dynamics developing in RNNs

Quentin Guilhot; Michał Wójcik; Jascha Achterberg; Rui Ponte Costa

TL;DR

This work tackles the challenge of benchmarking dynamical representational metrics by introducing two principled test cases based on attractor dynamics and compositional learning in RNNs. It compares three metrics (CKA, Procrustes, and Dynamical Similarity Analysis, DSA) and shows that DSA is more noise-robust and better able to link evolving representations to computations. Applied to state-space models such as Mamba, DSA suggests dynamics that resemble reservoir behavior. The findings argue for a benchmark-driven approach to metric development, showing that DSA, unlike CKA and Procrustes, captures compositional dynamical motifs and their computational relevance, with implications for mechanistic interpretability and neuroscience-inspired analysis. Overall, the work provides a framework and initial evidence that dynamical metrics can improve our understanding of how recurrent systems develop and deploy computations.

Abstract

Methods for analyzing representations in neural systems have become a popular tool in both neuroscience and mechanistic interpretability. Measures that compare how similar neural activations are across conditions, architectures, and species give us a scalable way of learning how information is transformed within different neural networks. Despite this popularity, recent investigations have revealed that some metrics can respond to spurious signals and hence give misleading results. To identify the most reliable metric and understand how measures could be improved, it will be important to identify specific test cases that can serve as benchmarks. Here we propose that the phenomenon of compositional learning in recurrent neural networks (RNNs) allows us to build a test case for dynamical representation alignment metrics. By implementing this case, we show that it enables us to test whether metrics can identify representations that gradually develop throughout learning, and to probe whether the representations identified by metrics are relevant to the computations executed by networks. By building both an attractor-based and an RNN-based test case, we show that the new Dynamical Similarity Analysis (DSA) is more noise-robust and identifies behaviorally relevant representations more reliably than prior metrics (Procrustes, CKA). We also show how test cases can be used beyond evaluating metrics to study new architectures. Specifically, results from applying DSA to modern (Mamba) state space models suggest that, in contrast to RNNs, these models may not exhibit changes to their recurrent dynamics due to their expressiveness. Overall, by developing test cases, we show DSA's exceptional ability to detect compositional dynamical motifs, thereby enhancing our understanding of how computations unfold in RNNs.
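For orientation, the two baseline metrics named in the abstract can be sketched in a few lines of NumPy. This is an illustrative sketch under common textbook definitions (linear CKA, and orthogonal Procrustes distance after centering and unit-norm scaling), not the authors' implementation; the function names and the random test data are hypothetical. DSA is not sketched here, since it compares fitted dynamics rather than the geometry of individual states.

```python
import numpy as np


def center(X):
    """Subtract the per-feature mean from a (samples x features) matrix."""
    return X - X.mean(axis=0, keepdims=True)


def linear_cka(X, Y):
    """Linear CKA between two (samples x features) activation matrices."""
    X, Y = center(X), center(Y)
    cross = np.linalg.norm(Y.T @ X, "fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, "fro")
    norm_y = np.linalg.norm(Y.T @ Y, "fro")
    return cross / (norm_x * norm_y)


def procrustes_distance(X, Y):
    """Orthogonal Procrustes distance; assumes equal feature dimensions."""
    X, Y = center(X), center(Y)
    X = X / np.linalg.norm(X, "fro")
    Y = Y / np.linalg.norm(Y, "fro")
    # For unit-norm X and Y: min over orthogonal Q of ||XQ - Y||_F^2
    # equals 2 - 2 * (sum of singular values of X^T Y).
    nuclear = np.linalg.svd(X.T @ Y, compute_uv=False).sum()
    return np.sqrt(max(0.0, 2.0 - 2.0 * nuclear))


# Hypothetical data: Y is a rotated copy of X, Z is unrelated noise.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 10))
Q, _ = np.linalg.qr(rng.standard_normal((10, 10)))  # random orthogonal matrix
Y = X @ Q
Z = rng.standard_normal((200, 10))

cka_rotated = linear_cka(X, Y)
cka_unrelated = linear_cka(X, Z)
proc_rotated = procrustes_distance(X, Y)
proc_unrelated = procrustes_distance(X, Z)
print(cka_rotated, cka_unrelated, proc_rotated, proc_unrelated)
```

Both metrics treat the rotated copy as identical to the original, which illustrates that they compare the geometry of activation states and carry no explicit notion of how states evolve over time.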
