Getting aligned on representational alignment

Ilia Sucholutsky; Lukas Muttenthaler; Adrian Weller; Andi Peng; Andreea Bobu; Been Kim; Bradley C. Love; Christopher J. Cueva; Erin Grant; Iris Groen; Jascha Achterberg; Joshua B. Tenenbaum; Katherine M. Collins; Katherine L. Hermann; Kerem Oktar; Klaus Greff; Martin N. Hebart; Nathan Cloos; Nikolaus Kriegeskorte; Nori Jacoby; Qiuyi Zhang; Raja Marjieh; Robert Geirhos; Sherol Chen; Simon Kornblith; Sunayana Rane; Talia Konkle; Thomas P. O'Connell; Thomas Unterthiner; Andrew K. Lampinen; Klaus-Robert Müller; Mariya Toneva; Thomas L. Griffiths

Getting aligned on representational alignment

Ilia Sucholutsky, Lukas Muttenthaler, Adrian Weller, Andi Peng, Andreea Bobu, Been Kim, Bradley C. Love, Christopher J. Cueva, Erin Grant, Iris Groen, Jascha Achterberg, Joshua B. Tenenbaum, Katherine M. Collins, Katherine L. Hermann, Kerem Oktar, Klaus Greff, Martin N. Hebart, Nathan Cloos, Nikolaus Kriegeskorte, Nori Jacoby, Qiuyi Zhang, Raja Marjieh, Robert Geirhos, Sherol Chen, Simon Kornblith, Sunayana Rane, Talia Konkle, Thomas P. O'Connell, Thomas Unterthiner, Andrew K. Lampinen, Klaus-Robert Müller, Mariya Toneva, Thomas L. Griffiths

TL;DR

The paper identifies representational alignment as a cross-disciplinary core concern in cognitive science, neuroscience, and machine learning, and argues that fragmented terminology hinders progress. It proposes a unifying framework that decomposes studies into data, systems, measurements, embeddings, and an alignment function, with three objectives: measure, bridge, and increase alignment. Through a broad survey, the authors illustrate how this framework maps across domains, including RSA/CKA-based measures, hyperalignment, and model-to-model or human-model mappings. They discuss open problems—from data selection and stimulus design to probing black-box systems and potential risks—aiming to catalyze cross-disciplinary collaboration and accelerate progress in building more aligned information processing systems.

Abstract

Biological and artificial information processing systems form representations of the world that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the similarity between the representations formed by these diverse systems? Do similarities in representations then translate into similar behavior? If so, then how can a system's representations be modified to better match those of another system? These questions pertaining to the study of representational alignment are at the heart of some of the most promising research areas in contemporary cognitive science, neuroscience, and machine learning. In this Perspective, we survey the exciting recent developments in representational alignment research in the fields of cognitive science, neuroscience, and machine learning. Despite their overlapping interests, there is limited knowledge transfer between these fields, so work in one field ends up duplicated in another, and useful innovations are not shared effectively. To improve communication, we propose a unifying framework that can serve as a common language for research on representational alignment, and map several streams of existing work across fields within our framework. We also lay out open problems in representational alignment where progress can benefit all three of these fields. We hope that this paper will catalyze cross-disciplinary collaboration and accelerate progress for all communities studying and developing information processing systems.

Getting aligned on representational alignment

TL;DR

Abstract

Paper Structure (53 sections, 6 equations, 2 figures, 2 tables)

This paper contains 53 sections, 6 equations, 2 figures, 2 tables.

Introduction
Background and review
Cognitive Science
Similarity judgments and multidimensional scaling
Human-machine alignment
Semantic representations
Alignment across individual participants' behavior
Alignment across cultures
Neuroscience
Alignment across heterogeneous measurements
Alignment across individuals
Alignment between brain activity and model systems
Alignment for hypothesis testing
Alignment for stimulus selection or design
Alignment as communication
...and 38 more sections

Figures (2)

Figure 1: Examples of contemporary representational alignment research in cognitive science, neuroscience, and machine learning. We discuss three types of representational alignment research. Measuring representational alignment aims to measure the degree of alignment between two systems as a dependent measure in an experiment (a. jacoby2017integer, b. kriegeskorte2008matching, c. pmlr-v97-kornblith19a). Bridging representational spaces aims to bring representations into a shared space to facilitate some downstream application (d. hebart2020revealing, e. o2018predicting, f. gupta2017aligned). Increasing representational alignment aims to update the internal representations or measurements of one system to increase its alignment with another system (g. muttenthaler2023improving, h. khosla2022high, i. tian2019contrastive). (Reproduced with permission from the cited papers.)
Figure 2: A general framework for conducting and describing representational alignment research. Most studies of representational alignment involve five components that researchers can control: data is presented via interfaces to the two systems. The systems form internal representations of the data and researchers take measurements of the systems and map them to some embedding space to try to infer the representations. An alignment function is then applied to those inferred representations to compute a single alignment score. These studies typically have one of three objectives: measuring the representational alignment between the two systems (i.e., the alignment score), bridging between two different representational spaces by finding a shared embedding space, or increasing the representational alignment between the two systems either by updating their internal representations (e.g., via learning) or how they are measured.

Getting aligned on representational alignment

TL;DR

Abstract

Getting aligned on representational alignment

Authors

TL;DR

Abstract

Table of Contents

Figures (2)