Quantifying task-relevant representational similarity using decision variable correlation
Yu, Qian, Wilson S. Geisler, Xue-Xin Wei
TL;DR
The paper introduces decision variable correlation (DVC) as a task-relevant metric to compare neural representations in brains and deep networks, distinguishing task-driven similarity from global representational overlap. By decoding decision variables with linear readouts and correcting for noise, DVC quantifies image-by-image consistency of two observers' decision strategies. Across monkey V4/IT and various networks, brain–brain similarity rivals model–model similarity, while model–brain similarity declines as ImageNet accuracy rises, and adversarial or data-rich training fails to improve brain alignment. The findings imply a divergence between primate ventral pathway representations and those learned by classification-trained models, urging task-centric methods and training paradigms that better capture brain-like, task-relevant representations. The work also shows DVC can complement Cohen's Kappa and RSA by isolating task-relevant decision dimensions and reducing biases inherent in decoders.
Abstract
Previous studies have compared neural activities in the visual cortex to representations in deep neural networks trained on image classification. Interestingly, while some suggest that their representations are highly similar, others argued the opposite. Here, we propose a new approach to characterize the similarity of the decision strategies of two observers (models or brains) using decision variable correlation (DVC). DVC quantifies the image-by-image correlation between the decoded decisions based on the internal neural representations in a classification task. Thus, it can capture task-relevant information rather than general representational alignment. We evaluate DVC using monkey V4/IT recordings and network models trained on image classification tasks. We find that model-model similarity is comparable to monkey-monkey similarity, whereas model-monkey similarity is consistently lower. Strikingly, DVC decreases with increasing network performance on ImageNet-1k. Adversarial training does not improve model-monkey similarity in task-relevant dimensions assessed using DVC, although it markedly increases the model-model similarity. Similarly, pre-training on larger datasets does not improve model-monkey similarity. These results suggest a divergence between the task-relevant representations in monkey V4/IT and those learned by models trained on image classification tasks.
