Probing the contents of semantic representations from text, behavior, and brain data using the psychNorms metabase
Zak Hussain, Rui Mata, Ben R. Newell, Dirk U. Wulff
TL;DR
The paper investigates whether semantic representations from text, behavior, and brain data encode overlapping or distinct information, using representational similarity analysis (RSA) and a novel representational content analysis (RCA). It introduces the psychNorms metabase of 292 human-rated norms to interpret representation content and demonstrates that behavior-based representations can rival or exceed text in capturing psychological dimensions, while also contributing unique variance in affective, agentic, and socio-moral domains. A controlled vocabulary approach with a base set $V_ ext{base}$ enables cross-type RSA, and ensemble RCA shows combining text and behavior often improves norm explainability, suggesting that behavior complements text for human-aligned semantic representations. The findings have implications for evaluating and aligning large language models and for enriching cognitive and affective modeling with behavior-derived semantic information.
Abstract
Semantic representations are integral to natural language processing, psycholinguistics, and artificial intelligence. Although often derived from internet text, recent years have seen a rise in the popularity of behavior-based (e.g., free associations) and brain-based (e.g., fMRI) representations, which promise improvements in our ability to measure and model human representations. We carry out the first systematic evaluation of the similarities and differences between semantic representations derived from text, behavior, and brain data. Using representational similarity analysis, we show that word vectors derived from behavior and brain data encode information that differs from their text-derived cousins. Furthermore, drawing on our psychNorms metabase, alongside an interpretability method that we call representational content analysis, we find that, in particular, behavior representations capture unique variance on certain affective, agentic, and socio-moral dimensions. We thus establish behavior as an important complement to text for capturing human representations and behavior. These results are broadly relevant to research aimed at learning human-aligned semantic representations, including work on evaluating and aligning large language models.
