A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns
Asaf Yehudai, Taelin Karidi, Gabriel Stanovsky, Ariel Goldstein, Omri Abend
TL;DR
This study uses a cognitive-science-inspired cross-domain alignment task to probe whether large language models exhibit human-like conceptual representations. By prompting seven instruction-following LLMs with 75 cross-domain prompts derived from a human dataset and evaluating both mappings and explanations with population- and individual-level analyses, the authors show robust above-chance alignment with human responses and explanation patterns. The findings reveal that LLMs not only produce mappings resembling typical human responses but also generate explanations that distribute across similarity categories in ways similar to humans, suggesting shared underlying representations. The work provides a framework for assessing cognitive-like behavior in LLMs and has implications for cognitive modeling, interpretability, and evaluating alignment between AI systems and human conceptual reasoning.
Abstract
Cross-domain alignment refers to the task of mapping a concept from one domain to another. For example, ``If a \textit{doctor} were a \textit{color}, what color would it be?''. This seemingly peculiar task is designed to investigate how people represent concrete and abstract concepts through their mappings between categories and their reasoning processes over those mappings. In this paper, we adapt this task from cognitive science to evaluate the conceptualization and reasoning abilities of large language models (LLMs) through a behavioral study. We examine several LLMs by prompting them with a cross-domain mapping task and analyzing their responses at both the population and individual levels. Additionally, we assess the models' ability to reason about their predictions by analyzing and categorizing their explanations for these mappings. The results reveal several similarities between humans' and models' mappings and explanations, suggesting that models represent concepts similarly to humans. This similarity is evident not only in the model representation but also in their behavior. Furthermore, the models mostly provide valid explanations and deploy reasoning paths that are similar to those of humans.
