How Deep is Love in LLMs' Hearts? Exploring Semantic Size in Human-like Cognition

Yao Yao; Yifei Yang; Xinbei Ma; Dongjie Yang; Zhuosheng Zhang; Zuchao Li; Hai Zhao

TL;DR

The paper uses semantic size, the perceived magnitude of a word or concept, as a window into human-like cognition in large language models (LLMs). It follows three strands: metaphor-based understanding of abstract words, probing of internal representations, and attention bias in a real-world web-shopping scenario. The authors build datasets from the Glasgow Norms, evaluate both human participants and multiple LLMs, including multimodal models, and apply linear probes to the models' hidden representations. Multimodal training yields closer alignment with human semantic-size reasoning and a stronger internal encoding of size, but it also makes models more susceptible to semantically large, attention-grabbing content, which has implications for AI safety and cognitive science. Overall, the work argues that grounding through multiple modalities is crucial for approaching human-like cognition in LLMs and offers insight into how embodied experience shapes conceptual understanding.

Abstract

How human cognitive abilities are formed has long captivated researchers. However, a significant challenge lies in developing meaningful methods to measure these complex processes. With the advent of large language models (LLMs), which now rival human capabilities in various domains, we are presented with a unique testbed to investigate human cognition through a new lens. Among the many facets of cognition, one particularly crucial aspect is the concept of semantic size, the perceived magnitude of both abstract and concrete words or concepts. This study seeks to investigate whether LLMs exhibit similar tendencies in understanding semantic size, thereby providing insights into the underlying mechanisms of human cognition. We begin by exploring metaphorical reasoning, comparing how LLMs and humans associate abstract words with concrete objects of varying sizes. Next, we examine LLMs' internal representations to evaluate their alignment with human cognitive processes. Our findings reveal that multi-modal training is crucial for LLMs to achieve more human-like understanding, suggesting that real-world, multi-modal experiences are similarly vital for human cognitive development. Lastly, we examine whether LLMs are influenced by attention-grabbing headlines with larger semantic sizes in a real-world web shopping scenario. The results show that multi-modal LLMs are more emotionally engaged in decision-making, but this also introduces potential biases, such as the risk of manipulation through clickbait headlines. Ultimately, this study offers a novel perspective on how LLMs interpret and internalize language, from the smallest concrete objects to the most profound abstract concepts like love. The insights gained not only improve our understanding of LLMs but also provide new avenues for exploring the cognitive abilities that define human intelligence.
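
To make the idea of probing internal representations concrete, the following is a minimal sketch of a linear probe: a linear map fitted from hidden-state vectors to a scalar size rating, then scored by correlation on held-out items. It is not the authors' implementation. The data here is synthetic, the planted direction and noise level are arbitrary, and the names `fit_linear_probe`, `predict`, and `pearson` are hypothetical helpers introduced only for illustration. In a real setup the feature vectors would presumably be hidden states taken from an LLM layer and the targets would be human semantic-size ratings such as those in the Glasgow Norms.

```python
import random


def fit_linear_probe(features, targets, lr=0.05, epochs=500, l2=1e-3):
    """Fit weights w and bias b by gradient descent on L2-regularised squared error."""
    n, dim = len(features), len(features[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(features, targets):
            err = sum(wi * xi for wi, xi in zip(w, x)) + b - y
            for j in range(dim):
                grad_w[j] += err * x[j]
            grad_b += err
        w = [wi - lr * (gw / n + l2 * wi) for wi, gw in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(w, b, x):
    """Apply the fitted probe to a single feature vector."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def pearson(a, c):
    """Pearson correlation between two equal-length sequences."""
    ma, mc = sum(a) / len(a), sum(c) / len(c)
    cov = sum((x - ma) * (y - mc) for x, y in zip(a, c))
    var_a = sum((x - ma) ** 2 for x in a)
    var_c = sum((y - mc) ** 2 for y in c)
    return cov / (var_a * var_c) ** 0.5


# Synthetic stand-in data (assumption): 8-dimensional "hidden states" whose
# "size rating" is a noisy linear function of a planted direction.
rng = random.Random(0)
planted = [0.8, -0.5, 0.3, 0.0, 0.0, 0.6, -0.2, 0.1]
hidden = [[rng.gauss(0, 1) for _ in planted] for _ in range(200)]
ratings = [sum(p * h for p, h in zip(planted, row)) + rng.gauss(0, 0.1)
           for row in hidden]

# Fit on the first 150 items, evaluate on the remaining 50.
w, b = fit_linear_probe(hidden[:150], ratings[:150])
held_out_pred = [predict(w, b, x) for x in hidden[150:]]
r = pearson(held_out_pred, ratings[150:])
print(f"held-out Pearson r = {r:.3f}")
```

A high held-out correlation in a setup like this would indicate that the rating is linearly decodable from the representation. On real hidden states, comparing that score across layers or across text-only and multimodal models is one way such a probe could be used.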
