Can Large Language Models Predict Associations Among Human Attitudes?
Ana Ma, Derek Powell
TL;DR
This work investigates whether frontier LLMs can reason about the latent interrelations among human attitudes beyond surface-text similarity. Using 64 Pew-derived attitude items and a sample of 376 U.S. adults, the authors assess GPT-4o’s ability to (i) reproduce the pairwise attitude correlations and (ii) predict individual responses from other attitudes across semantically similar and dissimilar prompts. GPT-4o closely mirrors human attitude correlations (about $r=0.77$, $p<0.001$) and can predict individual responses, with prediction accuracy sometimes benefiting from semantically dissimilar prompts, demonstrating genuine social inferences beyond surface similarity. The results imply that LLMs encode latent belief-system structure and can perform cross-attitudinal reasoning, while also highlighting safety concerns around personalization, manipulation, and echo chambers in AI-assisted social inference.
Abstract
Prior work has shown that large language models (LLMs) can predict human attitudes based on other attitudes, but this work has largely focused on predictions from highly similar and interrelated attitudes. In contrast, human attitudes are often strongly associated even across disparate and dissimilar topics. Using a novel dataset of human responses toward diverse attitude statements, we found that a frontier language model (GPT-4o) was able to recreate the pairwise correlations among individual attitudes and to predict individuals' attitudes from one another. Crucially, in an advance over prior work, we tested GPT-4o's ability to predict in the absence of surface-similarity between attitudes, finding that while surface similarity improves prediction accuracy, the model was still highly-capable of generating meaningful social inferences between dissimilar attitudes. Altogether, our findings indicate that LLMs capture crucial aspects of the deeper, latent structure of human belief systems.
