Social Comparison without Explicit Inference of Others' Reward Values: A Constructive Approach Using a Probabilistic Generative Model
Yosuke Taniuchi, Chie Hieida, Atsushi Noritake, Kazushi Ikeda, Masaki Isoda
TL;DR
This study investigates whether macaques infer a partner's subjective reward valuation during social comparison or rely solely on observed objective rewards. It compares three probabilistic generative models—Internal Prediction Model, No Comparison Model, and External Comparison Model—using a multimodal Latent Dirichlet Allocation framework within the SERKET infrastructure on a long-term dataset of two monkeys facing six stimulus conditions. The External Comparison Model, which uses partner reward observations without inferring subjective states, shows the strongest alignment with behavioral data (Rand Index ≈ 0.88) and superior predictive and clustering performance, suggesting social comparison is driven by objective reward differences rather than inferred partner valuations. The findings have implications for understanding social valuation in primates and guide future neural and human studies to probe the generality and neural underpinnings of external versus inferred social information in valuation.
Abstract
Social comparison$\unicode{x2014}$the process of evaluating one's rewards relative to others$\unicode{x2014}$plays a fundamental role in primate social cognition. However, it remains unknown from a computational perspective how information about others' rewards affects the evaluation of one's own reward. With a constructive approach, this study examines whether monkeys merely recognize objective reward differences or, instead, infer others' subjective reward valuations. We developed three computational models with varying degrees of social information processing: an Internal Prediction Model (IPM), which infers the partner's subjective values; a No Comparison Model (NCM), which disregards partner information; and an External Comparison Model (ECM), which directly incorporates the partner's objective rewards. To test model performance, we used a multi-layered, multimodal latent Dirichlet allocation. We trained the models on a dataset containing the behavior of a pair of monkeys, their rewards, and the conditioned stimuli. Then, we evaluated the models' ability to classify subjective values across pre-defined experimental conditions. The ECM achieved the highest classification score in the Rand Index (0.88 vs. 0.79 for the IPM) under our settings, suggesting that social comparison relies on objective reward differences rather than inferences about subjective states.
