Social-R1: Towards Human-like Social Reasoning in LLMs

Jincenzi Wu; Yuxuan Lei; Jianxun Lian; Yitian Huang; Lexin Zhou; Haotian Li; Xing Xie; Helen Meng

Social-R1: Towards Human-like Social Reasoning in LLMs

Jincenzi Wu, Yuxuan Lei, Jianxun Lian, Yitian Huang, Lexin Zhou, Haotian Li, Xing Xie, Helen Meng

TL;DR

This work introduces ToMBench-Hard, an adversarial benchmark designed to provide hard training examples for social reasoning and proposes Social-R1, a reinforcement learning framework that aligns model reasoning with human cognition through multi-dimensional rewards.

Abstract

While large language models demonstrate remarkable capabilities across numerous domains, social intelligence - the capacity to perceive social cues, infer mental states, and generate appropriate responses - remains a critical challenge, particularly for enabling effective human-AI collaboration and developing AI that truly serves human needs. Current models often rely on superficial patterns rather than genuine social reasoning. We argue that cultivating human-like social intelligence requires training with challenging cases that resist shortcut solutions. To this end, we introduce ToMBench-Hard, an adversarial benchmark designed to provide hard training examples for social reasoning. Building on this, we propose Social-R1, a reinforcement learning framework that aligns model reasoning with human cognition through multi-dimensional rewards. Unlike outcome-based RL, Social-R1 supervises the entire reasoning process, enforcing structural alignment, logical integrity, and information density. Results show that our approach enables a 4B parameter model to surpass much larger counterparts and generalize robustly across eight diverse benchmarks. These findings demonstrate that challenging training cases with trajectory-level alignment offer a path toward efficient and reliable social intelligence.

Social-R1: Towards Human-like Social Reasoning in LLMs

TL;DR

Abstract

Paper Structure (43 sections, 4 equations, 12 figures, 18 tables)

This paper contains 43 sections, 4 equations, 12 figures, 18 tables.

Introduction
Related Work
Theory-of-Mind Analysis in LLMs
RL for LLM General Reasoning
The ToMBench-Hard Benchmark
Adversarial Data Construction
Diagnostic Analysis: Exposing the Shortcut Illusion
The Social-R1 Framework
Multi-Dimensional Reward Design
Verifiable Format Alignment ($R_{{fmt}}$)
Reward Synthesis and Learning
Experiment
Experiment Setting
SIP Content Integrity Reward Model ($RM_{content}$).
Implementation Details
...and 28 more sections

Figures (12)

Figure 1: Social-R1 for Human-like and Efficient Social Reasoning. By integrating SIP-guided rewards into reinforcement learning, Social-R1 mitigates reasoning shortcuts and enforces structured human-like social inference, improving both accuracy and efficiency across model scales. Detailed cases are in Appendix \ref{['app:0mainfigure']}.
Figure 2: Option-Mention Density across SIP reasoning stages.
Figure 3: Stage-wise SIP accuracy across models.
Figure 4: Case study highlighting the Interpretation Bottleneck. Detailed cases are provided in Appendix \ref{['app:5optionmatching']}.
Figure 5: Robustness study under story-consistent distractors.
...and 7 more figures

Social-R1: Towards Human-like Social Reasoning in LLMs

TL;DR

Abstract

Social-R1: Towards Human-like Social Reasoning in LLMs

Authors

TL;DR

Abstract

Table of Contents

Figures (12)