Table of Contents
Fetching ...

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

Jialu Du, Guiyang Hou, Yihui Fu, Chen Wu, Wenqi Zhang, Yongliang Shen, Weiming Lu

TL;DR

This work addresses the challenge of social reasoning in large language models, where cognitive confusion and conflation of objective events with agents’ beliefs hinder reliable judgment. It proposes an adaptive world-model-enhanced reasoning framework with a cognitive-intervention trigger that constructs a dynamic textual world model and intervenes when confusion signals appear, guiding the model back to coherent reasoning. The approach yields measurable improvements in accuracy and substantial token-efficiency gains across ToMi, Hi-ToM, and ExploreToM benchmarks, with larger models benefiting more from the interventions. By explicitly separating external world states from internal beliefs through stateful world models, the method offers a practical path to deploying LLMs in real-world social reasoning tasks.

Abstract

While large language models (LLMs) excel in mathematical and code reasoning, we observe they struggle with social reasoning tasks, exhibiting cognitive confusion, logical inconsistencies, and conflation between objective world states and subjective belief states. Through deteiled analysis of DeepSeek-R1's reasoning trajectories, we find that LLMs frequently encounter reasoning impasses and tend to output contradictory terms like "tricky" and "confused" when processing scenarios with multiple participants and timelines, leading to erroneous reasoning or infinite loops. The core issue is their inability to disentangle objective reality from agents' subjective beliefs. To address this, we propose an adaptive world model-enhanced reasoning mechanism that constructs a dynamic textual world model to track entity states and temporal sequences. It dynamically monitors reasoning trajectories for confusion indicators and promptly intervenes by providing clear world state descriptions, helping models navigate through cognitive dilemmas. The mechanism mimics how humans use implicit world models to distinguish between external events and internal beliefs. Evaluations on three social benchmarks demonstrate significant improvements in accuracy (e.g., +10% in Hi-ToM) while reducing computational costs (up to 33.8% token reduction), offering a simple yet effective solution for deploying LLMs in social contexts.

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

TL;DR

This work addresses the challenge of social reasoning in large language models, where cognitive confusion and conflation of objective events with agents’ beliefs hinder reliable judgment. It proposes an adaptive world-model-enhanced reasoning framework with a cognitive-intervention trigger that constructs a dynamic textual world model and intervenes when confusion signals appear, guiding the model back to coherent reasoning. The approach yields measurable improvements in accuracy and substantial token-efficiency gains across ToMi, Hi-ToM, and ExploreToM benchmarks, with larger models benefiting more from the interventions. By explicitly separating external world states from internal beliefs through stateful world models, the method offers a practical path to deploying LLMs in real-world social reasoning tasks.

Abstract

While large language models (LLMs) excel in mathematical and code reasoning, we observe they struggle with social reasoning tasks, exhibiting cognitive confusion, logical inconsistencies, and conflation between objective world states and subjective belief states. Through deteiled analysis of DeepSeek-R1's reasoning trajectories, we find that LLMs frequently encounter reasoning impasses and tend to output contradictory terms like "tricky" and "confused" when processing scenarios with multiple participants and timelines, leading to erroneous reasoning or infinite loops. The core issue is their inability to disentangle objective reality from agents' subjective beliefs. To address this, we propose an adaptive world model-enhanced reasoning mechanism that constructs a dynamic textual world model to track entity states and temporal sequences. It dynamically monitors reasoning trajectories for confusion indicators and promptly intervenes by providing clear world state descriptions, helping models navigate through cognitive dilemmas. The mechanism mimics how humans use implicit world models to distinguish between external events and internal beliefs. Evaluations on three social benchmarks demonstrate significant improvements in accuracy (e.g., +10% in Hi-ToM) while reducing computational costs (up to 33.8% token reduction), offering a simple yet effective solution for deploying LLMs in social contexts.

Paper Structure

This paper contains 21 sections, 11 figures, 5 tables.

Figures (11)

  • Figure 1: Our adaptive world model-enhanced reasoning mechanism. The system detects confusing words and adaptively use world model to intervene the reasoning trajectories, achieving significant improvements in both accuracy and token efficiency across three social reasoning benchmarks.
  • Figure 1: Keywords Representing Confusion States
  • Figure 2: Analysis of DeepSeek-R1's reasoning dilemmas in social reasoning tasks. (a): Overthinking examples with confusion words. (b) and (c): Statistics of confusion words and average token consumption from 100 correct cases and 100 wrong cases.
  • Figure 3: Semantic similarity distribution of candidate interruption words in DeepSeek-R1's reasoning trajectories.
  • Figure 4: Comparison of confusion degree and semantic similarity between candidates and traditional interruption words.
  • ...and 6 more figures