Table of Contents
Fetching ...

SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers

Keyang Xuan, Pengda Wang, Chongrui Ye, Haofei Yu, Tal August, Jiaxuan You

TL;DR

SocialVeil addresses the gap in evaluating language models under imperfect communication by introducing a barrier-aware social learning environment with a literature-grounded taxonomy of semantic vagueness, sociocultural mismatch, and emotional interference. It presents barrier injection, an episode-based simulation, and a barrier-aware evaluation protocol featuring Unresolved Confusion and Mutual Understanding, validated against human judgments. Across 720 scenarios and four frontier LLMs, barriers consistently degrade social interaction, with human studies confirming barrier fidelity and metric alignment, while adaptation strategies yield only modest improvements. The work advances realistic evaluation of social intelligence in LLMs and motivates developing more robust, barrier-aware training and grounding methods for socially adept AI agents.

Abstract

Large language models (LLMs) are increasingly evaluated in interactive environments to test their social intelligence. However, existing benchmarks often assume idealized communication between agents, limiting our ability to diagnose whether LLMs can maintain and repair interactions in more realistic, imperfect settings. To close this gap, we present \textsc{SocialVeil}, a social learning environment that can simulate social interaction under cognitive-difference-induced communication barriers. Grounded in a systematic literature review of communication challenges in human interaction, \textsc{SocialVeil} introduces three representative types of such disruption, \emph{semantic vagueness}, \emph{sociocultural mismatch}, and \emph{emotional interference}. We also introduce two barrier-aware evaluation metrics, \emph{unresolved confusion} and \emph{mutual understanding}, to evaluate interaction quality under impaired communication. Experiments across 720 scenarios and four frontier LLMs show that barriers consistently impair performance, with mutual understanding reduced by over 45\% on average, and confusion elevated by nearly 50\%. Human evaluations validate the fidelity of these simulated barriers (ICC$\approx$0.78, Pearson r$\approx$0.80). We further demonstrate that adaptation strategies (Repair Instruction and Interactive learning) only have a modest effect far from barrier-free performance. This work takes a step toward bringing social interaction environments closer to real-world communication, opening opportunities for exploring the social intelligence of LLM agents.

SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers

TL;DR

SocialVeil addresses the gap in evaluating language models under imperfect communication by introducing a barrier-aware social learning environment with a literature-grounded taxonomy of semantic vagueness, sociocultural mismatch, and emotional interference. It presents barrier injection, an episode-based simulation, and a barrier-aware evaluation protocol featuring Unresolved Confusion and Mutual Understanding, validated against human judgments. Across 720 scenarios and four frontier LLMs, barriers consistently degrade social interaction, with human studies confirming barrier fidelity and metric alignment, while adaptation strategies yield only modest improvements. The work advances realistic evaluation of social intelligence in LLMs and motivates developing more robust, barrier-aware training and grounding methods for socially adept AI agents.

Abstract

Large language models (LLMs) are increasingly evaluated in interactive environments to test their social intelligence. However, existing benchmarks often assume idealized communication between agents, limiting our ability to diagnose whether LLMs can maintain and repair interactions in more realistic, imperfect settings. To close this gap, we present \textsc{SocialVeil}, a social learning environment that can simulate social interaction under cognitive-difference-induced communication barriers. Grounded in a systematic literature review of communication challenges in human interaction, \textsc{SocialVeil} introduces three representative types of such disruption, \emph{semantic vagueness}, \emph{sociocultural mismatch}, and \emph{emotional interference}. We also introduce two barrier-aware evaluation metrics, \emph{unresolved confusion} and \emph{mutual understanding}, to evaluate interaction quality under impaired communication. Experiments across 720 scenarios and four frontier LLMs show that barriers consistently impair performance, with mutual understanding reduced by over 45\% on average, and confusion elevated by nearly 50\%. Human evaluations validate the fidelity of these simulated barriers (ICC0.78, Pearson r0.80). We further demonstrate that adaptation strategies (Repair Instruction and Interactive learning) only have a modest effect far from barrier-free performance. This work takes a step toward bringing social interaction environments closer to real-world communication, opening opportunities for exploring the social intelligence of LLM agents.
Paper Structure (46 sections, 4 equations, 7 figures, 7 tables)

This paper contains 46 sections, 4 equations, 7 figures, 7 tables.

Figures (7)

  • Figure 1: Existing social benchmarks typically assume idealized conditions. A comparison between existing benchmarks scenarios and real-world conversations, highlighting challenges in semantic vagueness, sociocultural mismatch and emotional interference.
  • Figure 2: Overview of SocialVeil. The pipeline consists of three stages: (Barrier Injection) scenarios are neutralized to remove bias and barriers are systematically injected; (Interaction Simulation) agents engage in multi-turn dialogue under barrier conditions; (Evaluation) agent performance is assessed using automatic metrics (goal achievement, knowledge seeking, believability, relationship quality, confusion, mutual understanding) and human evaluation (alignment analysis and barrier navigation success).
  • Figure 3: t-SNE visualization of the final-layer representations from Qwen2.5-7B-Instruct.
  • Figure 4: Linguistic signal and metrics correlations across four models. *$p$ < .05, **$p$ < .01, ***$p$ < .001.
  • Figure 5: Signatures of simulated communication barriers. Data are shown as percentage deviations relative to other types *$p$ < .05.
  • ...and 2 more figures