Large Language Models have Chain-of-Affective
Junjie Xu, Xingjiao Wu, Luwei Xiao, Yuzhe Yang, Jie Zhou, Zihao Zhang, Luhan Wang, Yi Huang, Nan Wu, Yingbin Zheng, Chao Yan, Cheng Jin, Honglin Li, Liang He
TL;DR
This work probes whether contemporary LLMs implement a structured chain-of-affective by examining internal affective dynamics across eight families and their external consequences. Through two experimental modules, it maps baseline affect fingerprints, longitudinal reactions to negative input, affect-driven content selection, and downstream effects on task performance, human experience, and inter-agent dynamics. The findings reveal stable, family-specific affective profiles, a three-phase response to sustained negative input, and robust yet context-sensitive impacts on generation, user perception, and group behavior; collectively, affect emerges as an emergent control layer requiring explicit evaluation and governance in alignment and system design. The study introduces methodologies like 9S-State-Eval, KURC-Bench, and Affective-Enhanced Agent Reconstruction to quantify and reconstruct these dynamics, advocating for affect-aware safety and multi-agent architecture as a core focus for future work.
Abstract
Large language models (LLMs) are increasingly deployed as collaborative agents in emotionally charged settings, yet most evaluations treat them as purely cognitive systems and largely ignore their affective behaviour. Here we take a functional perspective and ask whether contemporary LLMs implement a structured chain-of-affective: organised affective dynamics that are family-specific, temporally coherent and behaviourally consequential. Across eight major LLM families (GPT, Gemini, Claude, Grok, Qwen, DeepSeek, GLM, Kimi), we combine two experimental modules. The first characterises inner chains-of-affective via baseline ''affective fingerprints'', 15-round sad-news exposure, and a 10-round news self-selection paradigm. We find stable, family-specific affective profiles, a reproducible three-phase trajectory under sustained negative input (accumulation, overload, defensive numbing), distinct defence styles, and human-like negativity biases that induce self-reinforcing affect-choice feedback loops. The second module probes outer consequences using a composite performance benchmark, human-AI dialogues on contentious topics, and multi-agent LLM interactions. We demonstrate that induced affect preserves core reasoning while reshaping high-freedom generation. Sentiment metrics predict user comfort and empathy but reveal trade-offs in resisting problematic views. In multi-agent settings, group structure drives affective contagion, role specialization (initiators, absorbers, firewalls), and bias. We characterize affect as an emergent control layer, advocating for 'chains-of-affect' as a primary target for evaluation and alignment.
