Affective Flow Language Model for Emotional Support Conversation
Chenghui Zou, Ning Wang, Tiesunlong Shen, Luwei Xiao, Chuan Ma, Xiangpeng Li, Rui Mao, Erik Cambria
TL;DR
This work tackles the problem of long-horizon emotional support conversation where traditional alignment signals are sparse and poorly credit intermediate strategy decisions. It introduces AFlow, a framework that injects dense prefix-level supervision through affective flow signals derived from search-distilled trajectories and a Generative Flow Network perspective. AFlow couples AFPO, a subpath flow-balance objective, with a policy and value heads so that downstream value propagates coherently across dialogue prefixes, enabling stable and diverse strategy progression. Empirical results on two ESC datasets show consistent improvements in strategy alignment and response diversity, with robust performance across open-source backbones and even competitive standing against strong proprietary LLMs, underscoring the practical value of flow-shaped process supervision for long-horizon emotional support.
Abstract
Large language models (LLMs) have been widely applied to emotional support conversation (ESC). However, complex multi-turn support remains challenging.This is because existing alignment schemes rely on sparse outcome-level signals, thus offering limited supervision for intermediate strategy decisions. To fill this gap, this paper proposes affective flow language model for emotional support conversation (AFlow), a framework that introduces fine-grained supervision on dialogue prefixes by modeling a continuous affective flow along multi-turn trajectories. AFlow can estimate intermediate utility over searched trajectories and learn preference-consistent strategy transitions. To improve strategy coherence and empathetic response quality, a subpath-level flow-balance objective is presented to propagate preference signals to intermediate states. Experiment results show consistent and significant improvements over competitive baselines in diverse emotional contexts. Remarkably, AFlow with a compact open-source backbone outperforms proprietary LMMs such as GPT-4o and Claude-3.5 on major ESC metrics. Our code is available at https://github.com/chzou25-lgtm/AffectiveFlow.
