Positive Text Reframing under Multi-strategy Optimization
Shutong Jia, Biwei Cao, Qingqing Gao, Jiuxin Cao, Bo Liu
TL;DR
The paper tackles positive text reframing, aiming to preserve original meaning while adding a complementary positive perspective. It introduces the Multi-Strategy Optimization Framework (MSOF), which combines reinforcement-based training with a positive sentiment and content-preservation rewards, decoding improvements, and a multi-dimensional re-ranking scheme to enforce strategy-consistency, content fidelity, and fluency. Extensive experiments on BART and T5 show significant gains in both unconstrained and controlled reframing tasks, across automatic metrics and human judgments, with the Top-k decoding variant often delivering the best overall performance. The work provides a practical, modular approach to controllable text reframing and introduces the RTQE metric to quantify reframing quality, contributing a robust toolchain for future research and applications in psychology-informed text generation.
Abstract
Differing from sentiment transfer, positive reframing seeks to substitute negative perspectives with positive expressions while preserving the original meaning. With the emergence of pre-trained language models (PLMs), it is possible to achieve acceptable results by fine-tuning PLMs. Nevertheless, generating fluent, diverse and task-constrained reframing text remains a significant challenge. To tackle this issue, a \textbf{m}ulti-\textbf{s}trategy \textbf{o}ptimization \textbf{f}ramework (MSOF) is proposed in this paper. Starting from the objective of positive reframing, we first design positive sentiment reward and content preservation reward to encourage the model to transform the negative expressions of the original text while ensuring the integrity and consistency of the semantics. Then, different decoding optimization approaches are introduced to improve the quality of text generation. Finally, based on the modeling formula of positive reframing, we propose a multi-dimensional re-ranking method that further selects candidate sentences from three dimensions: strategy consistency, text similarity and fluency. Extensive experiments on two Seq2Seq PLMs, BART and T5, demonstrate our framework achieves significant improvements on unconstrained and controlled positive reframing tasks.
