C$^3$TG: Conflict-aware, Composite, and Collaborative Controlled Text Generation
Yu Li, Zhe Yang, Yi Huang, Xin Liu, Guilin Qi
TL;DR
C3TG tackles multi-dimensional controlled text generation where attributes can conflict by coupling an LLM with lightweight attribute classifiers and a two-phase pipeline. The generation phase fuses attribute priors via a weighted KL divergence, yielding $P^{*}(x|x_{1:t-1})$ from a weighted geometric mean of priors, while the optimization phase uses an energy function $E(x)$ that combines classifier alignment $\sum_i \alpha_i |C_{A_i}(x)-T_i|$ and stability penalties $\Omega_{overlap}(x)$ to iteratively refine text through an agent-guided chain of prompts with a convergence threshold $\tau=0.025$. The framework handles 17 sub-dimensions with classifier feedback and a three-stage rewrite loop (core calibration, balancing, global fine-tuning), enabling conflict resolution and preserving fluency and diversity. Experiments on ROCStories and WritingPrompts show C3TG achieves higher attribute accuracy, reduced toxicity, and better human judgments than decoding-based and indirect CTG baselines, validating its flexible, scalable approach to multi-attribute control without extensive model modification.
Abstract
Recent advancements in large language models (LLMs) have demonstrated remarkable text generation capabilities. However, controlling specific attributes of generated text remains challenging without architectural modifications or extensive fine-tuning. Current methods typically toggle a single, basic attribute but struggle with precise multi-attribute control. In scenarios where attribute requirements conflict, existing methods lack coordination mechanisms, causing interference between desired attributes. Furthermore, these methods fail to incorporate iterative optimization processes in the controlled generation pipeline. To address these limitations, we propose Conflict-aware, Composite, and Collaborative Controlled Text Generation (C$^3$TG), a two-phase framework for fine-grained, multi-dimensional text attribute control. During generation, C$^3$TG selectively pairs the LLM with the required attribute classifiers from the 17 available dimensions and employs weighted KL-divergence to adjust token probabilities. The optimization phase then leverages an energy function combining classifier scores and penalty terms to resolve attribute conflicts through iterative feedback, enabling precise control over multiple dimensions simultaneously while preserving natural text flow. Experiments show that C$^3$TG significantly outperforms baselines across multiple metrics including attribute accuracy, linguistic fluency, and output diversity, while simultaneously reducing toxicity. These results establish C$^3$TG as an effective and flexible solution for multi-dimensional text attribute control that requires no costly model modifications.
