Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation
Tianqi Zhong, Zhaoyi Li, Quan Wang, Linqi Song, Ying Wei, Defu Lian, Zhendong Mao
TL;DR
The paper tackles the challenge of compositional generalization in multi-aspect controllable text generation by introducing CompMCTG, a holistic benchmark built from Fyelp, Amazon, YELP, and Mixture with a three-dimensional Hold-Out/ACD/Few-Shot evaluation protocol. It shows that existing MCTG methods, including decoding-time, separate-training, and joint-training approaches, suffer noticeable generalization gaps when tested on novel attribute combinations, with LLMs offering fluent but less controllable outputs. To mitigate this, the authors propose Meta-MCTG, a meta-learning framework inspired by MAML that jointly trains with pseudo-compositional batches to encourage better generalization to unseen attribute recombinations. Across experiments on eight baselines and two LLMs, Meta-MCTG improves compositional testing performance in the majority of cases (up to 3.64% gain) while preserving fluency and often enhancing in-distribution performance, underscoring the benchmark’s value and the potential of meta-learning to enhance compositional robustness in MCTG.
Abstract
Compositional generalization, representing the model's ability to generate text with new attribute combinations obtained by recombining single attributes from the training data, is a crucial property for multi-aspect controllable text generation (MCTG) methods. Nonetheless, a comprehensive compositional generalization evaluation benchmark of MCTG is still lacking. We propose CompMCTG, a benchmark encompassing diverse multi-aspect labeled datasets and a crafted three-dimensional evaluation protocol, to holistically evaluate the compositional generalization of MCTG approaches. We observe that existing MCTG works generally confront a noticeable performance drop in compositional testing. To mitigate this issue, we introduce Meta-MCTG, a training framework incorporating meta-learning, where we enable models to learn how to generalize by simulating compositional generalization scenarios in the training phase. We demonstrate the effectiveness of Meta-MCTG through achieving obvious improvement (by at most 3.64%) for compositional testing performance in 94.4% cases.
