LingGen: Scalable Multi-Attribute Linguistic Control via Power-Law Masking
Mohamed Elgaar, Hadi Amiri
TL;DR
LingGen addresses the challenge of fine-grained, real-valued linguistic control in controlled text generation by introducing a dedicated attribute encoder whose output is injected into the BOS embedding of a base LLM. A key contribution is P-MASKING, which samples per-example masking rates from a truncated Pareto distribution to enable robust control across variable attribute subsets from 1 to $k$ (up to $k=40$). Empirical results show LingGen achieves the lowest average attribute-control error (MSE) while preserving high fluency and inference efficiency, outperforming a wide set of baselines including prompting, fine-tuning, and decoding-time control methods. The work also analyzes attribute interactions, uncovering synergies and conflicts among linguistic attributes, and demonstrates LingGen’s robustness across different attribute counts and seeds. This approach enables scalable, controllable text generation with practical impact in accessibility, personalization, and education while highlighting ethical considerations for responsible deployment.
Abstract
We present LingGen, a controlled text generation model that allows fine-grained control over a large number of real-valued linguistic attributes. It encodes target attribute values with a dedicated linguistic attribute encoder and conditions the language model by injecting the resulting representation into the language model using the beginning-of-sequence (BOS) embeddings. To improve robustness when controlling different attribute subsets, we introduce P-MASKING, which samples per-example attribute masking rates from a truncated Pareto distribution during training. Across 1-40 control attributes, LingGen achieves the lowest average control error among evaluated methods, while remaining efficient at inference and receiving the highest fluency scores in human evaluation. Ablations show that Pareto-sampled masking and BOS-based injection are effective choices compared to alternative masking and integration variants.
