Gradient-Based Constrained Sampling from Language Models
Sachin Kumar, Biswajit Paria, Yulia Tsvetkov
TL;DR
MuCoLa introduces a gradient-based, non-autoregressive constrained sampling framework for language models by performing Langevin Dynamics in embedding space. It seamlessly combines LM likelihood with differentiable constraints through a Lagrangian formulation, enabling soft and hard constraints without compromising fluency or task performance. Empirical results across toxicity avoidance, sentiment control, and keyword-guided generation demonstrate competitive constraint satisfaction, strong fluency, and preserved diversity, with scalable memory advantages over vocabulary-space approaches. The approach offers a flexible, plug-in method for controllable generation without fine-tuning the base LM.
Abstract
Large pretrained language models generate fluent text but are notoriously hard to controllably sample from. In this work, we study constrained sampling from such language models: generating text that satisfies user-defined constraints, while maintaining fluency and the model's performance in a downstream task. We propose MuCoLa -- a sampling procedure that combines the log-likelihood of the language model with arbitrary (differentiable) constraints in a single energy function, and then generates samples in a non-autoregressive manner. Specifically, it initializes the entire output sequence with noise and follows a Markov chain defined by Langevin Dynamics using the gradients of the energy function. We evaluate MuCoLa on text generation with soft and hard constraints as well as their combinations obtaining significant improvements over competitive baselines for toxicity avoidance, sentiment control, and keyword-guided generation.
