Structured Voronoi Sampling
Afra Amini, Li Du, Ryan Cotterell
TL;DR
Structured Voronoi Sampling (SVS) provides a principled gradient-based framework for text generation by encoding discrete language-model distributions as densities over embeddings via $$(\mathcal{K}, \mu)$$-Voronoi measures and sampling with a refractive Hamiltonian Monte Carlo, addressing discontinuities with reflection/refraction. It extends to structured sequences and controlled generation through $p_V(\mathbf{V})$ and $p_V(\mathbf{V}|t)$, with a base-measure design that yields a tractable gradient $\nabla_{\mathbf{x}} \log p_V(\mathbf{x})$. The authors prove detailed balance for the SVS sampler and demonstrate empirical advantages on toy distributions, language-model sampling, and controlled-generation tasks over baselines like MuCoLa, fudge, and Langevin, particularly in distributional fidelity and adherence to control targets while maintaining fluency and diversity. The work advances a theoretically grounded MCMC approach for discrete text generation and lays groundwork for principled, controllable generation with potential practical impact in safer and more faithful LM applications.
Abstract
Gradient-based sampling algorithms have demonstrated their effectiveness in text generation, especially in the context of controlled text generation. However, there exists a lack of theoretically grounded and principled approaches for this task. In this paper, we take an important step toward building a principled approach for sampling from language models with gradient-based methods. We use discrete distributions given by language models to define densities and develop an algorithm based on Hamiltonian Monte Carlo to sample from them. We name our gradient-based technique Structured Voronoi Sampling (SVS). In an experimental setup where the reference distribution is known, we show that the empirical distribution of SVS samples is closer to the reference distribution compared to alternative sampling schemes. Furthermore, in a controlled generation task, SVS is able to generate fluent and diverse samples while following the control targets significantly better than other methods.
