Constrained Sampling with Primal-Dual Langevin Monte Carlo
Luiz F. O. Chamon, Mohammad Reza Karimi, Anna Korba
TL;DR
This paper tackles sampling from a target distribution π while enforcing distribution-level constraints on moments and other statistics. It introduces PD-LMC, a discrete-time primal-dual Langevin Monte Carlo method operating in Wasserstein space to jointly optimize over the sample distribution μ and dual variables (λ,ν). Under (strong) convexity and log-Sobolev inequalities, the authors establish sublinear convergence of the algorithm in KL divergence and Wasserstein distance, respectively, and extend results to LSIs with a two-timescale DLMC variant. Through experiments on constrained Gaussian sampling, fairness in Bayesian inference, and counterfactual stock-market analysis, PD-LMC demonstrates effective constraint satisfaction with limited excursions outside feasible regions and provides interpretable dual variables that quantify constraint sensitivity and counterfactual impact.
Abstract
This work considers the problem of sampling from a probability distribution known up to a normalization constant while satisfying a set of statistical constraints specified by the expected values of general nonlinear functions. This problem finds applications in, e.g., Bayesian inference, where it can constrain moments to evaluate counterfactual scenarios or enforce desiderata such as prediction fairness. Methods developed to handle support constraints, such as those based on mirror maps, barriers, and penalties, are not suited for this task. This work therefore relies on gradient descent-ascent dynamics in Wasserstein space to put forward a discrete-time primal-dual Langevin Monte Carlo algorithm (PD-LMC) that simultaneously constrains the target distribution and samples from it. We analyze the convergence of PD-LMC under standard assumptions on the target distribution and constraints, namely (strong) convexity and log-Sobolev inequalities. To do so, we bring classical optimization arguments for saddle-point algorithms to the geometry of Wasserstein space. We illustrate the relevance and effectiveness of PD-LMC in several applications.
