Moreau-Yoshida Variational Transport: A General Framework For Solving Regularized Distributional Optimization Problems
Dai Hai Nguyen, Tetsuya Sakurai
TL;DR
This work tackles regularized distributional optimization by combining a variational representation with a non-smooth convex regularizer. It introduces Moreau-Yoshida Variational Transport (MYVT), which smooths the regularizer via a Moreau-Yoshida envelope and reformulates the objective as a concave-convex saddle point, solved with a primal-dual neural-transport approach. Theoretical results show convergence of the smoothed problem to the original and characterize gradient-flow dynamics under geodesic strong convexity; practically, the method parameterizes the target distribution with a neural transport map and optimizes a saddle-point objective. Empirically, MYVT achieves improved sparsity and smoothness in synthetic settings and superior FID/IS on noisy real-world image datasets compared to baselines like VT and GAN variants, demonstrating its effectiveness and regularization benefits for robust distributional modeling. The work advances scalable, regularized distributional optimization by blending proximal smoothing, variational representations, and neural transport, with broad implications for Bayesian inference, MCMC, and generative modeling.
Abstract
We consider a general optimization problem of minimizing a composite objective functional defined over a class of probability distributions. The objective is composed of two functionals: one is assumed to possess the variational representation and the other is expressed in terms of the expectation operator of a possibly nonsmooth convex regularizer function. Such a regularized distributional optimization problem widely appears in machine learning and statistics, such as proximal Monte-Carlo sampling, Bayesian inference and generative modeling, for regularized estimation and generation. We propose a novel method, dubbed as Moreau-Yoshida Variational Transport (MYVT), for solving the regularized distributional optimization problem. First, as the name suggests, our method employs the Moreau-Yoshida envelope for a smooth approximation of the nonsmooth function in the objective. Second, we reformulate the approximate problem as a concave-convex saddle point problem by leveraging the variational representation, and then develope an efficient primal-dual algorithm to approximate the saddle point. Furthermore, we provide theoretical analyses and report experimental results to demonstrate the effectiveness of the proposed method.
