Robust SGLD algorithm for solving non-convex distributionally robust optimisation problems
Ariel Neufeld, Matthew Ng Cheng En, Ying Zhang
TL;DR
This work develops a robust Stochastic Gradient Langevin Dynamics (SGLD) method to solve a class of non-convex distributionally robust optimization problems. By applying Wasserstein-type duality, finite-grid discretisation, and Nesterov smoothing, the authors obtain a differentiable objective suitable for SGLD and establish non-asymptotic excess-risk bounds with explicit constants. The framework is demonstrated on a regression task with adversarial perturbations, showing both theoretical convergence guarantees and empirical improvements over vanilla SGLD in test accuracy. The approach highlights the practical value of incorporating model uncertainty in data-driven stochastic optimization and provides implementable methodology and code for robust learning under perturbations.
Abstract
In this paper we develop a Stochastic Gradient Langevin Dynamics (SGLD) algorithm tailored for solving a certain class of non-convex distributionally robust optimisation (DRO) problems. By deriving non-asymptotic convergence bounds, we build an algorithm which for any prescribed accuracy $\varepsilon>0$ outputs an estimator whose expected excess risk is at most $\varepsilon$. As a concrete application, we consider the problem of identifying the best non-linear estimator of a given regression model involving a neural network using adversarially corrupted samples. We formulate this problem as a DRO problem and demonstrate both theoretically and numerically the applicability of the proposed robust SGLD algorithm. Moreover, numerical experiments show that the robust SGLD estimator outperforms the estimator obtained using vanilla SGLD in terms of test accuracy, which highlights the advantage of incorporating model uncertainty when optimising with perturbed samples.
