Langevin dynamics based algorithm e-TH$\varepsilon$O POULA for stochastic optimization problems with discontinuous stochastic gradient
Dong-Young Lim, Ariel Neufeld, Sotirios Sabanis, Ying Zhang
TL;DR
The paper tackles stochastic optimization with discontinuous stochastic gradients, a setting arising in quantile/CVaR problems, vector quantization, and ReLU-based neural nets. It introduces e-TH$\varepsilon$O POULA, a Langevin-dynamics–based algorithm that uses polygonal Euler approximations and a taming/boosting gradient scheme to achieve non-asymptotic convergence to the invariant measure $\pi_{\beta}\propto e^{-\beta u}$, along with explicit bounds for the Wasserstein distance and the expected excess risk. Theoretical contributions include assumptions under which $W_1$ and $W_2$ convergence rates are exponential in the discretization step and polynomial in moments, plus an explicit excess-risk bound with controllable $\beta$ and $\lambda$. Empirically, e-TH$\varepsilon$O POULA outperforms SGLD, TUSLA, ADAM, and AMSGrad in high-dimensional, discontinuous-gradient tasks, including multi-period portfolio optimization with transfer learning and nonlinear Gamma regression on real data, highlighting its practical impact for finance, insurance, and deep learning contexts where convergence guarantees are critical.
Abstract
We introduce a new Langevin dynamics based algorithm, called e-TH$\varepsilon$O POULA, to solve optimization problems with discontinuous stochastic gradients which naturally appear in real-world applications such as quantile estimation, vector quantization, CVaR minimization, and regularized optimization problems involving ReLU neural networks. We demonstrate both theoretically and numerically the applicability of the e-TH$\varepsilon$O POULA algorithm. More precisely, under the conditions that the stochastic gradient is locally Lipschitz in average and satisfies a certain convexity at infinity condition, we establish non-asymptotic error bounds for e-TH$\varepsilon$O POULA in Wasserstein distances and provide a non-asymptotic estimate for the expected excess risk, which can be controlled to be arbitrarily small. Three key applications in finance and insurance are provided, namely, multi-period portfolio optimization, transfer learning in multi-period portfolio optimization, and insurance claim prediction, which involve neural networks with (Leaky)-ReLU activation functions. Numerical experiments conducted using real-world datasets illustrate the superior empirical performance of e-TH$\varepsilon$O POULA compared to SGLD, TUSLA, ADAM, and AMSGrad in terms of model accuracy.
