Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

Haoyang Zheng; Hengrong Du; Qi Feng; Wei Deng; Guang Lin

Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

Haoyang Zheng, Hengrong Du, Qi Feng, Wei Deng, Guang Lin

TL;DR

This work introduces Reflected Replica Exchange Langevin Diffusion (r2LD) to perform constrained non-convex exploration within a bounded domain, combining reflection with temperature-swapped Langevin dynamics. The authors prove rigorous convergence guarantees in $\chi^2$-divergence and $W_2$ distance, with rates that scale quadratically with the domain diameter via Poincaré and Log-Sobolev inequalities, and they quantify discretization error in $W_1$ for the practical algorithm. The practical instantiation, r2SGLD, uses mini-batch gradients, reflection, and a corrected swapping term to accelerate mixing across multimodal landscapes while avoiding boundary leakage. Empirically, r2SGLD improves Lorenz system identification under physical constraints, robustly captures all modes in constrained multimodal distributions, and enables larger learning rates with improved uncertainty estimates in CIFAR-100 classification. Overall, the paper demonstrates that constrained exploration, enabled by reflection and replica exchange, yields substantial efficiency gains and broader applicability to dynamical systems and deep learning tasks.

Abstract

Replica exchange stochastic gradient Langevin dynamics (reSGLD) is an effective sampler for non-convex learning in large-scale datasets. However, the simulation may encounter stagnation issues when the high-temperature chain delves too deeply into the distribution tails. To tackle this issue, we propose reflected reSGLD (r2SGLD): an algorithm tailored for constrained non-convex exploration by utilizing reflection steps within a bounded domain. Theoretically, we observe that reducing the diameter of the domain enhances mixing rates, exhibiting a $\textit{quadratic}$ behavior. Empirically, we test its performance through extensive experiments, including identifying dynamical systems with physical constraints, simulations of constrained multi-modal distributions, and image classification tasks. The theoretical and empirical findings highlight the crucial role of constrained exploration in improving the simulation efficiency.

Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

TL;DR

-divergence and

distance, with rates that scale quadratically with the domain diameter via Poincaré and Log-Sobolev inequalities, and they quantify discretization error in

for the practical algorithm. The practical instantiation, r2SGLD, uses mini-batch gradients, reflection, and a corrected swapping term to accelerate mixing across multimodal landscapes while avoiding boundary leakage. Empirically, r2SGLD improves Lorenz system identification under physical constraints, robustly captures all modes in constrained multimodal distributions, and enables larger learning rates with improved uncertainty estimates in CIFAR-100 classification. Overall, the paper demonstrates that constrained exploration, enabled by reflection and replica exchange, yields substantial efficiency gains and broader applicability to dynamical systems and deep learning tasks.

Abstract

behavior. Empirically, we test its performance through extensive experiments, including identifying dynamical systems with physical constraints, simulations of constrained multi-modal distributions, and image classification tasks. The theoretical and empirical findings highlight the crucial role of constrained exploration in improving the simulation efficiency.

Paper Structure (24 sections, 9 theorems, 39 equations, 13 figures, 1 table, 2 algorithms)

This paper contains 24 sections, 9 theorems, 39 equations, 13 figures, 1 table, 2 algorithms.

Introduction
Methodology
Reflected Replica Exchange Langevin Diffusion
The Infinitesimal Generator
The Proposed Algorithm
Theoretical Analysis
Convergence in $\chi^2$-Divergence
Convergence in 2-Wasserstein Distance
Discretization Analysis
Experiments
Identifying the Lorenz Systems
Constrained Multi-modal Simulations
Non-convex Optimization for Image Classifications
Conclusion and Discussion
Experimental Details
...and 9 more sections

Key Result

Lemma 3.2

$\{{\boldsymbol \beta}_t\}_{t\ge 0}$ is reversible and its invariant distribution $\pi$ is given by pt_density_main.

Figures (13)

Figure 1: Trajectory of r2SGLD.
Figure 2: Schematic of the r2SGLD algorithm, demonstrated on identifying the Lorenz system.
Figure 3: Simulation of the Lorenz system based on the empirical posterior modes of model parameters.
Figure 4: Posterior distributions of the identified model parameters.
Figure 5: Empirical behavior on multi-mode distributions with flower-shaped boundaries.
...and 8 more figures

Theorems & Definitions (13)

Remark 1.1
Remark 3.1
Lemma 3.2
Lemma 3.3
Theorem 3.4
Lemma 3.5
Remark 3.6
Lemma 3.7
Theorem 3.8
Lemma 3.9
...and 3 more

Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

TL;DR

Abstract

Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (13)