Mini-batch descent in semiflows
Alberto Domínguez Corella, Martín Hernández
TL;DR
The paper develops a continuous mini-batch descent framework for gradient flows of convex lower semicontinuous functionals on Hilbert spaces, introducing a time-discretized, randomized batching mechanism with waiting time $\varepsilon$. It proves that the mini-batch trajectory $v_{\varepsilon}$ converges in expectation to the full gradient flow $u$ under a non-biased subgradient decomposition and a variance measure $\Lambda$, with explicit bounds and rates that improve under Lipschitz/Hölder assumptions. A randomized minimizing movement scheme is also analyzed, yielding convergence to the gradient flow and, under inf-compactness, convergence to minimizers of the full loss; the framework is extended to a variety of problems, including constrained optimization, sparse inversion, and random domain decomposition for parabolic obstacle problems, with several illustrative numerical examples. The results provide a versatile, variance-aware approach to trajectory-level optimization in infinite-dimensional settings, enabling efficient approximations of complex evolution equations and PDEs with practical gains in computation and robustness to stochastic subproblem selection.
Abstract
This paper investigates the application of mini-batch gradient descent to semiflows (gradient flows). Given a loss function (potential), we introduce a continuous version of mini-batch gradient descent by randomly selecting sub-loss functions over time, defining a piecewise flow. We prove that, under suitable assumptions on the potential generating the semiflow, the \textit{mini-batch descent flow} trajectory closely approximates the original semiflow trajectory on average. In addition, we study a randomized minimizing movement scheme that also approximates the semiflow of the full loss function. We illustrate the versatility of this approach across various problems, including constrained optimization, sparse inversion, and domain decomposition. Finally, we validate our results with several numerical examples.
