Multiple Scale Methods For Optimization Of Discretized Continuous Functions
Nicholas J. E. Richardson, Noah Marusenko, Michael P. Friedlander
TL;DR
This work addresses optimization over spaces of Lipschitz functions by introducing a multiscale framework that solves discretized problems at progressively finer grids and uses interpolation to warm-start subsequent scales. Two variants, greedy and lazy, provide convergence guarantees and extend to any base algorithm with fixed-rate iterate convergence, while incorporating constraint scaling to preserve feasibility across scales. Theoretical results bound interpolation errors and relate discrete solutions to the continuous problem, showing that multiscale optimization can outperform single-scale projected gradient descent in terms of both convergence speed and computational cost. Empirical results on density demixing tasks—including synthetic and real geological data—demonstrate substantial speedups and memory savings, highlighting practical impact for large-scale, discretized continuous optimization problems.
Abstract
A multiscale optimization framework for problems over a space of Lipschitz continuous functions is developed. The method solves a coarse-grid discretization followed by linear interpolation to warm-start project gradient descent on progressively finer grids. Greedy and lazy variants are analyzed and convergence guarantees are derived that show the multiscale approach achieves provably tighter error bounds at lower computational cost than single-scale optimization. The analysis extends to any base algorithm with iterate convergence at a fixed rate. Constraint modification techniques preserve feasibility across scales. Numerical experiments on probability density estimation problems, including geological data, demonstrate speedups of an order of magnitude or better.
