Make Optimization Once and for All with Fine-grained Guidance
Mingjia Shi, Ruihan Lin, Xuxi Chen, Yuhao Zhou, Zezhen Ding, Pingzhi Li, Tong Wang, Kai Wang, Zhangyang Wang, Jiheng Zhang, Tianlong Chen
TL;DR
Diff-L2O addresses the inefficiency and limited generalization of traditional L2O by modeling the solution space with diffusion processes and incorporating guidance from optimization meta-features. It presents a unified theoretical framework including ISHD ODE dynamics, SDEs, and diffusion-based sampling, plus a PAC-Bayesian style generalization bound linking sample diversity to performance. Empirically, Diff-L2O achieves rapid initialization and faster convergence on LASSO, Rastrigin, Ackley, and MNIST, with minute-level training and strong compatibility with conventional optimizers via a hybrid approach. The work highlights the practical impact of broad solution-space exploration for robust optimization across diverse tasks.
Abstract
Learning to Optimize (L2O) enhances optimization efficiency with integrated neural networks. L2O paradigms achieve great outcomes, e.g., refitting optimizer, generating unseen solutions iteratively or directly. However, conventional L2O methods require intricate design and rely on specific optimization processes, limiting scalability and generalization. Our analyses explore general framework for learning optimization, called Diff-L2O, focusing on augmenting sampled solutions from a wider view rather than local updates in real optimization process only. Meanwhile, we give the related generalization bound, showing that the sample diversity of Diff-L2O brings better performance. This bound can be simply applied to other fields, discussing diversity, mean-variance, and different tasks. Diff-L2O's strong compatibility is empirically verified with only minute-level training, comparing with other hour-levels.
