Accelerating Multi-Block Constrained Optimization Through Learning to Optimize
Ling Liang, Cameron Austin, Haizhao Yang
TL;DR
This work addresses accelerating convergence of multi-block ADMM-type methods by integrating Learning to Optimize (L2O) into the Majorized Proximal Augmented Lagrangian Method (MPALM). It introduces a supervised-learning framework to adaptively select the penalty parameters $\{\sigma_j\}$, preserves the convergence guarantees of MPALM via a symmetric Gauss-Seidel-based proximal update, and demonstrates superior empirical performance on Lasso and discrete optimal transport compared with fixed-parameter MPALM and standard baselines. The approach leverages the majorization $q_\xi(y;y')$ and the SGS decomposition to enable efficient, block-separable updates, while the ERM training tunes hyperparameters to the distribution of problem instances. The resulting method provides a flexible, data-driven pathway to apply convergent MPALM to large-scale, linearly constrained composite problems, with potential impact across sparse regression, transport optimization, and beyond.
Abstract
Learning to Optimize (L2O) approaches, including algorithm unrolling, plug-and-play methods, and hyperparameter learning, have garnered significant attention and have been successfully applied to the Alternating Direction Method of Multipliers (ADMM) and its variants. However, the natural extension of L2O to multi-block ADMM-type methods remains largely unexplored. Such an extension is critical, as multi-block methods leverage the separable structure of optimization problems, offering substantial reductions in per-iteration complexity. Given that classical multi-block ADMM does not guarantee convergence, the Majorized Proximal Augmented Lagrangian Method (MPALM), which shares a similar form with multi-block ADMM and ensures convergence, is more suitable in this setting. Despite its theoretical advantages, MPALM's performance is highly sensitive to the choice of penalty parameters. To address this limitation, we propose a novel L2O approach that adaptively selects this hyperparameter using supervised learning. We demonstrate the versatility and effectiveness of our method by applying it to the Lasso problem and the optimal transport problem. Our numerical results show that the proposed framework outperforms popular alternatives. Given its applicability to generic linearly constrained composite optimization problems, this work opens the door to a wide range of potential real-world applications.
