Distributed Consensus Optimization with Consensus ALADIN
Xu Du, Jingzhe Wang
TL;DR
This work addresses distributed consensus optimization (DC) by introducing Consensus ALADIN, which converts the original ALADIN coupled QP into a consensus QP with a global variable $z$ to coordinate across agents. Building on this, the authors develop two efficient variants: BFGS Consensus ALADIN, which uses a BFGS Hessian approximation to reduce communication and computation, and Reduced Consensus ALADIN, which further lowers cost by substituting a scaled identity for the Hessian and exploiting a closed-form $z$ update. The paper establishes global convergence for convex DC and local linear convergence for non-convex DC, and demonstrates effectiveness on a non-convex sensor allocation problem, where RCA outperforms Consensus ADMM and BFGS CAALADIN achieves the best performance. The results suggest that Consensus ALADIN offers scalable, communication-efficient, and theoretically grounded solutions for large-scale distributed optimization tasks in applications like sensor networks and federated settings.
Abstract
TThe paper proposes the Consensus Augmented Lagrange Alternating Direction Inexact Newton (Consensus ALADIN) algorithm, a novel approach for solving distributed consensus optimization problems (DC). Consensus ALADIN allows each agent to independently solve its own nonlinear programming problem while coordinating with other agents by solving a consensus quadratic programming (QP) problem. Building on this, we propose Broyden-Fletcher-Goldfarb-Shanno (BFGS) Consensus ALADIN, a communication-and-computation-efficient Consensus ALADIN.BFGS Consensus ALADIN improves communication efficiency through BFGS approximation techniques and enhances computational efficiency by deriving a closed form for the consensus QP problem. Additionally, by replacing the BFGS approximation with a scaled identity matrix, we develop Reduced Consensus ALADIN, a more computationally efficient variant. We establish the convergence theory for Consensus ALADIN and demonstrate its effectiveness through application to a non-convex sensor allocation problem.
