Modified limited memory BFGS with displacement aggregation and its application to the largest eigenvalue problem

Manish Kumar Sahu; Suvendu Ranjan Pattanaik

Modified limited memory BFGS with displacement aggregation and its application to the largest eigenvalue problem

Manish Kumar Sahu, Suvendu Ranjan Pattanaik

TL;DR

This work introduces AggMBFGS, a modified limited-memory quasi-Newton method that uses displacement aggregation to refine curvature information in nonconvex optimization, achieving per-iteration complexity of $O(\tau d)$ and memory of $O(\tau d)$. It establishes global convergence under a backtracking Modified Armijo line search and proves local superlinear convergence, improving theoretical guarantees over standard M-LBFGS. Empirical tests on CUTEst problems show AggMBFGS reduces iterations and function evaluations compared to M-LBFGS, and it efficiently handles large-scale eigenvalue computations by minimizing $f(x)=\frac{\|x\|^4}{4}-\frac{x^T A x}{2}$ to estimate the largest eigenvalue with lower relative error than baselines. The results suggest AggMBFGS as a practical and scalable tool for large-scale nonconvex optimization and eigenvalue problems, while leaving open questions about worst-case complexity bounds.

Abstract

We present a modified limited memory BFGS method with displacement aggregation (AggMBFGS) for solving nonconvex optimization problems. AggMBFGS refines curvature pair updates by removing linearly dependent variable variations, ensuring that the inverse Hessian approximation retains essential curvature properties. As a result, its per iteration complexity and storage requirement is $\mathcal{O}(τd)$ where $τ\leq d$ represents the memory size and $d$ is the problem dimension. We establish the global convergence of both M-LBFGS and AggMBFGS under a backtracking modified Armijo line search (MALS) and prove the local superlinear convergence of AggMBFGS, demonstrating its theoretical advantages over M-LBFGS with the classical Armijo line search~\cite{Shi2016ALM}. Numerical experiments on CUTEst test problems~\cite{gould2015cutest} confirm that AggMBFGS outperforms M-LBFGS in reducing the number of iterations and function evaluations. Additionally, we apply AggMBFGS to compute the largest eigenvalue of high-dimensional real symmetric positive definite matrices, achieving lower relative errors than M-LBFGS~\cite{Shi2016ALM} while maintaining computational efficiency. These results suggest that AggMBFGS is a promising alternative for large-scale nonconvex optimization and eigenvalue computation.

Modified limited memory BFGS with displacement aggregation and its application to the largest eigenvalue problem

TL;DR

and memory of

. It establishes global convergence under a backtracking Modified Armijo line search and proves local superlinear convergence, improving theoretical guarantees over standard M-LBFGS. Empirical tests on CUTEst problems show AggMBFGS reduces iterations and function evaluations compared to M-LBFGS, and it efficiently handles large-scale eigenvalue computations by minimizing

to estimate the largest eigenvalue with lower relative error than baselines. The results suggest AggMBFGS as a practical and scalable tool for large-scale nonconvex optimization and eigenvalue problems, while leaving open questions about worst-case complexity bounds.

Abstract

where

represents the memory size and

is the problem dimension. We establish the global convergence of both M-LBFGS and AggMBFGS under a backtracking modified Armijo line search (MALS) and prove the local superlinear convergence of AggMBFGS, demonstrating its theoretical advantages over M-LBFGS with the classical Armijo line search~\cite{Shi2016ALM}. Numerical experiments on CUTEst test problems~\cite{gould2015cutest} confirm that AggMBFGS outperforms M-LBFGS in reducing the number of iterations and function evaluations. Additionally, we apply AggMBFGS to compute the largest eigenvalue of high-dimensional real symmetric positive definite matrices, achieving lower relative errors than M-LBFGS~\cite{Shi2016ALM} while maintaining computational efficiency. These results suggest that AggMBFGS is a promising alternative for large-scale nonconvex optimization and eigenvalue computation.

Paper Structure (12 sections, 6 theorems, 25 equations, 5 tables, 3 algorithms)

This paper contains 12 sections, 6 theorems, 25 equations, 5 tables, 3 algorithms.

Introduction
Preliminaries
Background on MBFGS and M-LBFGS
Modified Armijo line search (MALS)
M-LBFGS with displacement aggregation (AggMBFGS) and its convergence
Convergence analysis of AggMBFGS
Computational complexity of AggMBFGS
Comparison between AggMBFGS and M-LBFGS
Application of AggMBFGS in finding the largest Eigenvalue problem
Numerical experiment
Error analysis
Conclusion

Key Result

Theorem 2.1

( Auchmuty1989UnconstrainedVP,Theorem 12) Let $f(x)$ be defined as (420), a twice continuously differentiable, non-convex function, and A is a real symmetric positive definite matrix. Then

Theorems & Definitions (15)

Theorem 2.1
Remark 1
Remark 2
Proposition 3.1
proof
Remark 3
Proposition 3.2
proof
Theorem 3.3
proof
...and 5 more

Modified limited memory BFGS with displacement aggregation and its application to the largest eigenvalue problem

TL;DR

Abstract

Modified limited memory BFGS with displacement aggregation and its application to the largest eigenvalue problem

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (15)