Table of Contents
Fetching ...

Optimal Preconditioning is a Geodesically Convex Optimization Problem

M. Levent Doğan, Alperen Ergür, Elias Tsigaridas

TL;DR

The paper presents a unified, geometry-aware approach to preconditioning by recasting condition-number reduction as a geodesically convex optimization over matrix-group actions. It demonstrates that minimizing log condition numbers with respect to unitarily invariant norms yields globally convergent first-order algorithms on symmetric Lie groups, with explicit gradient expressions and convergence guarantees. The framework encompasses both linear and nonlinear (polynomial) systems, providing Frobenius-norm surrogates, shuffling-and-change-of-variables techniques, and clear pathways to practical preconditioners such as diagonal and block-diagonal forms. Empirical results on sparse and dense matrices validate substantial conditioning improvements and show strong correlations between Frobenius-based improvements and Euclidean conditioning, highlighting practical impact for accelerated solvers and robust polynomial system solving.

Abstract

We introduce a unified framework for computing approximately-optimal preconditioners for solving linear and non-linear systems of equations. We demonstrate that the condition number minimization problem, under structured transformations such as diagonal and block-diagonal preconditioners, is geodesically convex with respect to unitarily invariant norms, including the Frobenius and Bombieri--Weyl norms. This allows us to introduce efficient first-order algorithms with precise convergence guarantees. For linear systems, we analyze the action of symmetric Lie subgroups $G \subseteq \GL_m(\CC) \times \GL_n(\CC)$ on the input matrix and prove that the logarithm of the condition number is a smooth geodesically convex function on the associated Riemannian quotient manifold. We obtain explicit gradient formulas, show Lipschitz continuity, and prove convergence rates for computing the optimal Frobenius condition number: $\widetilde{O}(1/\eps^2)$ iterations for general two-sided preconditioners and $\widetilde{O}(κ_F^2 \log(1/\eps))$ for strongly convex cases such as left preconditioning. We extend our framework to consider preconditioning of polynomial systems $\f(x) = 0$, where $\f$ is a system of multivariate polynomials. We analyze the local condition number $μ(\f, ξ)$, at a root $ξ$ and prove that it also admits a geodesically convex formulation under appropriate group actions. We deduce explicit formulas for the Riemannian gradients and present convergence bounds for the corresponding optimization algorithms. To the best of our knowledge, this is the first preconditioning algorithm with theoretical guarantees for polynomial systems.

Optimal Preconditioning is a Geodesically Convex Optimization Problem

TL;DR

The paper presents a unified, geometry-aware approach to preconditioning by recasting condition-number reduction as a geodesically convex optimization over matrix-group actions. It demonstrates that minimizing log condition numbers with respect to unitarily invariant norms yields globally convergent first-order algorithms on symmetric Lie groups, with explicit gradient expressions and convergence guarantees. The framework encompasses both linear and nonlinear (polynomial) systems, providing Frobenius-norm surrogates, shuffling-and-change-of-variables techniques, and clear pathways to practical preconditioners such as diagonal and block-diagonal forms. Empirical results on sparse and dense matrices validate substantial conditioning improvements and show strong correlations between Frobenius-based improvements and Euclidean conditioning, highlighting practical impact for accelerated solvers and robust polynomial system solving.

Abstract

We introduce a unified framework for computing approximately-optimal preconditioners for solving linear and non-linear systems of equations. We demonstrate that the condition number minimization problem, under structured transformations such as diagonal and block-diagonal preconditioners, is geodesically convex with respect to unitarily invariant norms, including the Frobenius and Bombieri--Weyl norms. This allows us to introduce efficient first-order algorithms with precise convergence guarantees. For linear systems, we analyze the action of symmetric Lie subgroups on the input matrix and prove that the logarithm of the condition number is a smooth geodesically convex function on the associated Riemannian quotient manifold. We obtain explicit gradient formulas, show Lipschitz continuity, and prove convergence rates for computing the optimal Frobenius condition number: iterations for general two-sided preconditioners and for strongly convex cases such as left preconditioning. We extend our framework to consider preconditioning of polynomial systems , where is a system of multivariate polynomials. We analyze the local condition number , at a root and prove that it also admits a geodesically convex formulation under appropriate group actions. We deduce explicit formulas for the Riemannian gradients and present convergence bounds for the corresponding optimization algorithms. To the best of our knowledge, this is the first preconditioning algorithm with theoretical guarantees for polynomial systems.

Paper Structure

This paper contains 30 sections, 40 theorems, 183 equations, 5 figures, 2 algorithms.

Key Result

Theorem 2.2

Suppose $m,n\geq 1$ and $G\subseteq \mathop{\mathrm{GL}}\nolimits_m(\mathbb{C})\times\mathop{\mathrm{GL}}\nolimits_n(\mathbb{C})$ is a symmetric Lie subgroup with maximal compact subgroup $K\coloneqq G\cap (\mathrm{U}_m(\mathbb{C})\times \mathrm{U}_n(\mathbb{C}))$. Furthermore, assume that $\Vert\cd

Figures (5)

  • Figure 1: The figure shows the improvement (defined as $\kappa_F(A)/\kappa_F^\star$) of the Frobenius condition number for the block diagonal preconditioner with $5\times 5$ blocks (blue) and for the diagonal preconditioner (green) for a collection of $425$SuiteSparse matrices. On average, the block diagonal preconditioner with $5\times 5$-blocks improve the Frobenius condition number by $\approx 1602$ whereas the diagonal preconditioner improves it by $\approx 601$. The red plot shows the ratio of the improvement of the block-diagonal preconditioner vs. diagonal preconditioner. On average, the improvement of the optimal block-diagonal preconditioner is $\approx 2.43$ times the improvement of the optimal diagonal preconditioner.
  • Figure 2: The figure illustrates the improvement in the Frobenius condition number, defined as $\kappa_F(A)/\kappa_F^\star$, for block-diagonal preconditioning with $5 \times 5$ blocks (blue) and diagonal preconditioning (green), applied to $500 \times 500$ matrices with i.i.d. standard normal entries. On average, the block-diagonal preconditioner achieves a reduction factor of $1.6$, compared to $1.16$ for the diagonal case. The red curve shows the ratio of these improvements, quantifying the relative advantage of block-diagonal preconditioning.
  • Figure 3: The correlation between the improvement on the Frobenius and the Euclidean condition number.
  • Figure 4: The same as \ref{['fig:Gauss-Frobenius-BlockvsDiag']}, for the SuiteSparse dataset.
  • Figure 5: The correlation between the improvement on the Frobenius and the Euclidean condition number for the SuiteSparse dataset.

Theorems & Definitions (90)

  • Example 1
  • Example 2
  • Definition 2.1
  • Theorem 2.2
  • Definition 2.3
  • Lemma 2.4
  • Theorem 2.5: Left preconditioners
  • Theorem 2.6
  • Theorem 2.7
  • Remark 1
  • ...and 80 more