Conjugate Direction Methods Under Inconsistent Systems
Alexander Lim, Yang Liu, Fred Roosta
TL;DR
The paper studies CG and CR for symmetric inconsistent systems, formalizing difficulties in recovering normal solutions and introducing CG_pis to obtain the pseudo-inverse solution x^+ = A^+ b. It shows that CR can converge to a normal solution under no unlucky breakdown and is essentially equivalent to MINRES in this setting, with a simple CR_pis variant to reach the pseudo-inverse solution. A unifying framework clarifies that many properties of CD methods extend beyond the PD case, and extensive numerical experiments (synthetic and real-world) reveal that CG can be catastrophically unstable under inconsistency while CR and MINRES remain comparatively stable and effective. The findings have practical impact for choosing solvers in applications such as PDEs, image processing, and kernel methods where inconsistency or near-singularity arises.
Abstract
Since the development of the conjugate gradient (CG) method in 1952 by Hestenes and Stiefel, CG, has become an indispensable tool in computational mathematics for solving positive definite linear systems. On the other hand, the conjugate residual (CR) method, closely related CG and introduced by Stiefel in 1955 for the same settings, remains relatively less known outside the numerical linear algebra community. Since their inception, these methods -- henceforth collectively referred to as conjugate direction methods -- have been extended beyond positive definite to indefinite, albeit consistent, settings. Going one step further, in this paper, we investigate the theoretical and empirical properties of these methods under inconsistent systems. Among other things, we show that small modifications to the original algorithms allow for the pseudo-inverse solution. Furthermore, we show that CR is essentially equivalent to the minimum residual method, proposed by Paige and Saunders in 1975, in such contexts. Lastly, we conduct a series of numerical experiments to shed lights on their numerical stability (or lack thereof) and their performance for inconsistent systems. Surprisingly, we will demonstrate that, unlike CR and contrary to popular belief, CG can exhibit significant numerical instability, bordering on catastrophe in some instances.
