Tight Error Bounds for the Sign-Constrained Stiefel Manifold
Xiaojun Chen, Yifan He, Zaikun Zhang
TL;DR
This work derives explicit global and local error bounds that relate the distance to sign-constrained Stiefel manifolds \\mathbb{S}^{n,r}_{S} to computable residuals, with constants \\nu and exponents \\tfrac{1}{2} or 1, and shows these bounds are tight in the regime 1 < r < n. It extends the nonnegative Stiefel results to general sign constraints, including special-case and general-case bounds, and establishes linear regularity characterizations. Leveraging these bounds, the authors develop exact penalty formulations for minimizing Lipschitz continuous objectives under orthogonality and sign constraints, detailing precise thresholds for penalty exponents and parameters. They validate the theory with synthetic experiments and Yale face data, demonstrating improved reconstruction quality when using the sign-constrained penalties. Overall, the results provide a rigorous, dimension-free toolkit for penalty methods and error analysis in constrained Stiefel-manifold optimization with signs.
Abstract
The sign-constrained Stiefel manifold in $\mathbb{R}^{n\times r}$ is a segment of the Stiefel manifold with fixed signs (nonnegative or nonpositive) for some columns of the matrices. It includes the nonnegative Stiefel manifold as a special case. We present global and local error bounds that provide an inequality with easily computable residual functions and explicit coefficients to bound the distance from matrices in $\mathbb{R}^{n\times r}$ to the sign-constrained Stiefel manifold. Moreover, we show that the error bounds cannot be improved except for the multiplicative constants under some mild conditions, which explains why two square-root terms are necessary in the bounds when $1< r <n$ and why the $\ell_1$ norm can be used in the bounds when $r = n$ or $r = 1$ for the sign constraints and orthogonality, respectively. The error bounds are applied to derive exact penalty methods for minimizing a Lipschitz continuous function with orthogonality and sign constraints.
