Table of Contents
Fetching ...

Quantitative Convergence of Quadratically Regularized Linear Programs

Alberto González-Sanz, Marcel Nutz

TL;DR

The paper provides a quantitative analysis of quadratically regularized linear programs over polytopes, showing that the regularized solution $x^{\eta}$ converges stationarily to the minimum-norm LP solution $x^*$ and giving an explicit threshold $\eta^*$ beyond which $x^{\eta}=x^*$. It derives a precise formula for $\eta^*$ and a slope bound for the suboptimality $\mathcal{E}(\eta)$ on the approach to $\eta^*$, and also establishes a linear rate $\|x^{\eta}-x^0\|\le \tfrac{1}{2}\|c\|\eta$ as $\eta\to0$. The framework is applied to optimal transport, where the problem reduces to a quadratic-regularized LP on the Birkhoff polytope, yielding explicit $\eta^*$ expressions and slope bounds that scale with the number of data points $N$, along with corollaries for separated-cost structures and symmetric costs. The results illuminate when quadratic regularization recovers exact LP solutions and quantify suboptimality and convergence rates, with practical implications for sparse OT couplings and data-driven transport problems.

Abstract

Linear programs with quadratic regularization are attracting renewed interest due to their applications in optimal transport: unlike entropic regularization, the squared-norm penalty gives rise to sparse approximations of optimal transport couplings. It is well known that the solution of a quadratically regularized linear program over any polytope converges stationarily to the minimal-norm solution of the linear program when the regularization parameter tends to zero. However, that result is merely qualitative. Our main result quantifies the convergence by specifying the exact threshold for the regularization parameter, after which the regularized solution also solves the linear program. Moreover, we bound the suboptimality of the regularized solution before the threshold. These results are complemented by a convergence rate for the regime of large regularization. We apply our general results to the setting of optimal transport, where we shed light on how the threshold and suboptimality depend on the number of data points.

Quantitative Convergence of Quadratically Regularized Linear Programs

TL;DR

The paper provides a quantitative analysis of quadratically regularized linear programs over polytopes, showing that the regularized solution converges stationarily to the minimum-norm LP solution and giving an explicit threshold beyond which . It derives a precise formula for and a slope bound for the suboptimality on the approach to , and also establishes a linear rate as . The framework is applied to optimal transport, where the problem reduces to a quadratic-regularized LP on the Birkhoff polytope, yielding explicit expressions and slope bounds that scale with the number of data points , along with corollaries for separated-cost structures and symmetric costs. The results illuminate when quadratic regularization recovers exact LP solutions and quantify suboptimality and convergence rates, with practical implications for sparse OT couplings and data-driven transport problems.

Abstract

Linear programs with quadratic regularization are attracting renewed interest due to their applications in optimal transport: unlike entropic regularization, the squared-norm penalty gives rise to sparse approximations of optimal transport couplings. It is well known that the solution of a quadratically regularized linear program over any polytope converges stationarily to the minimal-norm solution of the linear program when the regularization parameter tends to zero. However, that result is merely qualitative. Our main result quantifies the convergence by specifying the exact threshold for the regularization parameter, after which the regularized solution also solves the linear program. Moreover, we bound the suboptimality of the regularized solution before the threshold. These results are complemented by a convergence rate for the regime of large regularization. We apply our general results to the setting of optimal transport, where we shed light on how the threshold and suboptimality depend on the number of data points.
Paper Structure (5 sections, 8 theorems, 62 equations, 3 figures)

This paper contains 5 sections, 8 theorems, 62 equations, 3 figures.

Key Result

Lemma 2.1

Given $\eta>0$, QLP admits a unique minimizer $\mathbf{x}^{\eta}$. It is characterized as the unique $\mathbf{x}^\eta\in \mathcal{P}$ such that In particular, if $\mathbf{x}^\eta\in {\rm ri}(C)$ for some convex set $C\subset\mathcal{P}$, then also

Figures (3)

  • Figure 1: Suboptimality $\mathcal{E}(\eta)$ of \ref{['QOT']} when $\mu = \nu =\frac{1}{3} \sum_{i=1}^{3} \delta_{i/3}$ and $c(x,y) = \|x-y\|^2$. Theorem \ref{['th:main']} characterizes the location of $\eta^{*}$ and bounds the slope to the left of $\eta^{*}$.
  • Figure 2: The minimizer $\mathbf{x}^{\eta}$ of \ref{['QLP']} is the projection of $-\eta \mathbf{c}/2$ onto $\mathcal{P}$. The curve $\eta\mapsto \mathbf{x}^{\eta}$ is piecewise affine and converges stationarily to a point $\mathbf{x}^{*}$; i.e., $\mathbf{x}^{\eta}=\mathbf{x}^{*}$ for all $\eta\geq \eta^{*}$.
  • Figure 3: Accuracy of the bound \ref{['upperBoundforExample']}. Plot of $N \mapsto \lim_{\eta\to \eta^*} \frac{\int c(x, y) d\gamma^{\eta}(x, y)- \int c(x, y) d\gamma^{*}(x, y)}{\eta^*-\eta}$ (blue) and the upper bound $N \mapsto \frac{N-1}{N^6}$ (red) in double logarithmic scale.

Theorems & Definitions (19)

  • Lemma 2.1
  • Lemma 2.2
  • Remark 2.3
  • Lemma 2.4
  • Theorem 2.5
  • Corollary 2.6
  • Proposition 2.7
  • Remark 2.8
  • Remark 2.9
  • Proposition 3.1
  • ...and 9 more