Extrapolated Hard Thresholding Algorithms with Finite Length for Composite $\ell_0$ Penalized Problems

Fan Wu; Jiazhen Wei; Wei Bian

Extrapolated Hard Thresholding Algorithms with Finite Length for Composite $\ell_0$ Penalized Problems

Fan Wu, Jiazhen Wei, Wei Bian

TL;DR

The paper tackles sparse optimization with a composite $\ell_0$ penalty involving the Heaviside function by formulating $F(x)=f(x)+\lambda_1\|x_+\|_0+\lambda_2\|x_-\|_0$ over a box constraint set $\Omega$, where $f$ is convex with a Lipschitz gradient. It develops an extrapolated hard-thresholding algorithm that discretizes an inertial gradient system augmented with dry friction and Hessian-driven damping, and proves a finite-length trajectory $\sum_k\|x^{k+1}-x^k\|<\infty$ for $\epsilon>0$ without using the Kurdyka-Łojasiewicz property, with convergence to an $\epsilon$-local minimizer; for $\epsilon=0$, accumulation points are local minimizers. The authors provide equivalent local-minimizer characterizations for both $\lambda_2>0$ and $\lambda_2=0$, analyze perturbations and errors showing robustness, and validate the method through numerical experiments that demonstrate improved speed and sparsity recovery compared to existing approaches. The work offers a scalable, robust framework for nonconvex, nonsmooth $\ell_0$-penalized problems with practical relevance in sparse modeling.

Abstract

For a class of sparse optimization problems with the penalty function of $\|(\cdot)_+\|_0$, we first characterize its local minimizers and then propose an extrapolated hard thresholding algorithm to solve such problems. We show that the iterates generated by the proposed algorithm with $ε>0$ (where $ε$ is the dry friction coefficient) have finite length, without relying on the Kurdyka-Łojasiewicz inequality. Furthermore, we demonstrate that the algorithm converges to an $ε$-local minimizer of this problem. For the special case that $ε=0$, we establish that any accumulation point of the iterates is a local minimizer of the problem. Additionally, we analyze the convergence when an error term is present in the algorithm, showing that the algorithm still converges in the same manner as before, provided that the errors asymptotically approach zero. Finally, we conduct numerical experiments to verify the theoretical results of the proposed algorithm.

Extrapolated Hard Thresholding Algorithms with Finite Length for Composite $\ell_0$ Penalized Problems

TL;DR

The paper tackles sparse optimization with a composite

penalty involving the Heaviside function by formulating

over a box constraint set

, where

is convex with a Lipschitz gradient. It develops an extrapolated hard-thresholding algorithm that discretizes an inertial gradient system augmented with dry friction and Hessian-driven damping, and proves a finite-length trajectory

for

without using the Kurdyka-Łojasiewicz property, with convergence to an

-local minimizer; for

, accumulation points are local minimizers. The authors provide equivalent local-minimizer characterizations for both

and

, analyze perturbations and errors showing robustness, and validate the method through numerical experiments that demonstrate improved speed and sparsity recovery compared to existing approaches. The work offers a scalable, robust framework for nonconvex, nonsmooth

-penalized problems with practical relevance in sparse modeling.

Abstract

For a class of sparse optimization problems with the penalty function of

, we first characterize its local minimizers and then propose an extrapolated hard thresholding algorithm to solve such problems. We show that the iterates generated by the proposed algorithm with

(where

is the dry friction coefficient) have finite length, without relying on the Kurdyka-Łojasiewicz inequality. Furthermore, we demonstrate that the algorithm converges to an

-local minimizer of this problem. For the special case that

, we establish that any accumulation point of the iterates is a local minimizer of the problem. Additionally, we analyze the convergence when an error term is present in the algorithm, showing that the algorithm still converges in the same manner as before, provided that the errors asymptotically approach zero. Finally, we conduct numerical experiments to verify the theoretical results of the proposed algorithm.

Paper Structure (7 sections, 11 theorems, 103 equations, 8 figures, 2 algorithms)

This paper contains 7 sections, 11 theorems, 103 equations, 8 figures, 2 algorithms.

Introduction
Notations and preliminaries
Characteristics of local minimizers to problem \ref{['pro1']}
Algorithm and its convergence analysis
Errors and perturbations
Numerical experiments
Conclusion

Key Result

Lemma 2.1

Zhou2021Quadratic For any fixed $\tilde{x}\in\mathbb{R}^n$, the limiting subdifferential of function $\|(\cdot)_+\|_0:\mathbb{R}^n\to\mathbb{R}$ at $\tilde{x}$ is

Figures (8)

Figure 1: Numerical results for Example \ref{['exa:5.1']}.
Figure 2: The trajectories of $\{y^k\}$ generated by different algorithms for Example \ref{['exa:5.1']}.
Figure 3: The averages of numerical results for Example \ref{['ex2']} with $m=2000$, $n=400$ and $\hbox{Spar}=20\%$.
Figure 4: Convergence of $\{\sum\|x^{k+1}-x^k\|\}$ and the performance profiles of algorithms for Example \ref{['ex2']}.
Figure 5: The relative errors and sparsity regression rates for Example \ref{['ex2']} with $m=3000$, $n=600$ and $\hbox{Spar}=10\%$.
...and 3 more figures

Theorems & Definitions (27)

Definition 1
Lemma 2.1
Lemma 2.2
Lemma 2.3
Proposition 3.1
proof
Definition 2
Proposition 3.2
proof
Remark 1
...and 17 more

Extrapolated Hard Thresholding Algorithms with Finite Length for Composite $\ell_0$ Penalized Problems

TL;DR

Abstract

Extrapolated Hard Thresholding Algorithms with Finite Length for Composite $\ell_0$ Penalized Problems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (27)