An NEPv Approach for Feature Selection via Orthogonal OCCA with the (2,1)-norm Regularization
Li Wang, Lei-Hong Zhang, Ren-Cang Li
TL;DR
This work addresses supervised feature selection in high-dimensional settings by introducing OCCA21, an orthogonal canonical correlation analysis model regularized with the $\|P\|_{2,1}$ norm to induce row sparsity. The authors formulate the optimization on the Stiefel manifold and solve it via a nonlinear eigenvalue problem (NEPv) framework, yielding the OCCA-FS feature selector. They show an exact connection to the OLSR-OS model with an optimally eliminated scaling parameter, and develop a practical NEPv approach with a perturbed objective $f_{\varepsilon_0}$ and a LOCG-accelerated solver to ensure monotone convergence and scalability. Extensive experiments across six real datasets demonstrate superior classification performance and robust, globally convergent optimization compared to existing methods such as PEB-FS, while highlighting and correcting optimization issues in prior alternating schemes. The proposed method provides a scalable, theoretically grounded pipeline for high-dimensional feature ranking and selection with tangible improvements in predictive accuracy.
Abstract
A novel feature selection model via orthogonal canonical correlation analysis with the $(2,1)$-norm regularization is proposed, and the model is solved by a practical NEPv approach (nonlinear eigenvalue problem with eigenvector dependency), yielding a feature selection method named OCCA-FS. It is proved that OCCA-FS always produces a sequence of approximations with monotonic objective values and is globally convergent. Extensive numerical experiments are performed to compare OCCA-FS against existing feature selection methods. The numerical results demonstrate that OCCA-FS produces superior classification performance and often comes out on the top among all feature selection methods in comparison.
