Tuning-Free Structured Sparse Recovery of Multiple Measurement Vectors using Implicit Regularization
Lakshmi Jayalal, Sheetal Kalyani
TL;DR
The paper tackles the MMV recovery problem with joint row sparsity by introducing a tuning-free approach based on implicit regularization through overparameterization. It reparameterizes the estimate as $\mathbf{X} = (\mathbf{g}^{\odot 2} \mathbf{1}_L) \odot \mathbf{V}$ and optimizes a standard least-squares loss via gradient descent on the factors, promoting row-sparse solutions without explicit regularizers or prior knowledge of sparsity or noise variance. Theoretical results reveal invariant global and row-wise balancedness and a momentum-like dynamics that favor learning rows with larger norms, along with a convergence guarantee to a rank-$K$ trajectory when initialized near the origin. Simulations show the method achieving performance comparable to traditional MMV algorithms while remaining tuning-free, underscoring its practical relevance and bridging implicit-regularization theory with MMV structured recovery.
Abstract
Recovering jointly sparse signals in the multiple measurement vectors (MMV) setting is a fundamental problem in machine learning, but traditional methods like multiple measurement vectors orthogonal matching pursuit (M-OMP) and multiple measurement vectors FOCal Underdetermined System Solver (M-FOCUSS) often require careful parameter tuning or prior knowledge of the sparsity of the signal and/or noise variance. We introduce a novel tuning-free framework that leverages Implicit Regularization (IR) from overparameterization to overcome this limitation. Our approach reparameterizes the estimation matrix into factors that decouple the shared row-support from individual vector entries. We show that the optimization dynamics inherently promote the desired row-sparse structure by applying gradient descent to a standard least-squares objective on these factors. We prove that with a sufficiently small and balanced initialization, the optimization dynamics exhibit a "momentum-like" effect, causing the norms of rows in the true support to grow significantly faster than others. This formally guarantees that the solution trajectory converges towards an idealized row-sparse solution. Additionally, empirical results demonstrate that our approach achieves performance comparable to established methods without requiring any prior information or tuning.
