Straight-Through meets Sparse Recovery: the Support Exploration Algorithm
Mimoun Mohamed, François Malgouyres, Valentin Emiya, Caroline Chaux
TL;DR
This work repurposes the straight-through estimator (STE) from quantized neural networks to the sparse support recovery problem, formulating a sparsification-based objective $F(H(\mathcal{X}))$ and introducing the Support Exploration Algorithm (SEA). SEA maintains a dense exploration vector $\mathcal{X}$, selects a $k$-sparse support via $S^t=\text{largest}_k(\mathcal{X}^t)$, and updates $\mathcal{X}$ through an STE-inspired gradient, enabling broader exploration of candidate supports than traditional greedy methods. The authors establish RIP-based recovery guarantees (Recovery-RIP and related corollaries) showing that SEA can recover the true support under certain incoherence/noise conditions, and they demonstrate substantial empirical gains in coherent settings (e.g., spike deconvolution) where standard methods falter. The results highlight SEA’s potential as both a standalone sparse-recovery method and a post-processing step to improve existing solvers, with practical implications for real-world inverse problems and potential extensions to neural-network sparsification contexts.
Abstract
The {\it straight-through estimator} (STE) is commonly used to optimize quantized neural networks, yet its contexts of effective performance are still unclear despite empirical successes.To make a step forward in this comprehension, we apply STE to a well-understood problem: {\it sparse support recovery}. We introduce the {\it Support Exploration Algorithm} (SEA), a novel algorithm promoting sparsity, and we analyze its performance in support recovery (a.k.a. model selection) problems. SEA explores more supports than the state-of-the-art, leading to superior performance in experiments, especially when the columns of $A$ are strongly coherent.The theoretical analysis considers recovery guarantees when the linear measurements matrix $A$ satisfies the {\it Restricted Isometry Property} (RIP).The sufficient conditions of recovery are comparable but more stringent than those of the state-of-the-art in sparse support recovery. Their significance lies mainly in their applicability to an instance of the STE.
