Pivotal Auto-Encoder via Self-Normalizing ReLU
Nelson Goldenstein, Jeremias Sulam, Yaniv Romano
TL;DR
The paper addresses the lack of robustness of sparse auto-encoders to unknown test-time noise by formulating a transform-learning objective with a square-root loss, yielding a noise-level invariant encoder. It introduces Self Normalizing ReLU (NeLU), an iterative proximal-gradient solver unrolled into a trainable network that learns $W$ and $\lambda$ end-to-end. The authors prove support recovery and estimation bounds under bounded and Gaussian noise, and demonstrate through synthetic and natural image denoising that the NeLU architecture offers superior stability across unseen noise levels. This approach provides a principled path to noise-robust, transform-based auto-encoders and points toward deeper architectures and CNN extensions for broader applications.
Abstract
Sparse auto-encoders are useful for extracting low-dimensional representations from high-dimensional data. However, their performance degrades sharply when the input noise at test time differs from the noise employed during training. This limitation hinders the applicability of auto-encoders in real-world scenarios where the level of noise in the input is unpredictable. In this paper, we formalize single hidden layer sparse auto-encoders as a transform learning problem. Leveraging the transform modeling interpretation, we propose an optimization problem that leads to a predictive model invariant to the noise level at test time. In other words, the same pre-trained model is able to generalize to different noise levels. The proposed optimization algorithm, derived from the square root lasso, is translated into a new, computationally efficient auto-encoding architecture. After proving that our new method is invariant to the noise level, we evaluate our approach by training networks using the proposed architecture for denoising tasks. Our experimental results demonstrate that the trained models yield a significant improvement in stability against varying types of noise compared to commonly used architectures.
