Learning Two-layer Neural Networks with Symmetric Inputs
Rong Ge, Rohith Kuditipudi, Zhize Li, Xiang Wang
TL;DR
This work tackles the theory of learning two-layer ReLU networks under symmetric input distributions, a setting where prior results struggle with nonconvex optimization. It develops a method-of-moments framework combined with tensor-decomposition techniques (inspired by FOOBI) and a distinguishing-matrix construct to identify and recover the first-layer weights W and second-layer weights A. A key innovation is reducing two-layer learning to multiple single-layer problems via a Pure Neuron Detector and a linearization of higher-order moments, enabling polynomial-time recovery under nondegeneracy and smoothed-analysis guarantees. Empirical results demonstrate strong sample efficiency and robustness to noise and conditioning across diverse symmetric inputs, highlighting the practical potential of spectral methods for structured, non-Gaussian data. The work opens avenues for broader input-distribution classes and deeper connections between optimization landscapes and identifiability in neural architectures.
Abstract
We give a new algorithm for learning a two-layer neural network under a general class of input distributions. Assuming there is a ground-truth two-layer network $$ y = A σ(Wx) + ξ, $$ where $A,W$ are weight matrices, $ξ$ represents noise, and the number of neurons in the hidden layer is no larger than the input or output, our algorithm is guaranteed to recover the parameters $A,W$ of the ground-truth network. The only requirement on the input $x$ is that it is symmetric, which still allows highly complicated and structured input. Our algorithm is based on the method-of-moments framework and extends several results in tensor decompositions. We use spectral algorithms to avoid the complicated non-convex optimization in learning neural networks. Experiments show that our algorithm can robustly learn the ground-truth neural network with a small number of samples for many symmetric input distributions.
